Big Data Engineer

Location

United States

Posted

23 days ago

Salary

$120K - $160K / year

Seniority

Mid Level

Job Description

Big Data Engineer

Clear Fracture LLC

Role Description Clear Fracture is building AI-driven data integration systems that enable organizations to connect, transform, and reason over complex data using agentic workflows. Our platform operates across cloud and on-prem environments and is designed to support multi-tenant, production-scale use cases. We are looking for a Data Engineer who operates as a software engineer first, with strong experience in data modeling and data systems. You will play a key role in building the core data layer that powers our agentic platform—designing schemas, implementing data services, and enabling reliable, scalable data flows. In addition to building core data infrastructure, you will also develop real use cases on the platform itself, helping shape how users interact with data. This includes designing data interfaces, abstractions, and tooling that make it easier to understand, model, and work with data across the system. This is not a traditional ETL-only role. You will write production code, design systems, and help define how data is represented, accessed, and understood across the platform. Qualifications - Bachelor’s degree in Computer Science, Engineering, or related field, or equivalent practical experience. - 6+ years of professional experience in software engineering and/or data engineering roles. - Due to the nature of the work, U.S. Citizenship and the ability to obtain a Secret Clearance are required. - Strong programming skills in Python (or similar backend language). - Experience designing and implementing data models for production systems, with advanced knowledge of dimensional modeling topics like slowly changing dimensions and entity relationship diagrams. - Proficiency in SQL and experience with relational databases (e.g., PostgreSQL). - Experience building backend services or APIs that interact with data systems. - Experience designing and operating data pipelines (ETL/ELT). - Familiarity with NoSQL databases and different data storage paradigms. - Experience working with large datasets and performance optimization. - Experience with Docker and containerized development workflows. - Familiarity with Kubernetes-based environments. - Strong understanding of software engineering fundamentals (testing, version control, system design). Requirements - Design and implement logical and physical data models for complex, evolving datasets. - Define schemas and access patterns that support multi-tenant usage and application-level workflows. - Balance normalization, performance, and flexibility across different storage systems. - Partner with product and engineering teams to translate requirements into scalable data designs. - Develop real-world data use cases on top of the platform to validate and extend its capabilities. - Design and build data interfaces and abstractions that help users understand and work with data. - Contribute to systems such as data glossaries, semantic layers, and metadata and schema discovery tools. - Help define how users explore, model, and interact with data within the platform. - Translate complex data structures into intuitive, usable representations. - Build backend services and APIs that expose and operate on data models. - Implement data access layers that are reliable, maintainable, and performant. - Contribute to core application architecture where data and services intersect. - Write clean, testable, production-grade code. - Design and implement pipelines for ingesting, transforming, and validating data. - Support both batch and near-real-time processing workflows. - Build systems that handle structured, semi-structured, and unstructured data. - Enable data flows that support AI-driven and agent-based workflows. - Work with embeddings, context retrieval, and data representations used in modern AI systems. - Help design systems that make data accessible and useful for autonomous agents. - Implement validation, monitoring, and testing for data systems. - Ensure correctness, consistency, and observability of data pipelines and services. - Diagnose and resolve data-related issues in production environments. Benefits - Engineering mindset: You approach data systems as software systems, not just pipelines. - Data intuition: You understand how to model real-world complexity into clear, usable structures. - Product thinking: You care about how users interact with and understand data, not just how it is stored. - Systems thinking: You see how data flows through services, APIs, and AI systems. - Ownership: You take responsibility for the reliability and usability of what you build. - Pragmatism: You balance ideal design with real-world constraints. - Collaboration: You work effectively across engineering disciplines.

Related Categories

Related Job Pages

More Data Engineer Jobs

Empower logo

Director, Data Engineering – Automation

Empower

We are an equal opportunity employer with a commitment to diversity. All individuals, regardless of personal characteristics, are encouraged to apply. All qualified applicants will receive consideration for employment without regard to age, race, color, national origin, ancestry, sex, sexual orientation, gender, gender identity, gender expression, marital status, pregnancy, religion, physical or mental disability, military or veteran status, genetic information, or any other status protected by applicable state or local law.

Data Engineer24 days ago
Full TimeRemoteTeam 10,001+H1B Sponsor

• Lead a team of data engineers transforming data from disparate systems to enable insights and analytics for business stakeholders. • Create technical roadmaps and recommend strategies for data pipelines and integration. • Leverage cloud-based infrastructure to implement scalable, resilient, and efficient data engineering solutions. • Collaborate with data analysts, data scientists, database administrators, cross-functional teams, and business stakeholders to solve problems. • Influence architectural decisions and design patterns across the data platform. • Provide technical leadership across the software development lifecycle, from design to deployment, including hands-on contribution. • Develop project plans, facilitate prioritization timelines, allocate resources, and take ownership of assigned technical projects in a fast-paced environment. • Perform code reviews and ensure data engineers follow best-practice coding standards. • Define and validate test cases to ensure data quality, reliability, and a high level of confidence. • Continuously improve quality, efficiency, and scalability of data pipelines, reducing gaps and inconsistencies.

United States
$138K - $200.1K / year
Job Closed
Full TimeRemoteTeam 1,001-5,000Since 1966H1B No Sponsor

• Own and deliver impactful data products within WSI’s medallion architecture. • Transform raw and conformed data into governed, high-quality datasets for analytics, AI, and operational use. • Design, build, and optimize data solutions on Microsoft Fabric, including pipelines, Lakehouse/Warehouse structures, PySpark notebooks, and semantic models. • Evolve and implement data architecture patterns (medallion, SCD, CDC, orchestration, CI/CD), adapting them to real-world scale, performance, and business needs. • Ensure data quality, observability, and performance at scale. • Implement validation frameworks, monitoring, SLAs, and cost-optimized storage and compute strategies. • Partner with stakeholders to translate requirements into reusable, scalable data models and curated data products. • Drive consolidation of legacy reporting and BI tools into a unified, governed analytics platform. • Embed security, governance, and best practices, including role-based access, cataloging, and release management. • Act as a technical leader, contributing to standards, mentoring peers, and elevating overall engineering quality. • Leverage modern dev tools (e.g., GitHub Copilot or similar) to accelerate delivery and engineering efficiency.

Wisconsin
$125K - $200K / year
ABC Supply Co. Inc. logo

IT Data Warehouse Engineer

ABC Supply Co. Inc.

North America's largest wholesale distributor of roofing and other select exterior and interior building products.

Data Engineer24 days ago
Full TimeRemoteTeam 10,001+H1B No Sponsor

• Develops batch integration solutions for ABC Supply. • This includes traditional DW workloads and nightly large extracts that are scheduled. • Design and Build Data models – star schema, snowflake • Create ADF pipelines to bring new data from various sources • Create Data bricks notebooks for Data transformation • Documents all solutions as needed using ABC standard documentation. • Plans, reviews, and performs the implementation of database changes for integrations/DW work. • Maintain integration documentation and audit tools. • To include developing/updating the integration dashboard. • Work with BI team, PO to build required tables and transform data to load into Snowflake • Provides support for database/database servers as a member of the Data Management team. • Works with project management and business analysis team to provide estimates and ensure documentation of all requirements. • Provide logical layers (database views) for end-user access to data in database systems. • Partners with functional support and help desk teams to ensure communication, collaboration and compliance with support process standards at ABC. • Performs data management tasks as needed.

United States
Job Closed
Kainos logo

Senior Data Engineer

Kainos

Thinking Beyond Limitations

Data Engineer24 days ago
Full TimeRemoteTeam 1,001-5,000H1B No Sponsor

• Responsible for designing and developing data processing and data persistence software components for solutions which handle data at scale • Develop data processing software primarily for deployment in Big Data technologies • Encompasses the full software lifecycle including design, code, test and defect resolution • Work with Architects and Lead Engineers to ensure the software supports non-functional needs • Collaborate with colleagues to resolve implementation challenges and ensure code quality and maintainability remains high • Leads by example in code quality • Work with operations teams to ensure operational readiness • Advise customers and managers on the estimated effort and technical implications of user stories and user journeys • Coach and mentor team members

United Kingdom