Job Closed

This listing is no longer active.

Cummins Inc.

Senior Data Engineer

Data EngineerData EngineerOther Remote SeniorTeam 10,001+Since 1919H1B No SponsorCompany Site LinkedIn

Location

United States

Posted

132 days ago

Salary

$76.8K - $115.2K / year

Seniority

Senior

Bachelor Degree5 yrs expEnglishApache HTTP Server AWS Azure ETL Python Scala Apache Spark SQL Unity

Job Description

• Streamlining Data Integration You’ll design and automate scalable systems to ingest and transform data from diverse sources, ensuring seamless and efficient data flow across the organization. • Safeguarding Data Quality By implementing robust monitoring frameworks, you’ll proactively detect and resolve data integrity issues, maintaining trust in analytics and reporting. • Establishing Data Governance You’ll lead the development of governance processes to manage metadata, access, and retention, ensuring compliance and secure data usage for internal and external stakeholders. • Building Scalable Data Pipelines You’ll architect reliable and high-performance ETL/ELT pipelines with built-in monitoring and alerts, enabling timely and accurate data delivery for business needs. • Optimizing Database Design and Performance Through thoughtful physical data modeling and indexing strategies, you’ll enhance database efficiency and scalability for large-scale operations. • Modernizing Data Infrastructure You’ll develop and operate advanced storage and processing solutions using distributed and cloud platforms, supporting big data initiatives and analytics. • Automating Data Workflows By leveraging modern tools and techniques, you’ll reduce manual data preparation tasks, boosting productivity and minimizing errors. • Mentoring and Agile Collaboration You’ll coach junior team members and contribute to agile practices like DevOps and Scrum, accelerating delivery of critical analytics projects and fostering team growth.

Job Requirements

Minimum of 5 years of hands-on experience in data engineering with expertise in Azure Databricks and programming in Scala or Python.
Proven experience in building and maintaining structured streaming pipelines using Spark.
Strong knowledge of big data technologies, including Delta Lake, Apache Spark, Structured Streaming, and SQL.
Experience with Git for version control and CI/CD pipeline management.
Nice to Have (Preferences): Data Engineering Certification (e.g., Databricks Certified Data Engineer, Apache Spark Professional Data Engineer, or equivalent).
Exposure to real-time data ingestion frameworks and cloud-native data services (e.g., Azure Event Hub, Azure Data Lake, AWS SQS, etc).
Familiarity with data governance, access control (e.g., Unity Catalog or Immuta), and performance monitoring tools in cloud environments.

Related Categories

Data Engineer

Related Job Pages

Remote Python Jobs (US)More Remote Jobs

More Data Engineer Jobs

Principal Data Architect

Unqork

Using CaaS (Codeless-as-a-Service) to accelerate time-to-market & eliminate legacy code for the enterprise 🚀

Data Engineer132 days ago

Other RemoteTeam 201-500Since 2017H1B Sponsor

Company Site LinkedIn

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description We're seeking an experienced, visionary Principal Data Architect and Leader to drive the strategy, design, and operations of our enterprise-grade data architecture and infrastructure. This critical role leads our data functions, ensuring our platform provides a robust, scalable, secure, and highly available foundation for customers developing, deploying, and hosting applications using this robust data layer. The ideal candidate will possess deep expertise in data architecture, cloud-native technologies, and operational excellence, directly impacting our ability to serve thousands of customer applications. Report to our Engineering Manager Architecture & Strategy - Define and own the long-term data architecture strategy for Unqork's core platform, covering data modeling, query design, storage topology, and access patterns across a complex data environment that includes MongoDB as well as integration with Relational and Columnar database models. - Ensure the data layer meets the security, performance, scalability, and resilience requirements of enterprise-grade, mission-critical applications. - Evaluate and recommend the right database technologies, indexing strategies, caching layers, ETL and search infrastructure for each class of workload — including when to use MongoDB Atlas Search, Caching (e.g., Redis), Querying (e.g., Kafka) and Streaming components. - Own the end-to-end solutions around data transfers using solutions like ETL for customers. - Own the data architecture for Unqork's AI-driven development layer — defining persistence, versioning, and query standards for AI-generated configurations. - Lead the design of declarative data models and schemas that enable non-technical users to build complex logic while maintaining strict data integrity. - Define the architectural boundary between database-layer computation (aggregation pipelines, indexing) and application-layer computation (Node.js post-processing, in-memory caching), and establish standards for which work belongs where. - Create and maintain comprehensive documentation including data architecture blueprints, indexing governance policies, query standards, and migration playbooks. - Own capacity planning and cost modeling for data infrastructure resources as Unqork scales. Leadership & Data Operations - Mentor and grow a team of data engineers and database engineers responsible for the health and performance of Unqork's data platform. - Establish data modeling best practices and enforce standardization across all environments — including schema conventions, index lifecycle management, and pagination contract design. - Oversee the design and operation of Unqork's database infrastructure — defining thresholds, coverage policies, write amplification limits, and manual override processes. - Drive data operational excellence by implementing and refining query performance monitoring, slow query alerting, explain plan review processes, and incident response playbooks for database degradation events. - Define and enforce data access governance — including RBAC data model standards, cache TTL policies, and the rules under which eventual consistency is acceptable vs. when strong consistency is required. - Comply with security regulations while working on data designs and patterns for Unqork platform. - Partner with Product to translate product requirements into data model decisions, and identify where relaxing a product constraint unlocks a disproportionate architectural improvement. Qualifications - 10+ years of progressive experience in data architecture, database engineering, or a related field, with at least 3 years in a principal or architect-level role. - Extensive experience designing and managing enterprise-grade, multi-tenant data infrastructure for SaaS platforms. - Expert-level proficiency with MongoDB — including aggregation pipeline design, index strategy (B-tree, text, vector), replica sets, sharding, and query execution plan analysis (IXSCAN vs. COLLSCAN). - Deep, hands-on expertise with our core data technology stack: - MongoDB / MongoDB Atlas (aggregation pipelines, Atlas Search, Atlas Vector Search, sharding) - Relational/SQL Databases (Operational and Business Intelligence schema and query partners) - Redis (caching strategy, TTL design, cache invalidation, pub/sub) - Node.js (application/database boundary, worker threads, event loop awareness) - RBAC and access control data patterns (denormalization, write-time materialization, owner list caching) - AI/ML data infrastructure (semantic search, LLM-friendly schema design, columnar database design) - Proven ability to write architectural decision records that hold up over time — capturing not just the recommendation but the alternatives considered and the conditions under which the decision should be revisited. - Proven ability to lead technical teams, manage complex data migration projects, and influence cross-functional stakeholders including Product and Engineering leadership. - Strong understanding of data security principles, multi-tenant isolation patterns, and enterprise compliance requirements (SOC 2, ISO 27001). Benefits - 💻 Work from home with a remote-first community - 🏝 Unlimited PTO (and the encouragement to use it) - 📝 Student loan payback program - 🏥 100% employer-covered medical, dental, and vision options available to you and your dependents - 💸 Flexible Spending Account (FSA) - 🏠 Monthly stipend toward your WFH setup, vacation, development and more - 💰 Employer-sponsored 401(k) with contribution match - 🏋🏻‍♀️ Subsidized ClassPass Membership - 🍼 Generous Paid Parental Leave Hiring Ranges - Tier 1: $229,000 - $286,200 - Tier 2: $215,100 - $268,900 Company Description Unqork embraces a culture of security and privacy awareness by consistently safeguarding sensitive information, adhering to company policies, and actively participating in training and initiatives to protect our data and the privacy of our stakeholders. Unqork is an equal opportunity employer. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age.

View details: Principal Data Architect

United States

$215.1K - $286.2K / year

Apply

Job Closed

Senior Data Engineer

MUTT DATA

We are Data Nerds. Astronomer & Amazon Consulting Partners.

Data Engineer132 days ago

Full Time RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

• Collaborate with the team to define goals and deliver custom data solutions. • Innovate with new tools to improve Mutt Data's infrastructure and processes. • Design and implement ETL processes, optimize queries, and automate pipelines. • Own projects end-to-end—build, maintain, and improve data systems while working with clients. • Develop tools for the team and assist in tech migrations and model design. • Build scalable, high-performance data architectures. • Focus on code quality—review, document, test, and integrate CI/CD.

Airflow AWS Azure Docker ETL GCP NumPy Pandas Python SQL

View details: Senior Data Engineer

Argentina

Apply

Databricks Architect

Stitch

We help marketers get more from their tech stacks — with a focus on driving success with Braze and Segment.

Data Engineer132 days ago

Other RemoteTeam 11-50H1B Sponsor

Company Site LinkedIn

Stitch is the global leader in helping brands drive CRM performance through Braze and Databricks. We work with Fortune 1000 brands to design and execute martech-driven solutions and lifecycle marketing strategies that drive meaningful outcomes—producing award-winning work for brands like Taco Bell and e.l.f. cosmetics. Our fast-growing team spans the US, UK, and Canada. We’re actively building the best customer relationship marketing (CRM) consultancy in the world. Not the biggest—the best. We’re looking for teammates who are energized by this challenge. Who You Are First and foremost, you’re motivated by impact. To be the best CRM consultancy in the world, we have to be motivated by performance—both in how our work drives outcomes for our customers, and for how our work drives outcomes for Stitch. This means you’re not content with the status quo. You’re always looking towards what’s next and seeking opportunities to grow yourself, grow our customers, and grow Stitch. You have relentless standards and uphold them unapologetically. You move nimbly, take ownership, drive things forward, and make the people around you better. If this sounds like you, you will thrive here. The Role As a Technical Architect — Databricks, you’ll play a pivotal role in scaling Stitch’s Databricks consulting practice. You’ll own end-to-end Lakehouse architecture, from Braze and Segment integrations to AI-driven customer 360 solutions, combining deep technical expertise with the agility of a high-growth consultancy. You’ll be instrumental in shaping how Stitch delivers Databricks solutions and building something that didn’t exist here before. To thrive in this role, you’ll need to be as comfortable presenting architecture recommendations to senior client stakeholders as you are writing PySpark. You bring a strong point of view, a curiosity-first approach to client problems, and the drive to make a real impact. What You’ll Do - Lead and drive client working sessions focused on data architecture and data mapping, bringing clarity and direction to complex discussions with senior stakeholders. - Design and own end-to-end Databricks Lakehouse architecture for marketing data ingestion, transformation, storage, and consumption—leveraging Unity Catalog, Delta Lake, Delta Live Tables, and Databricks SQL Warehouse. - Design integrations between Databricks and martech platforms (Braze, Segment, CDPs) to enable customer 360 views, churn prediction, media mix modeling, dynamic pricing, and lifetime value analysis. - Design and implement robust data models—including medallion architecture, Star Schema, and Data Vault—optimized for marketing analytics workloads. - Enable marketers to query data conversationally and generate actionable insights without SQL through Databricks Genie and AI-powered analytics. - Support the design and implementation of AI agents using Agent Bricks and the Mosaic AI Agent Framework. - Review and interpret technical capability maps and frameworks, connecting them to data architecture and design decisions. - Architect data governance, security, and access control policies using Unity Catalog, including lineage tracking, audit logging, PII masking, and marketing data privacy compliance. - Integrate Databricks workflows into CI/CD pipelines using Terraform, Git, and Azure DevOps; implement infrastructure-as-code and automated deployment for notebooks, jobs, and clusters. - Partner with clients on roadmaps, POCs, and migrations; support pre-sales efforts with technical proposals, whiteboarding sessions, and project estimation. - Manage your time to meet billable targets of 36+ hours per week. - Travel up to 20% for client meetings, workshops, and onsite engagements. How You Consult At Stitch, every role is expected to operate as a trusted advisor whose expertise helps our clients with their business needs. As a Technical Architect, your credibility comes from deep technical depth combined with the ability to communicate clearly across technical and business audiences. - Lead requirements, validation, and ideation sessions with technical and business stakeholders. - Advise clients on data architecture and platform tradeoffs. Navigate ambiguity and translate between marketers and engineers. - Build relationships with client technical leads. Become the architect they trust and request. - Bring a strong point of view and recommendations—not just options. Anticipate what the client needs before they ask. - Own outcomes on your technical workstream. Lead without being asked. - Share feedback with peers regularly that makes all of us better. What Success Looks Like - Clients see you as a trusted data architecture advisor. You’re confident meeting with senior stakeholders and bringing a strong point of view around best practices and creative solutions. - You show up to every meeting prepared and engaged. Every touchpoint—Slack, email, Zoom, or onsite—is an opportunity to add value. You’re responsive and work with a sense of urgency. - You consistently meet deadlines. We’re a professional services business and our success depends on client satisfaction. Hitting your deadlines gives the Stitchers depending on your work enough runway to do theirs. - You manage your weekly schedule intentionally to meet quarterly billable targets. You’ll be context-switching between deep technical work, client meetings, internal collaboration, certifications, and time-tracking—oftentimes across multiple clients at once. Staying organized is what makes it all feel manageable. Who Your Team Will Be Stitchers come with diverse backgrounds and experiences—but we are tied together by our Common Threads. These are the values that define our Stitchers: Form Lasting Bonds — We’re a people business. We consistently show up for our team, our partners, and our clients in ways that build trust and champion others. Seek Solutions — We aren’t order takers. We’re solution designers. We solve problems, move fast, deliver excellence, and drive results. We always have a point of view. Don’t Settle — We don’t accept the way things have always been done. We think bigger, pursue growth, and find motivation in the discomfort that scares most people away. Take the Lead — We forge our own path without waiting for permission. We push boundaries, lead our team and customers, and find the answer when one doesn’t exist yet. What You’ll Get at Stitch - The opportunity to work within multiple Fortune 1000 brands across the retail, media, gaming, quick-service, financial services, travel, and healthcare industries. - Working alongside the most concentrated group of Braze experts in the world—backed by more Braze credentials and experience than any other company across the globe. - The ability to feel your impact and shape how Stitch grows as we continue to scale through hypergrowth. - The chance to learn new technologies (like Braze, Databricks, and AI) and grow faster than you ever have before. - Of course, we offer competitive compensation, benefits, and growth opportunities, too.

View details: Databricks Architect

United States

Apply

Job Closed

Senior Data Engineer

Highmark Health

Creating remarkable health experiences, freeing people to be their best.

Data Engineer132 days ago

Other RemoteTeam 10,001+Since 1852H1B Sponsor

Company Site LinkedIn

Company : enGenJob Description : JOB SUMMARY We are seeking a highly skilled and adaptable Senior FHIR Interoperability Engineer to drive the reliable movement, transformation, and storage of complex healthcare data using Fast Healthcare Interoperability Resources (FHIR) standards. In this pivotal senior-level role, you will collaborate with data architects, analysts, and other engineers to design, develop, implement, and optimize FHIR-enabled data pipelines. This includes ensuring data quality, integrity, and security across various platforms, from initial concept to ongoing support. This role is essential for the secure and efficient exchange of patient clinical outcome data and payer claim and membership data between payers and providers. Your expertise will be critical in ensuring compliance with federal and Blue Cross Blue Shield Association (BCBSA) mandates and directly contributing to Highmark Health's strategic "quintuple aim" objectives: reducing the cost of care, addressing health equity and access, improving health outcomes, and enhancing both customer and clinician/provider experiences. Key Skills: - Fast Healthcare Interoperability Resources (FHIR) - ETL Tools & Scripting Languages: Python, PySpark, DBT - Cloud & Big Data Platforms: Databricks, Google Cloud Platform (GCP), Google Healthcare Data Engine (HDE), BigQuery, PostgreSQL - Data Virtualization tools and frameworks: Starburst - EHR Systems: Epic, Cerner (and similar) - Cloud Orchestration: Terraform ESSENTIAL RESPONSIBILITIES - Design, develop, and maintain robust data processes and solutions to ensure the efficient movement and transformation of data across multiple systems - Develop and maintain data models, databases, and data warehouses to support business intelligence and analytics needs - Collaborate with stakeholders across IT, product, analytics, and business teams to gather requirements and provide data solutions that meet organizational needs - Monitor work against production schedule, provide progress updates, and report any issues or technical difficulties to lead developers regularly - Implement and manage data governance practices, ensuring data quality, integrity, and compliance with relevant regulations. - Collaborate on the design and implementation of data security measures, including access controls, encryption, and data masking - Mentor other associate and intermediate data engineers as needed - Perform data analysis and provide insights to support decision-making across various departments - Stay current with industry trends and emerging technologies in data engineering, recommending new tools and best practices as needed - Other duties as assigned or requested. EXPERIENCE Required - 5 years of experience in design and analysis of algorithms, data structures, and design patterns in the building and deploying of scalable, highly available systems - 5 years of experience in a data engineering, ETL development, or data management role. - 5 years of experience in SQL and experience with database technologies (e.g., MySQL, PostgreSQL, MongoDB). - 5 years of experience in data warehousing concepts and experience with data warehouse solutions (e.g., Snowflake, Redshift, BigQuery) Preferred - Experience with data streaming and workflow management tools (e.g., Confluent Kafka/Flink, Google Dataflow), data virtualization tools and frameworks (e.g. Starburst), and SQL-structured pipeline development tools (DBT). - Experience with cloud infrastructure provisioning automation and scripting, including Terraform. - 7 years of experience defining system architectures and exploring technical feasibility trade-offs for optimizing short term execution while planning for long term technical capabilities - 7 years of experience working with a variety of technology systems, designing solutions or developing data solutions in healthcare - 7 years of experience with cloud platforms (AWS, Azure, GCP) and their respective data services - 7 years of experience in data governance, data quality, and data security best practices - 7 years of experience translating requirements, design mockups, prototypes or user stories into technical designs - 7 years of experience in producing data-related code that is fault-tolerant, efficient, and maintainable SKILLS - Demonstrated ability to achieve stretch goals in a highly innovative and fast-paced environment - Adaptability: Strong ability to take on diverse tasks and projects, adapting to the evolving needs of the organization - Analytical Thinking: Strong analytical skills with a focus on detail and accuracy - Interest and ability to learn other data development technologies/languages as needed - Technical Proficiency: Comfortable with a range of data tools and technologies, with a willingness to learn new skills as needed - Strong track record in designing and implementing large-scale data sources - Strong sense of ownership, urgency, and drive - Demonstrated passion for user experience and improving usability - Team Collaboration: A team player who can work effectively in cross-functional environments - Experience and willingness to mentor junior data engineers and help develop their skills and leadership EDUCATION Required - Bachelor’s degree in Computer Science, Information Systems, Data Science, Computer Engineering or related field Preferred - Master's degree in Computer Science, Information Systems, Data Science, Computer Engineering or related field LICENSES or CERTIFICATIONS Required - None Preferred - None Language (Other than English): None Travel Requirement: 0% - 25% PHYSICAL, MENTAL DEMANDS and WORKING CONDITIONS Position Type Office- or Remote-based Teaches / trains others Occasionally Travel from the office to various work sites or from site-to-site Rarely Works primarily out-of-the office selling products/services (sales employees) Never Physical work site required No Lifting: up to 10 pounds Constantly Lifting: 10 to 25 pounds Occasionally Lifting: 25 to 50 pounds Rarely Disclaimer: The job description has been designed to indicate the general nature and essential duties and responsibilities of work performed by employees within this job title. It may not contain a comprehensive inventory of all duties, responsibilities, and qualifications required of employees to do this job. Compliance Requirement: This job adheres to the ethical and legal standards and behavioral expectations as set forth in the code of business conduct and company policies. As a component of job responsibilities, employees may have access to covered information, cardholder data, or other confidential customer information that must be protected at all times. In connection with this, all employees must comply with both the Health Insurance Portability Accountability Act of 1996 (HIPAA) as described in the Notice of Privacy Practices and Privacy Policies and Procedures as well as all data security guidelines established within the Company’s Handbook of Privacy Policies and Practices and Information Security Policy. Furthermore, it is every employee’s responsibility to comply with the company’s Code of Business Conduct. This includes but is not limited to adherence to applicable federal and state laws, rules, and regulations as well as company policies and training requirements. Pay Range Minimum: $78,900.00 Pay Range Maximum: $147,500.00 Base pay is determined by a variety of factors including a candidate’s qualifications, experience, and expected contributions, as well as internal peer equity, market, and business considerations. The displayed salary range does not reflect any geographic differential Highmark may apply for certain locations based upon comparative markets. Highmark Health and its affiliates prohibit discrimination against qualified individuals based on their status as protected veterans or individuals with disabilities and prohibit discrimination against all individuals based on any category protected by applicable federal, state, or local law. We endeavor to make this site accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact the email below. For accommodation requests, please contact HR Services Online at HRServices@highmarkhealth.org California Consumer Privacy Act Employees, Contractors, and Applicants Notice

View details: Senior Data Engineer

United States

$78.9K - $147K / year

Apply

Job Closed

Senior Data Engineer

Job Description

Job Requirements

Related Guides

Related Categories

Related Job Pages

More Data Engineer Jobs

Principal Data Architect

Senior Data Engineer

Databricks Architect

Senior Data Engineer