Chartbeat

Analytics and optimization tools for digital content creators. See our latest insights on blog.chartbeat.com.

Staff Engineer – Data Engineering

Data EngineerData EngineerFull Time Remote LeadTeam 51-200Since 2009H1B SponsorCompany Site LinkedIn

Location

United States

Posted

83 days ago

Salary

$220K - $240K / year

Seniority

Lead

Postgraduate Degree8 yrs expEnglishAirflow BigQuery Cloud Kafka Kubernetes Python Spark

Job Description

• Design, own and contribute to the cross-division data architecture, establishing patterns and standards that scale across both Chartbeat and Tubular platforms • Lead the technical strategy for integrating data products across divisions, enabling new cross-brand features and insights • Drive the AI and backend infrastructure roadmap - from data pipelines that feed models to deployment patterns for LLM-powered features • Identify and close skill gaps across the data engineering team, actively mentoring engineers and elevating the collective technical ceiling • Collaborate with product, engineering, and ML teams to translate business needs into scalable architectural decisions • Establish and evolve best practices for data modeling, pipeline design, and system observability across the organization • Evaluate and prototype emerging AI and generative AI technologies for application to real-world product challenges • Participate in engineering planning and contribute to roadmap prioritization with a cross-division perspective • Be part of engineering on call rotation to monitor and maintain the health of our production systems.

Job Requirements

8+ years of experience in data engineering, data architecture, or a closely related discipline
Proven experience designing large-scale data systems across complex, multi-product or multi-division environments
Strong proficiency with cloud data warehouse technologies (Snowflake, BigQuery, or equivalent)
Experience building and maintaining data pipelines at scale, streaming and batch, using tools such as Kafka, Spark, Airflow, or similar
Demonstrated experience integrating AI and ML capabilities into data systems, including LLMs or foundation models in production environments
Strong Python and Data Architecture experience
Excellent communication skills - able to translate complex architectural decisions for both technical and non-technical audiences
A track record of elevating engineering teams through mentorship, documentation, and technical leadership
Experience with Kubernetes or containerized infrastructure is a plus

Benefits

Comprehensive Health, Dental, and Vision Insurance
401K with company match (100% of the first 3% and 50% of the next 2%)
Fully Paid Parental Leave - 18 weeks for birthing parents, 12 weeks for non-birthing parents
Phone and internet stipend
Wellness, learning, and coworking reimbursements
Flexible work hours
Unlimited PTO
11 paid holidays and December holiday closure
Annual In-Person Event

Related Categories

Data Engineer

Related Job Pages

Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More Data Engineer Jobs

Director, Product Management – AI/Data Platform

SailPoint

At SailPoint, we believe enterprise security must start with identity at the foundation. Today’s enterprise runs on a diverse workforce of not just human but also digital identities—and securing them all is critical. Through the lens of identity, SailPoint empowers organizations to seamlessly manage and secure access to applications and data at speed and scale. Our unified, intelligent, and extensible platform delivers identity-first security, helping enterprises defend against dynamic threats while driving productivity and transformation. Trusted by many of the world’s most complex organizations, SailPoint secures the modern enterprise.

Data Engineer83 days ago

Full Time RemoteTeam 1,001-5,000Since 2005H1B Sponsor

Company Site LinkedIn

• Responsible for the foundational data infrastructure that powers all of SailPoint's AI-driven insights and intelligent features. • Build and scale the platform that collects, processes, and enriches a vast identity data. • Designed to intelligently collect, transform, normalize, correlate, summarize, and store customer-specific and third-party data. • Provide leverage for R&D organization, ensuring consistency, quality, and speed in delivery. • Deliver significant new platform capabilities that are used by multiple product teams to launch AI-powered features.

Cloud Kafka

View details: Director, Product Management – AI/Data Platform

Texas

$172.8K - $291.2K / year

Apply

Job Closed

Data Engineering Lead

Seer Interactive

Performance marketing agency where people & data intersect. Focused on PPC, SEO, Analytics, Creative, & CRO Services.

Data Engineer83 days ago

Full Time RemoteTeam 51-200Since 2002H1B No Sponsor

Company Site LinkedIn

• Set the technical direction for Seer’s data platform — define the architecture, the standards, and the roadmap that other engineers build against. • Architect Seer’s data platform end-to-end across BigQuery, Dataflow, Airflow, Cloud Run, Cloud Functions, and Cloud Storage, making the cost, latency, and freshness tradeoffs that let us scale without surprises. • Set the standard for how we model and serve data, from dbt project structure to LookML, Looker explores, and embedded Looker experiences inside client portals and internal tools. • Mentor data engineers and analysts across the team — pair on tough problems, lead architectural reviews, leave PR comments that make people better (not just their code), and run the occasional workshop on TDD, git internals, or LLM-augmented engineering. • Champion engineering practice — TDD, modular Pytest suites, clean Git workflows, and CI/CD pipelines with multi-stage deploys. Make the bar visible, then help everyone meet it. • Tune what’s expensive: hunt down the slow query, the redundant DAG, the leaky storage policy. Cut BigQuery costs and dashboard load times, and bring the team along with you. • Leverage tools like ChatGPT, Claude, Gemini, and prompt-evaluation harnesses to accelerate research, code review, and pipeline development — and design AI-driven workflows that scale Seer’s engineering throughput. • Document and teach. Write playbooks that survive system evolution, decision records that explain *why* not just *what*, and onboarding paths that compound across new hires.

Airflow BigQuery Cloud Google Cloud Platform Python SQL

View details: Data Engineering Lead

United States

$120K - $130K / year

Apply

Senior AI Data Engineer

Cortica

Powering The Future With Autonomous AI

Data Engineer83 days ago

Full Time RemoteTeam 51-200Since 2007H1B Sponsor

Company Site LinkedIn

Role Description Cortica is looking for a Senior AI Data Engineer to join its growing team! The Senior AI Data Engineer will serve as both architect and builder of our data ecosystem. Every initiative will follow a complete engineering lifecycle: - Gathering stakeholder requirements - Designing the solution - Building and testing it - Shipping it to production This role will work across data lakes, analytics pipelines, and lightweight application development— the multi-disciplinary data equivalent of a full-stack developer. The Senior AI Data Engineer will work closely with the data science, finance, and clinical operations teams to design intelligent, automated data solutions that power care decisions, financial planning, and operational efficiency. AI augmentation is not optional — it is the standard working mode. What will you do? - Full Lifecycle Project Ownership: - Engage stakeholders directly to gather, clarify, and document project requirements. - Translate requirements into architected data solutions: choose the right storage, pipeline, modeling, and delivery approach for each problem. - Own testing end-to-end — unit tests, data quality checks, reconciliation, and integration tests before anything reaches production. - Deploy solutions to production and monitor post-deployment health, iterating rapidly based on real-world feedback. - AI-First Data Engineering: - Run parallel AI coding sessions (Claude Code, Cursor, Codex) across different facets of a pipeline simultaneously — orchestrate, verify, and integrate the outputs. - Build and maintain context files (CLAUDE.md equivalents) for data projects that encode schema conventions, pipeline patterns, and institutional knowledge — making every future AI session smarter. - Design verification loops: automated data quality checks, dbt tests, CI hooks, and pipeline monitors that give AI agents concrete feedback on correctness. - Build MCP (Model Context Protocol) or equivalent integrations to connect AI agents directly to Snowflake, Amazon Athena, Postgresql, MySql, Power BI APIs, Salesforce, and internal tooling. - Prefer frontier models for complex architectural decisions and rely on AI acceleration to dramatically increase engineering throughput. - Data Platform & Pipeline Engineering: - Design and build complex, reliable data pipelines ingesting from AWS, Azure, Salesforce, MuleSoft, and multiple third-party APIs into our AWS Data Lake and Snowflake warehouse. - Implement and evolve data models using Kimball methodology to support financial, operational, and clinical analytics. - Optimize pipeline performance, manage data quality, and perform root-cause analysis on data anomalies — internal and external. - Develop and maintain orchestration workflows in Python, and AWS Glue. - Continuously evolve the data schema as business and engineering requirements change. - Analytics & Reporting Enablement: - Build and support Power BI data models and reports; empower analytics team members to self-serve on a reliable data foundation. - Work with data analysts and data scientists to build reusable, well-documented pipeline components they can extend independently. - Deliver data products that drive clinical care decisions, financial planning, and operational performance improvements. - Application Development: - Build lightweight internal data applications and tooling where needed; data entry interfaces, operational dashboards, automation scripts that bridge the gap between data pipelines and end users. - Design for agentic workflows: build AI-powered data tools accessible via web interfaces or Slack that surface insights proactively. - Integrate with Salesforce Health Cloud and other platforms using APIs and event-driven patterns. - Security, Governance & Collaboration: - Ensure data security and HIPAA compliance in all pipeline and application work. Partner with IT to enforce data governance standards. - Document decisions, tradeoffs, and architecture clearly so that future engineers (and AI agents) can build on your work effectively. - Collaborate across IT, finance, clinical operations, and data science — acting as the connective tissue between data infrastructure and business outcomes. Qualifications - 5+ years of hands-on data engineering experience, including building and operating production data pipelines. - Expert-level Python skills for ETL, pipeline orchestration, and automation. - Deep SQL proficiency — query optimization, data modeling, stored procedures. - 2+ years’ experience working with AI first development workflows. - 4+ years’ experience with AWS (S3, Glue, Lambda, Redshift), and/or Azure big data services. - 1+ year of experience with Snowflake. - 2+ years of experience with orchestration frameworks. - 2+ years of Salesforce experience with Apex and configurations. - Experienced with Kimball dimensional modeling — built star schemas and conformed dimensions in production. - Power BI (or equivalent BI tool) experience — data model design and report development. - API integration experience — REST, GraphQL, event streaming (Kafka, Kinesis, or similar). - Application development literacy — comfortable building lightweight web tooling (Python/Flask, Node, or similar) to complement data products. Benefits - Medical, dental, and vision insurance - 401(k) plan with company matching and rapid vesting - Paid holidays and wellness days - Life insurance - Disability insurance options - Tuition reimbursements for professional development and continuing education - Referral bonuses Compensation The base pay range for this opening is $160,000 to $200,000. According to your skill level, relevant experience, education level, and location, you will receive compensation that fits appropriately within the range.

AI Mode Data Engineering dbt AI Agents Snowflake Amazon Athena PostgreSQL MySQL Power BI Salesforce AWS Azure Python Slack ETL SQL Amazon S3 Amazon Lambda Amazon Redshift GraphQL Apache Kafka Amazon Kinesis Flask

View details: Senior AI Data Engineer

United States

$160K - $200K / year

Apply

Job Closed

Senior AI Data Engineer

Cortica

Powering The Future With Autonomous AI

Data Engineer83 days ago

Full Time RemoteTeam 51-200Since 2007H1B Sponsor

Company Site LinkedIn

Role Description The Senior AI Data Engineer will serve as both architect and builder of our data ecosystem. Every initiative will follow a complete engineering lifecycle: - Gathering stakeholder requirements - Designing the solution - Building and testing it - Shipping it to production This role will work across data lakes, analytics pipelines, and lightweight application development— the multi-disciplinary data equivalent of a full-stack developer. The Senior AI Data Engineer will work closely with the data science, finance, and clinical operations teams to design intelligent, automated data solutions that power care decisions, financial planning, and operational efficiency. AI augmentation is not optional — it is the standard working mode. Qualifications - 5+ years of hands-on data engineering experience, including building and operating production data pipelines. - Expert-level Python skills for ETL, pipeline orchestration, and automation. - Deep SQL proficiency — query optimization, data modeling, stored procedures. - 2+ years’ experience working with AI first development workflows. - 4+ years’ experience with AWS (S3, Glue, Lambda, Redshift) and/or Azure big data services. - 1+ year of experience with Snowflake. - 2+ years of experience with orchestration frameworks. - 2+ years of Salesforce experience with Apex and configurations. - Experience with Kimball dimensional modeling — building star schemas and conformed dimensions in production. - Power BI (or equivalent BI tool) experience — data model design and report development. - API integration experience — REST, GraphQL, event streaming (Kafka, Kinesis, or similar). - Application development literacy — comfortable building lightweight web tooling (Python/Flask, Node, or similar) to complement data products. - Reside in one of the specified states. Requirements - Engage stakeholders directly to gather, clarify, and document project requirements. - Translate requirements into architected data solutions: choose the right storage, pipeline, modeling, and delivery approach for each problem. - Own testing end-to-end — unit tests, data quality checks, reconciliation, and integration tests before anything reaches production. - Deploy solutions to production and monitor post-deployment health, iterating rapidly based on real-world feedback. - Run parallel AI coding sessions across different facets of a pipeline simultaneously — orchestrate, verify, and integrate the outputs. - Build and maintain context files for data projects that encode schema conventions, pipeline patterns, and institutional knowledge. - Design verification loops: automated data quality checks, dbt tests, CI hooks, and pipeline monitors. - Build MCP (Model Context Protocol) or equivalent integrations to connect AI agents directly to various platforms. - Design and build complex, reliable data pipelines ingesting from multiple sources into our AWS Data Lake and Snowflake warehouse. - Implement and evolve data models using Kimball methodology to support analytics. - Optimize pipeline performance, manage data quality, and perform root-cause analysis on data anomalies. - Develop and maintain orchestration workflows in Python and AWS Glue. - Continuously evolve the data schema as business and engineering requirements change. - Build and support Power BI data models and reports; empower analytics team members to self-serve on a reliable data foundation. - Work with data analysts and data scientists to build reusable, well-documented pipeline components. - Deliver data products that drive clinical care decisions, financial planning, and operational performance improvements. - Build lightweight internal data applications and tooling where needed. - Design for agentic workflows: build AI-powered data tools accessible via web interfaces or Slack. - Ensure data security and HIPAA compliance in all pipeline and application work. - Document decisions, tradeoffs, and architecture clearly. - Collaborate across IT, finance, clinical operations, and data science. Benefits - Medical, dental, and vision insurance. - 401(k) plan with company matching and rapid vesting. - Paid holidays and wellness days. - Life insurance and disability insurance options. - Tuition reimbursements for professional development and continuing education. - Referral bonuses.

AI Mode Data Engineering Python ETL SQL AWS Amazon S3 Amazon Lambda Amazon Redshift Azure Snowflake Salesforce Power BI GraphQL Apache Kafka Amazon Kinesis Flask dbt AI Agents Slack

View details: Senior AI Data Engineer

United States

$160K - $200K / year

Apply

Job Closed

Staff Engineer – Data Engineering

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Data Engineer Jobs

Director, Product Management – AI/Data Platform

Data Engineering Lead

Senior AI Data Engineer

Senior AI Data Engineer