Job Closed

This listing is no longer active.

Samsara

Pioneer of the Connected Operations Cloud

Data Engineer II

Data Engineer | Other Remote | Mid Level | Team 1,001-5,000 | Since 2015 | H1B Sponsor

Location

United States

Posted

142 days ago

Salary

$101K - $153K / year

Seniority

Mid Level

Databricks, Apache Spark, Python, SQL, ETL, dbt, AI

Job Description

This description is a summary of our understanding of the job description.

Role Description

Samsara’s Revenue Operations AI & Data Team is building the future of how we go to market, with intelligence, personalization, and speed. We’re a high-impact team of builders, scientists, and strategists focused on transforming sales operations through AI. Our mission is to help sellers reach the right customer at the right time with the right message, and to put everything they need at their fingertips, whether that’s data from Salesforce, context from a past call, or content that wins deals.

As a Data Engineer II, you’ll own the data platforms that power Samsara’s GTM AI engine. You’ll be responsible for:

- Building, scaling, and optimizing our Databricks data store, visualization store, and AI store.
- Enabling large-scale generative AI jobs in Databricks.
- Ensuring that our AI applications are grounded in clean, reliable, and well-structured data, from CRM pipelines and CS systems to GenAI-powered copilots.
- Partnering closely with data scientists, AI engineers, and business stakeholders to deliver the infrastructure that fuels innovation at scale.

This role is open to candidates residing in the US except the San Francisco Bay Metro Area, NYC Metro Area, and Washington, D.C. Metro Area.

You should apply if:

- You want to impact the industries that run our world.
- You are the architect of your own career.
- You’re energized by our opportunity.
- You want to be with the best.

In this role, you will:

- Build and maintain ETL/ELT data pipelines in Databricks and Spark, ensuring data is ingested, transformed, and delivered reliably for analytics and AI use cases.
- Develop and evolve logical and physical data models to support reporting, experimentation, and advanced workflows (e.g., scoring models, signal generation).
- Implement monitoring, alerts, and testing for data quality, timeliness, and lineage to ensure trustworthy data delivery.
- Support workflow orchestration with Databricks Jobs, dbt, or equivalent scheduling tools to operate at scale.
- Contribute to data pipelines and tooling that support retrieval-augmented generation (RAG), vector integrations, or embedding workflows.
- Design and optimize bulk GenAI data pipelines in Databricks to support generative AI applications at scale.
- Partner with AI engineers and data scientists to enable experimentation, model training, and production-grade deployments.
- Develop frameworks for data ingestion, transformation, governance, and monitoring across CRM, sales, and revenue systems.
- Work with RevOps, sales, and customer success stakeholders to translate business needs into data requirements and stable technical implementations.

Qualifications

- 2-3 years of industry experience in data engineering, with significant experience building large-scale data platforms.
- Hands-on experience working with a modern data technology stack, such as Databricks, dbt, Redshift, RDS, Snowflake, or similar solutions.
- Proficiency in Python and SQL, with experience in designing robust ETL/ELT pipelines.
- Experience orchestrating data workflows at scale and enabling machine learning or AI use cases.
- Strong understanding of data modeling, performance optimization, and cost-efficient infrastructure design.
- Located in and authorized to work in the United States (this is a fully remote role).

Requirements

- Experience enabling generative AI workflows in Databricks or similar platforms.
- Familiarity with vector databases, embeddings, and retrieval systems.
- Experience with Salesforce, Gainsight, Gong, Outreach, or other CRM/enablement tools as data sources.
- Proven ability to automate repetitive tasks, improve data hygiene, and enable experimentation across GTM data use cases.
- Exposure to observability, monitoring, and governance best practices for data and AI systems.
- Ability to collaborate closely with AI/ML teams while driving technical excellence in data engineering.

Benefits

- Annual on-target earnings (OTE) range for full-time employees for this position is $101,745 to $153,900 USD.
- Above-market total compensation through a combination of base salary, performance-based bonus/variable pay, and equity (for eligible roles).
- Flexible, employee-led remote model.
- Professional development stipend.
- Comprehensive health and parental leave plans.

More Data Engineer Jobs

Data Ingestion Specialist

Thyme Care

Data Engineer | 142 days ago | Other Remote

This description is a summary of our understanding of the job description.

Role Description

As a Data Ingestion Specialist on our Data Ingestion & Care Enablement (DICE) team, you’ll be part of the horizontal layer that keeps partner and vendor data flowing reliably into Thyme Care. In this position, you will collaborate closely with our Product Manager and other Data teammates focused on data ingestion and analytics engineering. You’ll work across many deals and data sources, supporting Data Scientist deal owners by making ingestion consistent, debugging failures, and raising the reliability bar through better tests, monitoring, and data contracts.

In the course of your work you should also expect to:

- Gain a deep understanding of our data platform and contribute to improving our data models and pipelines using SQL, dbt, and Python (generally data-focused packages, e.g., pandas, polars)
- Support ingestion of a wide range of healthcare-related sources (claims, eligibility, prior auth, ADT, etc.) by:
  - Configuring net-new ingestions (parsing file specs, validating assumptions, communicating inconsistencies)
  - Debugging issues in ongoing ones
  - Helping standardize our processes and pipelines
- Collaborate with data scientist deal owners and internal stakeholders to turn messy, ambiguous requirements into concrete mapping/validation logic and durable data contracts
- Use Dagster and GitHub Actions to orchestrate and automate the early stages of our data pipelines, improving run reliability and reducing manual intervention
- Work hands-on with raw data using Jupyter Notebooks in Databricks to investigate data issues, validate assumptions, and unblock processing
- Design and support incremental data loads (append/merge/upsert patterns) and safe reprocessing (idempotent runs, late-arriving data, backfills)
- Learn to use Datadog and PagerDuty to monitor pipelines, triage incidents during business hours, communicate impact clearly, and drive root-cause fixes to prevent recurrences
- Contribute to a complex, self-hosted dbt monorepo: implement transformations, incremental models, tests, documentation, and conventions that scale across deals

Qualifications

- Strong SQL skills
- Familiarity with dbt (and an interest in ramping up your expertise), including working in larger/complex projects
- Working knowledge of Python for data investigation in notebooks
- Experience operating data pipelines: debugging failures, tracing issues across systems, and communicating clearly about root cause and mitigation
- Experience with testing and data quality: writing and maintaining tests and using failures/alerts to drive durable fixes
- Responsiveness and the ability to stay calm and organized when triaging failing ingestion runs or pipelines
- Willingness to learn new domains and tools quickly (new partner file formats, evolving standards, Databricks), and apply feedback without ego
- The ability to engage technical and non-technical stakeholders to explain what’s happening in our pipelines and identify opportunities to improve transparency and alerting
- Nice-to-have: healthcare data exposure (claims/eligibility/ADT/etc.)

Requirements

- Act with our members in mind: Thyme Care’s mission, and in particular our member experience, matters deeply to you.
- Move with purpose: You’re biased to action. You know how to identify and prioritize your initiative’s needs, and do what it takes to ensure that urgent and important needs are acted on immediately.
- Seek diverse perspectives: You are humble and actively seek feedback from others, eager to learn and share knowledge.
- Reliability mindset: You treat recurring failures as solvable system problems and systematically reduce them with testing and automation.
- Practical judgment: You know when to ship a safe fix now vs. invest in a sturdier improvement, and you document the why.
- Clear collaboration: You work smoothly with DS, PM, and Platform counterparts, understanding competing priorities and turning ambiguity into concrete next steps.
- Comfort with ambiguity: You have a successful track record working at scaling organizations, in fast-paced environments, and at ambitious startups. You navigate through challenges and find solutions in uncertain situations.

Benefits

- Base salary for this role is $139,500-$155,000.
- Salary ranges are based on paying competitively for our size and industry, and are one part of the total compensation package that also includes equity, benefits, and other opportunities at Thyme Care.
- Individual pay decisions are based on several factors, including qualifications, experience level, skillset, and balancing internal equity relative to other Thyme Care employees.

Company Description

We recognize a history of inequality in healthcare. We’re here to challenge the status quo and create a culture of inclusion through the care we give and the company we build. We embrace and celebrate a diversity of perspectives in reflection of our members and the members we serve. We are an equal-opportunity employer.

SQL, dbt, Python, Databricks, GitHub Actions, Datadog

United States

$139K - $155K / year

Job Closed

IT Manager/Engagement Lead for Enterprise Data

Horace Mann

We're here for all school employees! Helping them live better and retire happier.

Data Engineer | 142 days ago | Other Remote | Team 1,001-5,000 | Since 1945 | H1B Sponsor

Horace Mann is seeking an experienced and strategic IT Manager/Engagement Lead for Enterprise Data to assist with coordinating across multiple teams: Data Governance, Data Enablement Real-Time Services, Data Warehousing, and Data Reporting. This leader will play a pivotal role in ensuring stability, efficiency, visibility, and continuous improvement of enterprise data capabilities, balancing evolving business priorities with a commitment to technical excellence and innovation.

The ideal candidate will possess practical analytical expertise, a deep understanding of business needs, and exceptional decision-making skills to evaluate and prioritize requests that maximize business value. The candidate will encourage partners to use generative AI to enhance productivity and creativity. This role requires someone who is highly organized, an equally effective communicator with both business and technical partners, and driven to make a meaningful and measurable impact across the enterprise. This Engagement Lead must be able to navigate competing demands, align technology work with strategic goals, and create a structured approach to data support, incremental enhancements, and strategic advancements.

Key Responsibilities

Strategic Prioritization & Business Alignment

- Evaluate and prioritize incoming requests based on business impact, urgency, and technical feasibility.
- Set clear expectations with stakeholders, ensuring alignment with organizational goals.
- Communicate trade-offs transparently, advocating for business value over reactive task execution.
- Facilitate progression of effort from roadmap to project intake and project initiation, including prioritization, cost-benefit analysis, and other pre-project tasks.
- Develop a structured framework for prioritization that ensures mission-critical support needs are met while also advancing right-sized enhancements that drive continuous improvement.

Leadership & Team Management

- Ensure operational metrics, roadmaps, goals/outcomes, project status, and other critical report-outs for the data department are updated appropriately.
- Identify and help remove operational roadblocks that limit team effectiveness, including process inefficiencies, unclear responsibilities, or misaligned priorities.
- Facilitate necessary resource capacity/demand activities, including planning, onboarding, tracking, and support for both employee and contractual staff across all enterprise data teams.
- Ensure cross-team collaboration, fostering a shared understanding of priorities, dependencies, and technical solutions.
- Create and uphold accountability structures to ensure work is completed efficiently and of high quality.
- Drive a culture of ownership, problem-solving, and continuous learning.
- Coach team leads on best practices for planning, reporting, and cross-functional collaboration.
- Act as a central point of escalation and resolution for cross-team resource conflicts or competing priorities.
- Promote the use of generative AI tools to boost team creativity, efficiency, and skill development.

Technical & Business Understanding

- Serve as the bridge between technology teams and business stakeholders, ensuring both sides understand priorities, constraints, and impacts.
- Maintain a strong grasp of enterprise data systems, workflows, and potential challenges, ensuring informed decision-making.

Stakeholder Communication & Expectation Management

- Set firm but fair expectations with business stakeholders, ensuring they understand how and why prioritization decisions are made.
- Provide regular updates on progress, challenges, and key outcomes related to support and enhancements.
- Communicate realistic timelines, risks, and trade-offs when managing requests.

Qualifications

Education

- Bachelor’s degree in Computer Science, Information Technology, Business Administration, or a related field.

Experience

- 6+ years of experience in IT leadership, with a focus on managing IT projects, technical support, and/or system enhancements.
- Proven track record of making strategic prioritization decisions that drive business value.
- Experience implementing generative AI technologies in development workflows.

Skills & Competencies

Technical & Business Expertise

- Familiarity with at least one area of Data Management, including but not limited to data-intensive applications, data development lifecycles, data analytics, business intelligence, data governance, and/or data-driven decision making.
- Ability to translate business needs into technical priorities and vice versa.
- Ability to view the enterprise data ecosystem holistically and identify systemic issues that affect multiple teams or business areas.
- Familiarity with modern planning and ticketing systems (e.g., ServiceNow, Enterprise One, Monday, Jira, Azure DevOps) and IT project/portfolio management best practices.
- Familiarity with change management frameworks to support adoption of new processes, tools, or prioritization frameworks across data teams.
- Proficiency in leveraging generative AI tools like ChatGPT, Copilot, or similar to enhance IT workflows.

Leadership & Communication

- Strong leadership skills, with the ability to motivate, support, and guide teams in a fast-paced environment.
- Experience with IT portfolio planning, tracking, and senior-level report-outs.
- Ability to set clear expectations and communicate priorities firmly yet diplomatically to stakeholders.
- Experience in negotiating and influencing senior leaders on data priorities and trade-offs.
- Strong facilitation and conflict resolution skills, with the ability to mediate between diverse stakeholders and align differing viewpoints.

Preferred Qualifications

- Experience leading/shaping IT data and continuous improvement teams in a large enterprise environment.
- Experience within Insurance, Financial Services, or a related business domain.
- Familiarity with cloud platforms (AWS, Azure), ITIL frameworks, and Agile methodologies.

Pay Range

- $105,200.00 - $147,950.00

Salary is commensurate with experience, location, etc.

Horace Mann was founded in 1945 by two Springfield, Illinois, teachers who saw a need for quality, affordable auto insurance for teachers. Since then, we’ve broadened our mission to helping all educators protect what they have today and prepare for a successful tomorrow. And with our broadened mission has come corporate growth: We serve more than 4,100 school districts nationwide, we’re publicly traded on the New York Stock Exchange (symbol: HMN) and we have more than $12 billion in assets.

We’re motivated by the fact that educators take care of our children’s future, and we believe they deserve someone to look after theirs. We help educators identify their financial goals and develop plans to achieve them. This includes insurance to protect what they have today and financial products to help them prepare for their future. Our tailored offerings include special rates and benefits for educators.

EOE/Minorities/Females/Veterans/Disabled. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.

For applicants that are California residents, please review our California Consumer Privacy Notice. All applicants should review our Horace Mann Privacy Policy.

United States

$105K - $147K / year

Database Developer

Greater Kansas City Community Foundation

Data Engineer | 142 days ago | Other Remote | Team 51-200

Description

The Database Developer is responsible for designing, building, and maintaining reliable data integration pipelines that power enterprise reporting, analytics, and operational systems. This role focuses on the consistent and secure movement of data across applications, warehouses, and cloud platforms. The Database Developer works across the organization to provide reliable, analytics-ready data that informs effective decision-making.

This is a full-time, exempt, salaried position reporting to the Director of Data Technologies. Candidates must be local to Kansas City, MO, and after a successful training period, there are opportunities to work remotely.

Requirements

Data Pipelines & Integrations

- Design, build, and maintain scalable batch and near-real-time data pipelines.
- Develop integrations using tools such as Azure Data Factory, Fabric Data Factory, SSIS, or equivalent technologies.
- Maintain clear, well-documented data processes that ensure secure, reliable data delivery.
- Ensure pipelines are secure, reliable, and well-documented.

ETL / ELT Development

- Develop structured ETL and ELT processes that support data warehouse models and downstream analytics.
- Partner with staff to ensure data structures align with reporting and semantic model needs.

Scheduling & Automation

- Manage orchestration, scheduling, and dependencies across data workflows.
- Implement automation to improve reliability, monitoring, and recovery from failures.

AI Platform Readiness

- Partner with leadership to evaluate, govern, and leverage AI-enabled capabilities within the data platform ecosystem.
- Leverage AI-assisted tooling to improve platform reliability and operational efficiency.

Data Reliability & Monitoring

- Monitor pipeline execution and proactively identify failure patterns.
- Implement improvements to increase resiliency, observability, and operational predictability.

Source System Integration

- Work with application owners (e.g., financial systems, CRM platforms) to understand data structures, APIs, and integration requirements.
- Maintain clear documentation for data flows, lineage, integration logic, and operational processes.
- Support ingestion of files, APIs, JSON/XML payloads, and database sources.

Education & Experience

- At least 3-6 years of related experience with a bachelor's degree. An equivalent combination of education and experience will be considered.

Required Technical Background

- Hands-on experience developing and supporting ETL/ELT pipelines in Azure, SQL Server, or comparable data environments.
- Strong SQL skills with the ability to troubleshoot data quality, transformation, and performance issues.
- Experience integrating data from APIs, SaaS platforms, files, and relational databases (including JSON and XML payloads).
- Experience managing job orchestration, scheduling, documentation, retries, and failure recovery for data workflows.
- Familiarity with version control (Git) and deployment practices for data pipelines and integration code.
- Ability to monitor pipeline execution and proactively identify and remediate reliability issues.

Preferred Background

- Experience with Azure Data Factory, Fabric Data Factory, SSIS, or similar tools.
- Exposure to cloud data warehouses or analytics platforms.
- Experience working in regulated, audit-conscious, or highly governed environments.

Physical Requirements

- Office & Computer Work: Ability to work regularly at a computer terminal in a fast-paced environment with frequent interruptions.
- Noise & Communication: Able to work in an office with moderate noise levels. Ability to communicate and interpret detailed information effectively.

This job description serves as a summary of the employment-at-will relationship and is not a contract. Responsibilities may evolve, and other duties may be assigned as needed.

United States

Job Closed

Senior Data Engineer

Get the right leads for the right price. Every single time.

Data Engineer | 142 days ago | Full Time Remote | Team 51-200 | H1B Sponsor

- Design and build a modern, scalable data platform on Azure
- Own the end-to-end data lifecycle: ingestion, transformation, storage, and serving
- Build reliable, observable, and maintainable data pipelines
- Work closely with product, analytics, and engineering to turn raw data into trusted datasets
- Make architectural decisions that will shape the platform for years to come
- Help define best practices around data quality, performance, and cost efficiency

Azure, Kubernetes, Python, Apache Spark, SQL

Poland
