Job Closed

This listing is no longer active.

SmithRx

SmithRx is a tech-forward PBM committed to changing the way pharmacy benefits are managed.

Staff Data Engineer

Data EngineerData EngineerFull Time Remote LeadTeam 51-200Since 2018H1B No SponsorCompany Site LinkedIn

Location

United States

Posted

110 days ago

Salary

Seniority

Lead

6 yrs expEnglishAirflow Apache HTTP Server ETL PySpark Python Apache Spark SQL

Job Description

• Build Resilient Systems: Design flexible, difficult-to-misuse software components and data pipelines. Reduce complex designs into simple foundational components to proactively avoid scaling issues and lower maintenance costs. • Manage Cross-Boundary Integrations: Create coherent ecosystem designs across multiple pipelines and API boundaries. Ensure reliable feature rollouts with appropriate monitoring, paging, and well-characterized failure domains. • Improve Developer Efficiency: Continuously look for ways to simplify code and infrastructure. Implement solutions that measurably improve developer efficiency (e.g., cycle time, ramp-up time) for your team. • Drive Large-Scale Projects: Autonomously define and deliver technical roadmaps for larger projects with cross-team dependencies. Execute against tight deadlines while preemptively identifying and resolving technical risks. • Unblock the Team: Assess and eliminate the root causes of technical and process barriers. Own your decisions and mistakes, taking action to prevent them in the future and sharing learnings widely. • Align Technical Output with Business Value: Partner with your manager to define team priorities based on SmithRx’s business strategy. Design clear success metrics for systems and achieve them consistently post-launch. • Cross-Functional Collaboration: Break down silos within and across functions. Write crisp narratives to create understanding, get buy-in, and enable effective decision-making among stakeholders. • Navigate Ambiguity: Act thoughtfully in critical situations. Balance conflicting perspectives, encourage productive debate, and commit to moving key company priorities forward even when making unpopular decisions. • Mentor & Develop Talent: Serve as a role model and technical guide for less-experienced team members (SDE1/SDE2), tailoring your coaching and feedback to their specific skills and working styles. • Champion Culture & Hiring: Actively participate in the hiring process for senior candidates and managers (interviews, debriefs, 1:1 sell chats). Foster psychological safety, encourage a growth mindset, and consistently personify SmithRx’s core values.

Job Requirements

6+ years of industrial experience in software or data engineering. (Start-up and healthcare experience is highly desirable).
Strong programming skills in PySpark, SQL, and Python. Hands-on experience with leading ETL tools and frameworks (e.g., Apache Spark, Apache Airflow, dbt, Looker, Superset).
Demonstrated expertise in data warehouse technologies (e.g., Snowflake) and mastery of data modeling concepts through production-grade implementations.
A proven track record of successfully executing complex projects with cross-team dependencies from design to post-launch monitoring.
The ability to tailor technical messaging to various audiences, explaining how business priorities inform engineering choices.

Benefits

Highly competitive wellness benefits including Medical, Pharmacy, Dental, Vision, and Life Insurance and AD&D Insurance
Flexible Spending Benefits
401(k) Retirement Savings Program
Short-term and long-term disability
Discretionary Paid Time Off
12 Paid Company Holidays
Wellness Benefits
Commuter Benefits
Paid Parental Leave benefits
Employee Assistance Program (EAP)
Well-stocked kitchen in office locations
Professional development and training opportunities

Related Categories

Data Engineer

Related Job Pages

Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More Data Engineer Jobs

Senior Staff Data Engineer

SmithRx

SmithRx is a tech-forward PBM committed to changing the way pharmacy benefits are managed.

Data Engineer110 days ago

Full Time RemoteTeam 51-200Since 2018H1B No Sponsor

Company Site LinkedIn

• Define Technical Vision: Define the long-term data engineering strategy, factoring in company-wide priorities, customer needs, and system limitations. • Design Ecosystems: Create coherent designs across multiple pipelines and API boundaries. Reduce complex concepts to foundational components and simplify infrastructure to lower maintenance costs. • Drive Engineering Excellence: Make high-impact technical choices—including "build vs. buy" and framework selections—and provide clear rationale to rally the team. Review cross-team designs to preemptively identify and resolve technical risks before they jeopardize projects. • Improve Developer Efficiency: Implement solutions that measurably improve developer efficiency (e.g., cycle time, ramp-up time) and establish engineering-wide quality and best practices. • Deliver at Scale: Roll out major features and systems reliably, ensuring appropriate monitoring, failure domain characterization, and clear success metrics are established post-launch. • Align with Business Goals: Leverage a deep understanding of SmithRx’s business strategy to identify group-wide opportunities. Proactively refocus team efforts when projects are off-course or not moving the needle for the business. • Manage Data Quality & Governance: Enforce data governance policies (PII/PHI protection, security, compliance) and implement data quality principles to raise the bar for the reliability of data shared internally and externally. • Influence Cross-Functionally: Influence the roadmaps of other SmithRx teams. Act thoughtfully and decisively in critical situations, seeking diverse perspectives but ultimately leading decision-making (disagree and commit) to move priorities forward. • Mentor and Develop: Serve as a role model and coach for SDE2/SDE3 engineers, taking into account their unique skills and providing constructive feedback to maximize their impact. • Executive Communication: Develop focused messaging and effectively present technical strategies and business cases at the executive level. • Champion Organizational Change: Break down silos, build deep cross-functional relationships, and create excitement to drive the adoption of new technologies or processes across the organization.

Airflow Apache HTTP Server Distributed Systems ETL Java PySpark Python Scala Apache Spark SQL

View details: Senior Staff Data Engineer

United States

Apply

Data Engineer

LM IT Services AG

Full-service provider in the range of training, consulting, digital transformation, development & event management.

Data Engineer110 days ago

Full Time RemoteTeam 51-200Since 1994H1B No Sponsor

Company Site LinkedIn

• Interface to the data landscape: you integrate REST services and extract data from a wide range of sources. • Modern data engineering: you design and automate data pipelines and process and transform data using Apache Spark and Spark Notebooks (PySpark/SparkSQL). • Data modeling: you prepare data for analysis and take care of semantic modeling. • Customer focus: you analyze customer requirements and develop appropriate solutions together with the team. • Teamwork & community: you support junior consultants and working students, contribute to community projects, and share your knowledge, e.g. in blog posts.

Apache HTTP Server AWS Azure ETL PySpark Python Apache Spark

View details: Data Engineer

Germany

Apply

Data Engineer

ultima milla

Logistic Management System for E-commerce & Retail in Mexico. Raised +$7M USD from Y Combinator, FJLabs, & more.

Data Engineer110 days ago

Full Time RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

• Recolectar datos de diversas fuentes, incluyendo bases de datos, archivos CSV y APIs, utilizando herramientas ETL para realizar limpieza y preprocesamiento. Eliminar valores atípicos, gestionar nulos y preparar datos para análisis eficiente. • Diseñar y construir pipelines de datos eficientes utilizando herramientas como Matillion, garantizando la automatización en la recolección, transformación y carga (ETL) de datos. • Integrar y consumir datos de APIs externas para enriquecer conjuntos de datos y mejorar la calidad de la información. • Contribuir al diseño de arquitecturas eficientes para el manejo de grandes volúmenes de datos, utilizando tecnologías como Python y SQL.

ETL Matillion NumPy Oracle Database Pandas Python SQL SSIS

View details: Data Engineer

Colombia

Apply

Job Closed

Data Engineer Ssr / Sr — Python, AWS, Terraform, Pinecone

OneSeven Tech

Founded by James Sullivan, OneSeven Tech is a premier digital product agency serving startups and enterprises. Our clients have collectively raised over $100M in VC, and our enterprise partners include 2,000+ person hospitality groups and NASDAQ-listed companies. We work fully remote, move fast with 1-week sprints, and focus on elegant solutions to complex problems.

Data Engineer110 days ago

Full Time RemoteTeam 11-50

Location: Remote — Argentina, Colombia, México, Chile, Uruguay Job Type: Full-time contract Date Posted: March, 2026 Project Brief We're building a cloud-native data infrastructure powering AI-driven products — from ingestion pipelines and vector search to MCP-based integrations and multi-tenant architectures. You'll own the design, deployment, and reliability of data pipelines end-to-end, working across AWS, Terraform, and Pinecone in a fast-moving environment where architectural decisions matter and ownership is real. Must-Haves - Strong Python skills — Lambda functions, scripting, API consumption, and data processing pipelines - Hands-on AWS experience across: Lambda, S3, API Gateway, ECS/Fargate, Secrets Manager, Cognito, CloudWatch, and SQS + DLQ - Terraform for infrastructure-as-code — remote state with S3 + DynamoDB, modular resource definitions, and multi-environment management - Docker — containerization, Dockerfile hardening, and ECS task image builds - Pinecone — index creation, namespace design for multi-tenancy, metadata schema, upsert logic, and hybrid search configuration - Experience with embedding models and ETL/data transformation pipelines - MCP protocol experience — FastMCP SDK, Streamable HTTP transport, tool definition - REST API consumption — Cognito-authenticated requests, response parsing, and S3 persistence - Git/GitHub — trunk-based branching, semantic PRs, and branch-per-feature workflows - JSON schema design for pipeline service contracts and API response structures - Retry and error handling patterns — exponential backoff and failure architecture - Clear English communication and ability to work independently with minimal supervision Nice to Have - Incremental update and change detection logic for pipeline efficiency - CloudWatch alarm design and pipeline health dashboards - Content-hash deduplication for embedding cost optimization - Lambda memory and concurrency tuning - VPN site-to-site configuration - LangChain / LangGraph experience - Prompt engineering fundamentals - GitHub Actions — CI/CD pipeline setup and deployment automation - Marketing Business Acumen What You Will Do - Design, build, and maintain cloud-native data pipelines across AWS Lambda, S3, ECS/Fargate, and SQS - Manage and evolve infrastructure-as-code using Terraform across multiple environments - Build and optimize vector search workflows in Pinecone — including multi-tenant namespace design, metadata schemas, and hybrid search - Develop and improve MCP server integrations using FastMCP SDK and Streamable HTTP transport - Implement and harden authentication flows using AWS Cognito and Secrets Manager - Monitor pipeline health via CloudWatch — log groups, alarms, and observability for Lambda and ECS - Apply cost modeling discipline — flag and mitigate high-cost operations (e.g. embedding calls at scale) - Work within an established SA governance model, contributing to architectural decisions with clear documentation and standards - Collaborate with engineering and applied science teams to deliver reliable, scalable data infrastructure Why This Could Be Your Next Big Move - 🧱 Own the data foundation — You're not maintaining legacy systems. You're designing the infrastructure that AI products are built on top of. - 🤖 Work at the AI/data intersection — Vector search, embeddings, MCP integrations, and multi-tenant architectures. This is where data engineering and AI meet. - ☁️ Deep AWS stack — Lambda, ECS, Cognito, SQS, Secrets Manager and more — a real opportunity to broaden and deepen your cloud expertise across a full production stack. - 🔝 High ownership, high impact — Fast-moving environment where your architectural decisions shape the product directly, with visibility across the entire engineering org. Benefits & Compensation - 💵 $[3,500] – $[5,000]/month — paid in USD, bi-weekly via Deel - 🌎 Fully Remote — work from anywhere in Latin America - 📄 Full-time contract with a U.S. company — 6-month initial term, full-time conversion - 🏖️ Paid PTO — competitive package, grows with tenure - 🤝 Referral Program — earn a bonus for referring talent that gets hired To Apply Please send your resume in English. Include the following depending on your role: - 🔗 LinkedIn Profile URL (required) - 💻 GitHub Portfolio or code samples — show us your pipeline and infrastructure work (required) - ✉️ Cover Letter — tell us about a data pipeline or cloud architecture you built and the impact it had (optional but encouraged) About OneSeven Tech Founded by James Sullivan, OneSeven Tech is a premier digital product agency serving startups and enterprises. Our clients have collectively raised over $100M in VC, and our enterprise partners include 2,000+ person hospitality groups and NASDAQ-listed companies. We work fully remote, move fast with 1-week sprints, and focus on elegant solutions to complex problems. Salary: 3500 - 5000 USD Per month

View details: Data Engineer Ssr / Sr — Python, AWS, Terraform, Pinecone

Argentina

$3.5K - $5K / month

Apply

Staff Data Engineer

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Data Engineer Jobs

Senior Staff Data Engineer

Data Engineer

Data Engineer

Data Engineer Ssr / Sr — Python, AWS, Terraform, Pinecone