Come join the movement....we are a vehicle to healthy living!
Senior Data Engineer
Location
United States
Posted
145 days ago
Salary
$115.9K - $184.3K / year
Seniority
Senior
Job Description
Senior Data Engineer
iHerb, LLC
• Designs and builds scalable data extracts, integrations, transformations, and data models. • Ensures successful deployment and provisioning of data solutions across required environments. • Designs and implements data architectures and applications that enable speed, quality, and operational efficiency. • Interacts with cross-functional stakeholders to gather and define requirements and translate them into technical designs. • Develops deep familiarity with enterprise datasets, builds domain knowledge, and advances data quality. • Reviews requirements, identifies gaps, and drives resolution with stakeholders. • Identifies and recommends continuous improvement opportunities, ensuring integrations are automated, governed, and observable. • Serves as a key team member in designing and deploying a ground-up cloud data platform and pipeline. • Partners with data scientists to design, build, and maintain reproducible machine-learning pipelines. • Implements CI/CD for data and ML workflows. • Builds and maintains production-grade ML infrastructure such as feature stores, model registries, and data versioning. • Ensures ML models follow best-practice governance, including automated model performance monitoring and alerting. • Designs scalable data pipelines optimized for ML workloads. • Establishes MLOps standards, coding practices, and automation patterns that scale across teams.
Job Requirements
- Bachelor or Master`s degree in technical discipline such as Computer Science, Information Systems or another technical field
- 5+ years of experience as a Data Engineer within a data and analytics environment.
- Expertise with Databricks and other cloud data warehousing solutions such as S3, Redshift, or BigQuery.
- Hands-on experience building data pipelines and ETL/ELT workflows using PySpark for semi-structured data.
- Advanced knowledge of Python and advanced working SQL skills including query optimization.
- Ability to write, test, and debug RESTful APIs.
- Experience working in agile, cross-functional environments.
- Strong analytical, problem-solving, and critical-thinking capabilities.
- Strong communication skills with the ability to present complex concepts clearly.
- Experience in data quality initiatives such as Master Data Management (MDM).
- Experience operationalizing machine-learning models in production environments.
- Hands-on experience with ML tooling such as MLflow, SageMaker, Databricks ML, Kubeflow, or similar.
- Experience implementing CI/CD pipelines for data and ML workloads, including automated testing, deployment pipelines, and environment configuration.
- Understanding of model lifecycle management, data versioning, feature store design, and model monitoring concepts.
Benefits
- Employees (and their families) that meet eligibility criteria as outlined in applicable plan documents are eligible to participate in our medical, dental, vision, and basic life insurance programs and may enroll in our company’s 401(k) plan.
- Employees will also be eligible for Time Off and Paid Sick Leave pursuant to the company’s policies.
- Employees will enjoy paid holidays throughout the calendar year.
- Hired applicant may be awarded Restrict Stock Units and receive annual bonuses pursuant to eligibility and performance criteria defined in the respective plan documents and policies.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Build infrastructure for ingestion, transformation, and loading an exponentially increasing volume of data from a variety of sources using Spark, SQL, AWS, and Databricks • Building an organic entity resolution framework capable of correctly merging hundreds of billions of individual entities into a number of clean, consumable datasets. • Developing CI/CD pipelines and anomaly detection systems capable of continuously improving the quality of data we're pushing into production. • Dreaming up solutions to largely undefined data engineering and data science problems.
Ingeniero de Datos
Metova, Inc.Helping companies transform their business through technology to meet the growing expectations of their customers.
• Design, develop, and maintain data pipeline architectures. • Optimize data ingestion, storage, and processing workflows. • Collaborate with data scientists and analysts to understand data needs and convert requirements into technical specifications. • Ensure data quality and integrity throughout the data lifecycle. • Implement data security and compliance measures. • Monitor and troubleshoot data systems performance issues. • Stay up-to-date with industry trends, technologies, and best practices in data engineering.
• Enable efficient data access by creating and maintaining data pipelines. • Collaborate with ML engineers to design and maintain automation for machine learning training, quality assessment, and model release process. • Build data infrastructure from the vast amount of data for analytics, hypothesis testing and company metrics. • Identify, design and implement improvement to internal processes allowing to optimize data delivery, automate manual processes. • Design new and improve current patterns for building data models and implement necessary modifications.
Big Data Engineer
Sigma Software GroupWe support enterprises, product houses, and startups with custom software solutions development and IT consulting.
• Develop and maintain ETL pipelines and data integration services using Python and SQL • Work with AWS services (S3, DynamoDB, Lambda, Glue) and NoSQL databases (MongoDB, DynamoDB) • Design, optimize, and validate data flows, ensuring data quality across systems • Collaborate with Senior engineers on architecture and performance improvements • Troubleshoot production data issues and perform root cause analyses • Contribute to the continuous improvement of development practices and performance monitoring




