Job Closed
This listing is no longer active.
The world's largest online marketplace for authenticated, consigned luxury goods. Nasdaq listed: REAL #TheRealReal
Senior Data Engineer
Location
California
Posted
78 days ago
Salary
$147.5K - $163.9K / year
Seniority
Senior
Job Description
Senior Data Engineer
The RealReal
• Design, develop, and maintain scalable, high-performance data infrastructure • Build reliable, reusable services and APIs for data platform interaction • Build software tools for monitoring and managing ML infrastructure • Collaborate with senior management and other engineers in the development of data products • Develop tools to monitor, debug, and analyze data and ML pipelines • Design and implement data schemas and models that can scale • Mentor team members to build overall expertise
Job Requirements
- At least 5 years of proven experience as a Data Engineer or MLOps Engineer
- Strong programming skills in languages such as Python, Java or Scala
- Experience with cloud platforms (GCP, AWS, AZURE)
- Experience with BigQuery or similar (Redshift, Snowflake, other MPP databases)
- Hands-on experience with ML frameworks (TensorFlow, PyTorch)
- Practical experience with containerization and orchestration tools (Docker, Kubernetes)
- Experience with feature stores and feature engineering best practices
- Experience building data pipelines & ETL
- Excellent communication and collaboration skills
Benefits
- Employee Stock Purchase Plan
- 401K with Company Match
- Medical, Dental & Vision Insurance
- Paid Parental Leave
- 9 Paid Company Holidays
- Flexible Time Off (With Manager Approval)
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Develop and maintain scalable data pipelines to support business requirements. • Collaborate with cross-functional teams to gather and analyze data requirements. • Implement data integration solutions using ETL tools and technologies. • Optimize and improve data processing workflows to enhance performance. • Ensure data quality and integrity through rigorous testing and validation.
• Architect and maintain cutting-edge data systems that power analytics, AI, and operational decision-making. • Take ownership of end-to-end data lifecycles, designing pipelines, models, and architectures that support real-time insights and machine learning at scale. • Build modern, cloud-native data platforms (AWS, Snowflake, Databricks) supporting batch and streaming use cases. • Automate ETL/ELT workflows, optimize data models, and enable self-serve analytics and AI. • Manage ingestion, storage, processing, and delivery of structured and unstructured data. • Continuously tune infrastructure for high concurrency, low latency, and cost efficiency. • Ingest telemetry, API, and application data in real time to power dashboards and AI-driven tools. • Provision datasets for ML/AI workloads, integrating with SageMaker, Snowflake ML, and MLOps best practices. • Ensure robust data governance, compliance (GDPR, SOC 2), and enterprise-grade security. • Work closely with Product, Engineering, DevOps, and Analytics teams to align data solutions with business goals.
Data Engineer
ATPCOATPCO is committed to providing the best flight shopping experiences through reliable pricing data and innovative retail technology. Positioning itself as "the foundation of modern
• Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and design data models and schemas that facilitate data analysis and reporting • Design, develop, and maintain scalable and efficient data pipelines and ETL processes to ingest, process, and transform large volumes of data from various sources into usable formats • Build and optimize data storage and processing systems, including data warehouses, data lakes, and big data platforms, using AWS services such as Amazon Redshift, AWS Glue, AWS EMR, AWS S3, and AWS Lambda, to enable efficient data retrieval and analysis • Implement and manage real-time data streaming architectures using AWS services like Amazon Kinesis or Apache Kafka to enable real-time data processing and analytics • Perform data profiling, data cleansing, and data transformation tasks to prepare data for analysis and reporting • Implement data security and privacy measures to protect sensitive and confidential data using AWS security services and features • Design and implement data architectures following Data Mesh principles within the AWS environment, including domain-oriented data ownership, self-serve data infrastructure, and federated data governance • Provide technical guidance and mentorship to junior data engineers, reviewing their work and ensuring adherence to best practices and standards
Senior Data Engineer – Healthcare Domain
Sigma Software GroupWe support enterprises, product houses, and startups with custom software solutions development and IT consulting.
• Collaborate with the Product Owner and team leads to define and design efficient pipelines and data schemas • Build and maintain infrastructure using Terraform for cloud platforms • Design and implement large-scale cloud data infrastructure, self-service tooling, and microservices • Work with large datasets to optimize performance and ensure seamless data integration • Develop and maintain squad-specific data architectures and pipelines following ETL and Data Lake principles • Discover, analyze, and organize disparate data sources into clean, understandable schemas

