Job Closed
This listing is no longer active.
We are changing how the world lives and ages at home.
Manager, Data Platform – Engineering
Location
Texas
Posted
38 days ago
Salary
0
Seniority
Senior
Job Description
Manager, Data Platform – Engineering
TheKey
• Contribute to and execute a roadmap that positions TheKey's data platform as AI-ready — well-structured, richly documented, and accessible for machine learning and generative AI use cases. • Champion the adoption of AI and automation tools within the data engineering team to increase delivery velocity and reduce manual effort. • Partner with business and technology stakeholders to align platform capabilities with analytics and AI initiatives. • Lead the design and governance of a metadata-rich data lake in Google BigQuery, ensuring datasets are tagged, documented, and contextualized. • Establish and enforce standards for data cataloging, semantic tagging, lineage tracking, and business glossary definitions. • Drive adoption of tools such as Google Dataplex or equivalent for automated metadata management and data quality enforcement. • Build and maintain scalable data pipelines, leveraging AI-assisted development tools to accelerate development and reduce errors. • Implement AI-driven testing and observability frameworks to automatically validate pipeline outputs, detect anomalies, and enforce data quality. • Define and enforce data governance frameworks that support both regulatory compliance and AI readiness. • Own master data management practices to ensure accuracy, consistency, and a single source of truth for critical business entities. • Directly manage a team of data engineers – providing mentorship, clear expectations, and career development support.
Job Requirements
- Bachelor's or Master's degree in Computer Science, Information Systems, Data Engineering, or a related field.
- 5+ years of experience in data engineering, with at least 1–2 years in a team lead or people management capacity.
- Hands-on expertise with Google BigQuery and the broader Google Cloud data ecosystem (Dataflow, Pub/Sub, Dataplex, Looker, etc.).
- Demonstrated experience building and managing cloud-native data pipelines and ELT/ETL processes at scale.
- Working knowledge of AI/ML workflows and what it takes to prepare data for model training, RAG pipelines, or AI-powered analytics.
- Experience implementing data catalogs, metadata frameworks, and data lineage tooling.
- Familiarity with AI-assisted development tools and a track record of integrating them into engineering workflows.
- Strong data governance background including security, compliance, and quality frameworks.
- Strong communication and stakeholder management skills.
Benefits
- Medical/Dental/Vision Insurance
- TouchCare VirtualCare
- Life Insurance
- Health Savings Account
- Flexible Spending Account
- 401(k) Matching
- Employee Assistance Program
- PTO Plan for Non-Exempt Employees
- Flexible PTO Plan for Exempt Employees
- Holidays and Floating Holidays
- Pet Insurance
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Lead the design, development, and optimization of scalable data solutions. • Define data architecture strategies and ensure best practices are followed. • Oversee and guide the creation and automation of data pipelines and platforms. • Establish and enforce data quality and governance frameworks. • Collaborate with Architects, Product Owner, Data Scientists, and DevOps to align data solutions with business needs. • Research and evaluate emerging data technologies and methodologies. • Ensure seamless integration of data management solutions into client environments. • Develop risk mitigation strategies and implement data recovery plans. • Lead the development of data repositories, including data warehouses, data lakes, and operational data stores. • Mentor and support the development of junior and senior team members.
Staff Data Engineer
NateraFounded in 2004 and led by CEO Steve Chapman, Natera is a company in the biotechnology market that offers genetic testing and diagnostics on a global scale. Ope
• Design, develop, and maintain robust, scalable, and secure data pipelines for ingesting, transforming, and validating clinical real-world data (RWD). • Implement software solutions using programming languages (e.g., Python, SQL) and cloud technologies (e.g., AWS). • Collaborate with software/quality engineers/analysts and product managers. • Communicate testing progress, risks, and quality metrics. • Analyze and fix defects identified within applications, services, or data pipelines. • Write SQL, Python, ELT code for querying clinical and genomic data. • Create data marts and perform data transformation and de-identification tasks. • Develop scalable data pipelines and processing workflows using AWS services. • Experience with Apache Airflow for workflow orchestration. • Apply version control (Git), CI/CD pipelines, and agile methodologies. • Support staff recruitment and onboarding, providing mentorship.
• Develop, maintain and evolve data ingestion and transformation pipelines following a medallion architecture; • Build and monitor integrations between the data lake and external applications (Octadesk, Salesforce and others); • Implement observability mechanisms for proactive failure detection, performance monitoring and cost control; • Collaborate with product, BI and engineering teams to understand needs and design scalable data solutions; • Document processes, pipelines and technical decisions to facilitate maintenance and onboarding of new team members.
Senior Data Engineer, Databricks
Codvo.aiBuilding Advance AI & Cloud Native Software Using The "Virtual Silicon Valley" Model. Let’s Talk AI, Cloud and Outcomes.
• Design, develop, and maintain scalable data pipelines using Databricks and Python • Build and optimize ETL workflows for structured and unstructured data • Work with Databricks Lakehouse architecture and implement best practices • Manage and implement data governance using Unity Catalog • Integrate AWS data services such as S3, IAM, VPC, and other relevant services • Collaborate with cross-functional teams to deliver high-quality data solutions • Build dashboards and support data visualization requirements • Ensure data quality, reliability, and performance optimization




