Job Closed
This listing is no longer active.
Vida Health is a healthcare technology company offering “virtual care for mental and physical health” via an online platform that seeks to empower people to
Tech Lead, Data Engineering
Location
United States
Posted
97 days ago
Salary
$165K - $175K / year
Seniority
Senior
Job Description
Tech Lead, Data Engineering
Vida Health
• Own delivery end to end from technical specs, phased migrations, deprecation plans and more. • Take architectural plans and turn them into shipped software- sequenced, risk-managed and on schedule. • Design and implement backend services that handle complex business logic- configuration management, workflow orchestration, data validation and integrations across internal and external systems. • Design APIs, model data, write tests and review pull requests that make the whole team better. • Collaborate on architecture, define domain boundaries, set API standards and bring your own ideas about how to solve hard problems. You will have real influence over how these systems evolve. • Pilot tools where they help, document patterns and scale what works. AI assists; engineers remain accountable for correctness, security and privacy. • Define testing strategies, establish observability standards and strengthen runbooks. • Mentor engineers through code reviews, pairing and technical guidance. • Partner with Product, Partnerships and Clinical Operations to understand requirements and partner-facing implications. • Coordinate with Data/AI on integration points.
Job Requirements
- Bachelors Degree at a minimum.
- 6+ years of backend engineering experience (or equivalent), with at least 2 years in a technical leadership role- Lech Lead, Senior Engineer leading a team or similar.
- Strong Python fluency. You’ve built and operated production services in Python- ideally with FastAPI or similar async frameworks.
- Deep relational database experience. You understand schema design, indexing strategies, query performance, data integrity constraints and migration patterns; PostgreSQL preferred.
- Experience designing and building APIs that other teams depend on. You think about contracts, versioning, backward compatibility and clear error semantics.
- Solid understanding of data modeling for complex business domains. You’ve worked in systems where the domain logic is the hard part, not just the infrastructure.
- Experience with service-oriented or event-driven architectures. You’ve built or decomposed systems that communicate through well-defined interfaces and domain events.
- You’ve built systems where events can arrive out of order or twice- and you design for idempotency, retries and safe backfills.
- You’ve worked on systems with complex state machines, workflow orchestration or multi-step business processes.
- You think about schema evolution, backward compatibility and how systems change over time without breaking consumers.
- You can take a technical spec or architecture document and turn it into a clear execution plan with milestones, dependencies and risk callouts.
- You’ve led small teams (3-8 engineers) through complex projects and shipped on time. You know how to scope aggressively and cut scope wisely.
- You default to pragmatism over perfection. You know when to build it right and when to build it now.
- You communicate technical complexity clearly to both engineers and non-technical stakeholders.
- You’re opinionated about code quality but not dogmatic. You care about readability, testability and maintainability.
- You’re excited about AI-assisted development and actively use modern tools to improve your own workflow.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Lead the design and evolution of scalable data architectures, supporting batch, streaming, and AI-driven workloads. • Own end-to-end data pipelines —from ingestion and transformation through to serving analytics and ML/GenAI use cases. • Define and enforce data engineering standards across modelling, orchestration, observability, and reliability. • Mentor and guide data engineers through code reviews, design discussions, and architectural decisions. • Translate business problems into scalable technical solutions, balancing speed, quality, and long-term maintainability. • Drive the use of agent-based solutions across the development lifecycle, designing autonomous and semi-autonomous workflows that deliver measurable business value. • Clearly document architectures and workflows to support shared understanding and operational excellence. • Build and optimize data pipelines using Databricks, Spark (PySpark), Snowflake, Apache Airflow, and Terraform. • Design performant data models and lakehouse structures (Delta, Unity Catalog) for analytics and downstream AI consumption. • Leverage AWS-native services (e.g. S3, EMR, DynamoDB) to deliver cost-efficient, production-grade solutions. • Implement robust data quality, testing, and monitoring (e.g. Great Expectations, logging, alerting). • Design data pipelines that power Generative AI applications, including data preparation, enrichment, and feature generation. • Integrate 3rd party APIs into data workflows for use cases such as: Automated data enrichment and classification Intelligent summarization and insight generation Metadata generation and semantic search enablement AI-assisted reporting and decision support. • Collaborate with ML and Product teams on prompt design, evaluation, and governance, ensuring responsible and reliable AI usage.
Senior Clinical Data Research Engineer
CytovaleCytovale is a biotechnology company headquartered in San Francisco, California, focused on revolutionizing early sepsis detection through its FDA‑cleared Inte
The Senior Clinical Data Research Engineer will be an individual contributor in the Medical Affairs department. The role will encompass data analysis and curation from multiple large clinical datasets to inform internal reports, external presentations, manuscripts, and publications, as well as to guide future clinical trial design. This is a remote role. Interpret clinical case reports and develop an understanding of clinical science of immune-mediated conditions, including sepsis, to inform study design and content of case report forms. Support the design, interpretation, reporting, and publication of clinical studies, including detailed participation in clinical endpoint design and process, supporting EDC builds, and study execution. Perform data analysis and develop data-driven models to track disease and outcome trends, assess value propositions, and evaluate assay clinical utility. Support quality improvement activities for customers by building systems and tools for post-implementation data analysis. Utilize data to track the performance and effectiveness of the IntelliSep solution in improving clinical outcomes, operational efficiency, and financial performance, and provide insights into customer-related metrics and the potential impact on patient outcomes and hospital reimbursement. Collaborate with cross-functional teams to gather data and gain insights into current-state workflows and performance related to sepsis management and clinical workflows within the emergency department. Appropriately apply visualization best practices and data storytelling techniques and deliver a clear and concise presentation of findings tailored to the audience. Develop documentation and methodologies for analyses and deliverables. Develop statistical models using clinical and biological data to inform clinical trial design. Write statistical analysis plans, including statistical methodology and programming procedure. Contribute analysis and graphs to educational and marketing materials, company reports, and scientific publications.
• Design and administer data systems: Implement cloud-native data solutions across cloud services such as AWS Glue, AWS Lambda, AWS Step Functions, Redshift, Aurora, Amazon S3, MWAA-Airflow. • Design and implement scalable data architectures using AWS cloud services, including data lakes, data warehouses, bulk data ingestion, and transaction processing. • Collaborate with customers and teammates to comprehend data requirements and translate business needs into well-designed data models. • Assemble large, complex data sets that meet functional business requirements. • Stay current with AWS services and recommend suitable tools for specific data engineering tasks. • Develop and support the internal applications using Python, SQL, and Stored Procedures. • Build reliable data processes: Develop, implement, and maintain ETL processes to ingest, clean, and transform complex health care data into our application data platforms. • Automate manual processes and to optimize data delivery and reduce operational friction. • Optimize data pipeline performance: Monitor and tune data pipelines and data storage for performance, cost, and reliability. • Implement caching mechanisms and data partitioning strategies to enhance query efficiency and reduce data processing times. • Lead and collaborate company-wide: Provide technical guidance and mentorship within the data engineering team and foster a collaborative and innovative environment. • Partner closely with product and analytics team to understand business goals and engineer solutions to the right problems. • Protect and govern data responsibly: Implement and enforce data security measures and maintain data governance standards and access controls to protect sensitive data. • Ensure compliance with health care data regulations and industry best practices.
Principal GTM Data Engineer – Architect
Tempo SoftwareAdaptive SPM for AI-Accelerated Innovation | Modular Solutions, Compounding Value | 30,000+ Customers
• Architect and implement the unified GTM data spine across CRM, marketing automation, product telemetry, billing, enrichment providers, partner data, and the warehouse. • Establish identity resolution, deduplication, and source-of-truth logic across systems. • Define canonical schemas that we’ll use to operate the business, enable AI across teams and drive results. • Build scalable ELT/ETL pipelines and orchestration workflows tailored to revenue activation. • Implement governance, lineage, access control, and quality monitoring for GTM data assets. • Partner closely with the BI / enterprise data team to align on shared infrastructure while owning GTM-specific models and activation layers. • Develop and productionize segmentation and scoring models (ICP scoring, ABM prioritization, expansion propensity, trial health, pipeline likelihood). • Apply statistical and machine learning techniques to improve scoring, targeting, routing, and revenue predictability. • Design experimentation and measurement frameworks for GTM programs. • Operationalize predictive outputs directly into GTM systems via agents, reverse ETL, or custom integration (Salesforce workflows, outbound automation, lifecycle programs, ABM platforms, partner programs). • Architect customer and prospect intelligence systems integrating first-party, enrichment, and ecosystem signals. • Develop frameworks for ecosystem intelligence (partner influence, marketplace signals, derived demand). • Enable real-time or near-real-time signal activation to sales and marketing teams. • Design structured data systems that power future AI-driven revenue workflows. • Manage and direct external agency/consulting partners executing against the GTM data roadmap. • Establish architectural standards and technical review processes. • Define the build vs. buy strategy for GTM data systems. • Develop the long-term roadmap for in-house data capability.



