Mental health, built around you.
Sr. Data Engineer, AI/ML
Location: United States
Posted: 80 days ago
Salary: $142K - $176K / year
Seniority: Senior
Job Description
Octave
About the Company:
Octave is a modern behavioral health practice creating a new standard for care delivery that’s both high-quality and accessible. With in-person and virtual clinics in multiple states, the company offers evidence-based individual, couples, and family therapy, while pioneering relationships with payers to make care more affordable through insurance. By raising the bar on how care is delivered and how providers are supported, we are building a sustainable system that values equity, affordability, and effectiveness.

About the Role:
We’re looking for a Sr. Data Engineer with strong data platform experience to help evolve our modern data stack and contribute to the foundation of our emerging AI and ML platform. This role sits at the intersection of data engineering, platform architecture, and machine learning enablement, and will bring high-quality, scalable, and ethical AI into real-world use. You will partner closely with data scientists, analysts, and product managers to ensure our platform supports reliable data pipelines, scalable analytics, and production-ready machine learning systems. You will also define new architecture, best practices, and patterns for fellow engineers to inherit. The ideal candidate is both a systems thinker and a hands-on builder who thrives in evolving environments and is passionate about creating reliable data infrastructure that enables peers and partner teams to move faster with data.

Responsibilities Include:
- Design, build, and maintain scalable systems for ingestion, transformation, and storage of data, with a focus on testing and observability.
- Implement frameworks, tooling, and automation to safely increase development velocity.
- Develop foundational end-to-end AI/ML workflows spanning (1) source ingestion and preparation, (2) training and tuning, (3) experimentation and productionization, and (4) downstream systems integration (EHR modules, micro-services, dashboards).
- Support iterative model development and production operations and observability (accuracy, drift, bias, fairness, reproducibility).
- Contribute to a culture of continuous improvement, knowledge-sharing, and mentoring of peer engineers.

Preferred Qualifications:
- Bachelor’s degree (or equivalent) in Computer Science, Data Science, Statistics, Engineering, or a related field.
- 5+ years of experience in data engineering, platform engineering, or ML engineering.
- Experience working with major cloud data platforms and tools:
  - Preferred experience: healthcare, behavioral health, EHR systems, and/or regulated industries.
  - Specific expertise with: AWS/GCP, dbt, Airflow, Airbyte, Redshift/BigQuery.
- Proficiency in SQL and Python with strong familiarity with modern data engineering frameworks, infrastructure, and tooling.
- Proficiency with data ops best practices, monitoring, pipeline automation, and CI/CD.
- Knowledge of modern compute and ML frameworks/libraries (e.g., Spark, TensorFlow, PyTorch, scikit-learn).
- Ability to build production APIs and services, inclusive of MCP servers that expose internal data/services to LLMs.
- A collaborative mindset, dependable execution, drive to reflect and improve, and humility to ask questions and learn.

Octave's Company Values:
The values below drive our day-to-day operations.
- We’re human beings first. We operate with empathy and kindness – with our clients, with our collaborators, and with ourselves.
- People deserve better than status quo. We’re willing to tackle the intractable problems, no matter how big, because someone should. We ask big questions, we craft big solutions, and we challenge ourselves and others to make it happen.
- No bystanders. No stars. No tourists. Each person has been selected to be here, and with that comes a responsibility to bring your expertise, share your ideas, and help make this company better.
- Partnership paves the path ahead. We don’t operate in a silo, internally or externally. To transform the system, we believe in working with others to create something bigger, better, and stronger.
- Quality is crucial at scale. Quality is core to our business, and we refuse to sacrifice it as we grow.
- Progress is a process. In the pursuit of progress, we iterate, reflect, learn, adjust – and always leave things better than we found them.
- There are people behind every data point. We recognize that numbers tell only one part of the story, and we also do the work to understand impacts at the individual level.

Physical Requirements:
- Prolonged periods sitting at a desk and working on a computer.
- Must be able to frequently communicate with others through virtual meeting applications such as Zoom and Google Meet.
- Must be able to observe and communicate information on a company-provided laptop.
- Move up to 10 pounds on occasion.
- Must be eligible to work in the United States without sponsorship now or in the future.

Compensation:
Octave is committed to pay equity. To maintain our commitment to pay equity, Octave will follow Pay Transparency regulations on all open job postings. Current Pay Transparency laws require companies to include a position's salary or hourly wage range (not including bonuses or equity-based compensation) in any internal or external job posting. This requirement extends to job postings published by a third party at an employer's request.

Octave will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure is (a) in response to a formal complaint or charge, (b) in furtherance of an investigation, proceeding, hearing, or action, including an investigation conducted by the employer, or (c) consistent with Octave’s legal duty to furnish information.

Starting pay for qualified applicants will depend on a combination of job-related factors, which may include education, training, experience, location, business needs, or market demands. The expected salary range for this role is set forth below, and this range may be modified in the future.
- The salary range for Geo 1 (all states, excluding those in Geo 2 or Geo 3, and D.C.) is $142,600 - $153,100.
- The salary range for Geo 2 (CO, HI, MD, RI) is $156,900 - $168,400.
- The salary range for Geo 3 (AK, CA, CT, MA, NJ, NY, WA) is $164,000 - $176,000.

All Geos are eligible for equity in the form of stock options. Additionally, this position is eligible for the following benefits: company-sponsored life insurance, disability, and AD&D plans. Voluntary benefits such as 401k retirement, medical, dental, vision, FSA, HSA, dependent care, and commuter/parking options are also available. Octave offers generous Paid Time Off as well as paid parental leave benefits.

This job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities and activities may change at any time with or without notice.

How We Use Technology in Hiring:
As part of our hiring process, we may use technology tools, including AI-supported systems, to assist with reviewing applications or documenting interviews. These tools are designed to support our team, not replace human judgment, and final hiring decisions are always made by our team.

Application Instructions:
Please complete the following application. Please note that the U.S. Equal Opportunity Employment Information questions below are used for the purposes of EEOC reporting and are optional to complete. Octave is unable to change these questions and we acknowledge that many of the U.S. Equal Opportunity Employment Information questions are not inclusive or affirming of all aspects of cultural identity. Octave is committed to an inclusive workplace environment, and this information will not inform how we approach hiring or employment.
More Data Engineer Jobs
Data Engineer – Architect
In All Media
Imagine the future of business. Ideas for a Digital Renaissance.
• Define data strategy and implement a long-term roadmap
• Design and optimize data models for operational and analytical uses
• Lead database performance tuning and improvements
• Identify critical business insights
• Establish data governance practices
• Utilize Python for automation tasks
• Collaborate cross-functionally with stakeholders and teams
This description is a summary of our understanding of the job description.

Role Description
At Thermo Fisher Scientific, you’ll discover impactful work, innovative thinking, and a culture dedicated to working the right way, for the right reasons - with the customer always top of mind. The work we do matters, like helping customers find cures for cancer, protecting the environment, and supporting our customers’ medical-related inquiries. As the world leader in serving science, with the largest investment in R&D in the industry, our colleagues are empowered to realize their full potential as part of a fast-growing, global organization that values passion and unique contributions. Our commitment to our colleagues across the globe is to provide the resources and opportunities they need to make a difference in our world while building a fulfilling career with us.

The Senior Databricks Engineer will architect, build, and optimize data solutions that support Thermo Fisher Scientific’s digital transformation strategy. As a vital contributor to the CRG Digital Platform and Architecture team, this position will be responsible for building connections and workflows within cloud-based systems.
- Build, develop, and deploy scalable data pipelines and ETL/ELT processes using Databricks.
- Engineer robust data solutions to integrate enterprise data sources, including ERP, CRM, laboratory, and manufacturing systems.
- Develop reusable frameworks and templates to accelerate data delivery and ensure consistency across domains.
- Implement and maintain high-performance data connections across Databricks, Snowflake, and Iceberg environments.
- Author and optimize complex SQL queries, transformations, and data models for analytics and reporting use cases.
- Support data Lakehouse and data mesh initiatives to enable seamless access to trusted data across the organization.
- Apply data governance, lineage, and security controls using Unity Catalog, Delta Live Tables, and related technologies.
- Partner with compliance and cybersecurity teams to uphold data privacy, GxP, and regulatory standards.
- Establish monitoring, auditing, and optimization processes for ongoing data quality assurance.
- Collaborate with data scientists, architects, and business partners to build and implement end-to-end data solutions.
- Serve as a technical mentor and leader with vision within the CRG data engineering community.
- Contribute to critical initiatives for digital platform modernization and advanced analytics enablement.

Qualifications
- Bachelor’s degree or equivalent.
- Minimum of 8 years professional experience in data engineering or data platform development is required.
- Minimum of 5 years of hands-on experience with Databricks and Apache Spark in production environments is required.
- Demonstrated expertise with Snowflake and Apache Iceberg is required.
- Strong proficiency in SQL and experience optimizing queries on large, distributed datasets is required.
- Proven experience with cloud-based data platforms (Azure preferred; AWS or GCP acceptable) is required.
- Strong understanding of data modeling, ETL/ELT pipelines, and data governance practices is required.
- Experience implementing Unity Catalog or CI/CD pipelines for data workflows is preferred.
- Experience in a life sciences, biotech, or manufacturing environment is preferred.
- Strong interpersonal and communication skills with the ability to collaborate across global and multi-disciplinary teams is preferred.

Requirements
- Strong technical skills in Databricks, Spark, SQL, Snowflake, and Iceberg.
- Solid understanding of distributed computing, performance optimization, and large-scale data processing.
- Proven track record translating business requirements into scalable data architectures.
- Excellent problem-solving and analytical skills; strong attention to detail.
- Proficiency in leading through example, mentoring junior engineers, and championing standard procedures within the data engineering realm.
- Comfortable working in a fast-paced, matrixed environment with evolving priorities.

Benefits
- Fully remote position.
- Relocation assistance is NOT provided.
- Must be legally authorized to work in the United States without sponsorship.
- Must be able to pass a comprehensive background check, which includes a drug screening.
Lead DataStage Platform Administrator
Novalink Solutions LLC
Mode of Interview: The State will conduct interviews. The State reserves the right to remove a resource from consideration if the resource is unavailable for interview as requested by the State. Interviews will be conducted via Microsoft Teams.
Project Schedule: Anticipated Project Start Date: May 1, 2026. Anticipated End Date: April 30, 2027. The State retains the option to extend the work order in increments determined by the State.
This description is a summary of our understanding of the job description.

Role Description
DHHS is seeking a Lead DataStage Platform Administrator to support the transition, stabilization, and ongoing operation of a large-scale enterprise data processing and analytics platform. This role will serve as the primary technical administrator for the IBM DataStage ETL platform and associated job orchestration tools, ensuring reliable execution of high-volume batch data processing workloads.

The Lead Administrator will be responsible for:
- Installation, configuration, tuning, monitoring, and operational support of the DataStage platform.
- Coordination with infrastructure, database, and application teams.
- Providing secondary operational administration support for the Cognos Analytics reporting platform.
- Assisting with monitoring, environment management, and operational troubleshooting.

This role operates within a lean team responsible for stabilizing and supporting a mission-critical enterprise data and reporting environment. The team is intentionally small, and every member is expected to take a hands-on, execution-oriented approach to problem solving and platform operations.

Qualifications
- 10+ years of experience administering IBM InfoSphere DataStage platforms.
- Demonstrated expertise in DataStage installation, configuration, and administration.
- Experience doing software installations, patches, and upgrades for InfoSphere DataStage in RHEL, Linux/Unix environments.
- Strong experience managing large-scale ETL batch processing environments.
- Experience administering ETL platforms integrated with enterprise job orchestration tools such as RunDeck, Control-M, or similar schedulers.
- Strong troubleshooting and performance tuning experience for ETL pipelines.
- Experience working with enterprise relational databases such as DB2.
- Experience working with Linux-based enterprise data platforms.
- Experience supporting data warehouse and analytics environments.

Requirements
- Experience supporting IBM Cognos Analytics platform administration.
- Experience supporting cloud-hosted data platforms (AWS preferred).
- Experience working in healthcare data environments, including claims or encounter data processing.
- Experience working with state government or public-sector technology programs.
- Familiarity with Medicaid systems, healthcare reporting, or regulatory data processing environments.

Key Skills
- IBM InfoSphere DataStage Administration
- Enterprise Job Orchestration (RunDeck, Control-M)
- Cognos Analytics Platform Operations
- Linux Administration
Enterprise Data Integrations Lead
Catalent Pharma Solutions
more products. better treatments. reliably supplied.™
This description is a summary of our understanding of the job description.

Role Description
The Enterprise Business Intelligence Lead will be responsible for overseeing the evolution and maturity of the business intelligence lifecycle, from data collection and transformation to analysis and visualization. The role requires a strong blend of technical expertise, leadership skills, and business acumen. The successful candidate will collaborate closely with business stakeholders, cross-functional teams, and Enterprise Data Sciences & Analytics teammates to ensure the correct business needs are identified and effective BI/analytics solutions are designed, to maintain data integrity and accuracy, and to drive self-service BI across the organization.
- Provide strategic direction, solution architecture and approach, and leadership to the business intelligence project teams.
- Promote self-service BI and establish a new Analytics Community of Practice.
- Foster a collaborative and results-oriented environment.
- Manage and oversee a team of offshore contractors performing managed services, project delivery work, and continuous improvement projects.
- Drive a culture of self-service BI and thought leadership.
- Ensure data governance and data quality standards are met throughout the data lifecycle.
- Utilize advanced analytics techniques to extract insights from complex data sets.
- Oversee the design and development of data visualizations, dashboards, and reports using Power BI.
- Up to 10% travel expectations.
- Other duties as assigned.

Qualifications
- Bachelor’s degree in Business, Computer Science, Data Science, or related field required.
- Minimum seven years of proven experience in business intelligence, data analytics, or a related role.
- Leadership experience with a history of working across multiple geographies.
- Life Sciences industry experience, required.
- Experience with GxP processes, required.
- Familiarity with Cloud platforms (e.g., AWS, Azure, Google Cloud) relevant to data management and analytics.
- Experience with Power BI apps leveraging Databricks is highly desirable.
- Experience in data visualization tools (SQL, Python, R) and BI visualization tools (Tableau, Power BI, QlikView, etc.).
- High proficiency with Microsoft Power BI.
- Individual may be required to sit, stand, and walk regularly and occasionally lift 0-15 pounds.

Benefits
- Opportunity to impact and help build a growing early talent organization.
- Potential for career growth within an expanding team.
- Diverse, inclusive culture.
- 152 hours of PTO + 8 Paid Holidays.
- Competitive medical benefits and 401K.
- Dynamic, fast-paced work environment.



