Job Closed
This listing is no longer active.
Regeneron is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion or belief (or lack thereof), sex, nationality, national or ethnic origin, civil status, age, citizenship status, membership of the Traveler community, sexual orientation, disability, genetic information, familial status, marital or registered civil partnership status, pregnancy or parental status, gender identity, gender reassignment, military or veteran status, or any other protected characteristic in accordance with applicable laws and regulations. The Company will also provide reasonable accommodation to the known disabilities or chronic illnesses of an otherwise qualified applicant for employment, unless the accommodation would impose undue hardship on the operation of the Company's business.
Senior Manager, Artificial Intelligence Data Engineer
Location
United Kingdom
Posted
5 days ago
Salary
0
Seniority
Lead
Job Description
Senior Manager, Artificial Intelligence Data Engineer
Regeneron Pharmaceuticals
Role Description Global Development is embarking on a Digital Transformation project incorporating AI, machine learning, and automation to help us reduce cycle times, improve quality allowing us to focus on more meaningful work. We focus on developing and improving data pipelines, infrastructure, architecture, and analytic tools to allow resources to fuel our transformation. Working with a team of engineers, analysts, and scientists, you will contribute to modernizing our clinical data infrastructure. This highly visible role will be a technical and strategic liaison between the transformation projects and IT. A key responsibility will be developing and optimizing schema and data models for clinical data. We ensure that those products are standard compliant and interoperable with other platforms and data sources. This role will also contribute to data governance strategy and to the development and management of tools improving and monitoring data quality. The data engineer will also develop processes to automate routine workflows. The engineer will also provide technical leadership and mentorship and will stay current with innovations in data engineering so that they may be evaluated for implementation. This position offers the opportunity to contribute to a fast-growing, science-driven organization making a meaningful difference to patients worldwide. This can be a remote position in the UK or in our Uxbridge offices. Discover your role: - Work with a cross-functional team to optimize and implement our data strategy with a focus on optimization for digital transformation and the use of AI/ML - Design and document end-to-end data architectures that support diverse analytic, operational, and research needs. - Facilitate the implementation of a modern data platform (e.g. Snowflake, Databricks, etc.) - Identify opportunities and implement solutions to increase data interoperability and standardization among systems and across other business units. - Develop and implement pipelines to monitor and improve both internal and external (i.e., from CRO partners) data quality. - Work with informatics and AI engineers optimizing the utility of data for their respective pipelines. - Monitor and optimize the performance of data architectures and platforms. - Develop or implement critical metrics to measure the impact of the overall data strategy. - Stay up to date with the latest advances in the field and, as appropriate, evaluate them for adoption. Requirements - An advanced degree in computer science, statistics, biomedical informatics, or a related field is preferred (PhD + 2 years of experience or an MS + 4 years of relevant experience). - A minimum of 5 years’ experience developing and leading the implementation of data engineering solutions, including accountability for the success of the implementation, in life sciences or healthcare. - Demonstrated expertise in designing and maintaining infrastructure and architecture for clinical or biomedical data in a healthcare or life sciences setting. - Expertise in modern data platforms (e.g., Snowflake, Redshift, BigQuery, Databricks) and programming languages such as Python, SQL, R, etc. - Maintain and manage code repositories (e.g Bitbucket) ensuring clean, well-documented code with proper version control. - Proficiency in cloud architecture (e.g. AWS, Azure, GCP) and DevOps practices. Recognized certifications are a plus. - Experience building, scaling, and maintaining pipelines for structured and unstructured data. Ability to integrate pipelines across the enterprise is essential. - Deep understanding of regulatory frameworks (HIPAA, GDPR, 21 CFR Part 11) and clinical data standards (CDISC, HL7, FHIR). - Knowledge of machine learning pipelines and integration with clinical data platforms. - May require travel up to 20%. Benefits - Health and wellness programs (including medical, dental, vision, life, and disability insurance) - Fitness centers - 401(k) company match - Family support benefits - Equity awards - Annual bonuses - Paid time off - Paid leaves (e.g., military and parental leave) for eligible employees at all levels
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Data Engineer
Robots and PencilsRobots & Pencils is an applied AI engineering firm building the next frontier of business architecture. We design and ship AI co-workers that integrate into enterprise operations and deliver measurable results for our clients. Founded in 2009, we are smaller, faster, and more senior by design, with teams averaging 15+ years of experience.
Role Description We’re looking for a Staff Data Engineer to join a multi-disciplinary engineering team building modern, enterprise-grade data platforms. This role is ideal for an experienced engineer who can define data strategy, own platform decisions end-to-end, and contribute to technical leadership across the team. In this role, you will: - Design scalable data lakes, warehouses, and pipelines. - Define governance and quality standards. - Drive data platform modernization across real, in-flight work where performance, reliability, and security are critical. - Mentor more junior engineers. - Partner with leadership on data strategy. - Bring an AI-forward mindset. What You’ll Do Craft & Delivery - Define data architecture and platform strategy, leading design across pipelines, warehouses, and data lakes. - Build and optimize scalable data pipelines supporting batch and real-time processing. - Define and enforce data governance, quality standards, and compliance frameworks across the platform. - Build monitoring, logging, and alerting for data pipelines and services, and contribute to CI/CD workflows for data deployment and automation. - Drive data platform modernization, optimizing for performance, cost, and scalability. - Bring an AI-forward mindset to your daily work, using tools like Claude, Cursor, and other modern AI assistants to ship higher-quality work at pace. - Design and implement data contracts and event flows in collaboration with backend, platform, and engineering teams. - Lead the design and implementation of data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows. - Integrate data services with APIs, middleware, and third-party systems to support end-to-end data consumption. Collaboration & Communication - Partner with leadership on data strategy, translating technical depth into decisions others can act on. - Collaborate closely with engineering, analytics, AI, and product teams to align data platforms with broader goals. - Advocate for data quality, governance, and platform best practices across teams and engagements. Leadership & Influence - Establish data engineering standards that lift the quality and consistency of work across the team. - Mentor junior and mid-level engineers, helping them grow their craft, confidence, and impact. - Make high-stakes architectural decisions with clear ownership and consideration of long-term tradeoffs. Qualifications - 7+ years of professional data engineering experience, with experience leading complex data platform initiatives. - Strong system architecture background with expertise in distributed data systems. - Expert proficiency in Python, Scala, and SQL. - Deep expertise with cloud-native data platforms and enterprise data warehousing. - Strong expertise in data pipeline orchestration and processing. - Strong experience with streaming platforms and real-time data processing (e.g., Kafka, Kinesis, Pub/Sub). - Strong data modeling expertise and experience with data transformation. - Strong experience with data quality, governance, and compliance frameworks. - Strong experience with container orchestration and CI/CD for data systems. - Strong experience building data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows. - Demonstrated leadership and technical mentoring experience across a team or organization. - Strong stakeholder communication skills, with the ability to translate technical depth across audiences. - Demonstrable, day-to-day usage and expert knowledge of AI-forward coding tools such as Claude and Cursor. - Excellent problem-solving skills and the ability to navigate highly ambiguous technical and business challenges with sound judgment. - Experience with data mesh or data fabric concepts, lakehouse architectures, or governance framework implementation is a plus. Helpful Extras and Unique Skills - Experience with handling and modeling data in the healthcare industry is a plus. - AWS certifications, like Certified Data Engineer – Associate, strongly preferred. Benefits - A doer who sees something broken and fixes it. - A fast learner who embraces the changing AI landscape. - Direct in a way that improves the work. - Obsessed with craft and detail. - Built for ownership and accountability. - All in for clients' businesses. - Resourceful under constraints. - Glad to collaborate with dedicated team members.
Role Description - Arquitetar soluções escaláveis e seguras utilizando as melhores práticas da arquitetura Databricks; - Desenvolver e implementar estratégias eficazes de gerenciamento de projetos, utilizando ferramentas como Jira e Git para garantir transparência e eficiência; - Colaborar com equipes de produto e design para garantir a entrega de soluções alinhadas às expectativas dos clientes e aos objetivos do negócio; - Identificar e resolver problemas técnicos complexos, mantendo um ambiente de desenvolvimento ágil e colaborativo; - Liderar e orientar equipes multidisciplinares no desenvolvimento de produtos de dados, garantindo qualidade, escalabilidade e cumprimento de prazos. Qualifications - Sólido conhecimento e experiência prática em arquitetura Databricks; - Experiência comprovada em integração e implementação de soluções de engenharia de dados; - Vivência com Jira e Git para controle de versão e gerenciamento de projetos; - Forte compreensão das melhores práticas de segurança e governança no Databricks, incluindo Unity Catalog; - Conhecimento em bancos de dados relacionais e não relacionais, como PostgreSQL, DynamoDB e Redis. Requirements - Triagem de currículos; - Bate-papo com equipe de Gestão Humana; - Entrevista Técnica com lideranças do time; - Entrevista Final com cliente. Benefits - Vales Alimentação e Refeição (Swile); - Cobertura de até 100% em Plano de Saúde e Odontológico (Amil); - Seguro de Vida em grupo; - Trabalho remoto; - Convênio Saúde Mental - psicoterapia online e presencial; - Incentivo a certificações e cursos; - Convênio para cursos de pós-graduação e MBA (Esalq/USP); - Parceria com escolas de idiomas; - Parceria com academias e apps de bem-estar (Wellhub); - Palestras e rodas de conversa internas; - Bônus por indicação; - Happy hours; - Mimos em datas comemorativas. Company Description Acreditamos que os dados são o combustível para a transformação das empresas e das pessoas. Com isso em mente, ajudamos nossos clientes a criar soluções analíticas que os inspiram a realizar transformações organizacionais com impacto direto nos seus modelos de negócio. Na Five Acts, valorizamos a diversidade, a equidade e a inclusão. Acreditamos que a diversidade é uma força impulsionadora da inovação e do crescimento. Portanto, não fazemos distinção por questões de gênero, orientação sexual, religião, idade, etnia, ou qualquer outra. Nossas oportunidades são abertas para todas as pessoas!
AI Data Engineer
Emmes GroupVeridix AI is the technology, data, and AI arm of the Emmes Group, a leading full-service contract research organization (CRO) with over 47 years of experience in supporting clinical research across more than 70 countries. With industry-leading capabilities in cell and gene therapy, vaccines, infectious diseases, and ophthalmology, Emmes is one of the top clinical service providers to the U.S. government and is rapidly expanding its presence in biopharma. Veridix AI develops advanced eClinical solutions, powering clinical trials through patient data collection, randomization, biospecimen tracking, and data quality monitoring. Our cutting-edge AI innovations, including Generative AI (GenAI) capabilities, are transforming clinical trial timelines by streamlining processes from document authoring to automating study builds. Our “Character Achieves Results” culture is driven by five key values that guide our actions in the way we conduct research and distinguish us as an organization: Integrity, Agility, Passion for Excellence, Collaborative Partnerships, and Intellectual Curiosity. If you share our motivations and passion in research, come join us!
Role Description The Data Engineer will have a strong background in data engineering and extensive experience with AWS Cloud services. As a Data Engineer, they are responsible for designing, building, and maintaining scalable data pipelines and infrastructure to support our data analytics and business intelligence initiatives. - Design, develop, and maintain robust data pipelines and ETL processes to ingest, transform, and store data from various sources. - Collaborate with data scientists, analysts, and other stakeholders to understand data requirements, design data models, and deliver solutions that meet business needs. - Automate data workflows and implement monitoring and logging to ensure the health and performance of the data infrastructure. - Conduct data profiling, cleansing, and validation to ensure high data quality standards. - Optimize data storage and retrieval performance, ensuring data quality and integrity. - Implement and manage data architecture on AWS, ensuring scalability, reliability, and security. - Stay up to date with the latest trends and best practices in data engineering and AWS cloud technologies. Qualifications - Bachelor’s or master’s degree in computer science, Information Technology, or a related field. - 3 or more years of related professional experience. - Experience in data engineering with a strong focus on AWS cloud services. - Proficiency in SQL and experience with relational databases (e.g., PostgreSQL, MySQL, Redshift). - Experience with AWS services such as S3, Lambda, Glue, EMR, Kinesis, and Redshift. - Strong programming skills in languages such as Python, Java, or Scala. - Knowledge of data modeling, ETL concepts, and data warehousing. - Familiarity with version control systems (e.g., Git) and CI/CD pipelines. - Excellent problem-solving skills and attention to detail. - Knowledge of machine learning frameworks and data science workflows. - Familiarity with data visualization tools (e.g., QuickSight, Qlik). - Familiarity with NoSQL databases (e.g., DynamoDB, MongoDB). - Strong collaboration skills with cross-functional teams to establish best design and user flows for applications. - Strong multitasking, problem solving, and organizational skills. - Proven ability to work independently and in a team environment. - Satisfactory background check required. Benefits - Flexible Approved Time Off - Tuition Reimbursement - 401k Retirement Plan - Work From Home Anywhere in the US - Maternal/Paternal Leave - Casual Dress Code & Work Environment
• Design and manage scalable data platforms powering real-time analytics, batch processing, and exploratory analysis, using AI-assisted development as the default workflow, not an afterthought. • Own the full data lifecycle: ingestion, ETL, storage, and serving, building and iterating on pipelines with AI pair-programming tools (Claude Code) to accelerate delivery. • Ingest data from diverse sources via both streaming (Kafka, Kinesis) and batch pipelines, unifying them into a consistent, queryable platform. • Architect medallion-layer data models (Bronze/Silver/Gold) in Databricks, ensuring business needs are met with clean, well-documented schemas. • Automate, test, and harden data workflows, writing AI-augmented tests, data quality checks, and CI/CD pipelines that catch issues before production. • Build and maintain AI-ready tooling: craft prompts, custom slash commands, and agent workflows that let the entire team scaffold pipelines, generate documentation, and validate data quality faster. • Build and improve Databricks Genie chatbots that allow non-technical users to query data using natural language. • Collaborate with product analytics and data science, applying engineering rigor to messy, unstructured data and transforming it into reliable, production-ready datasets. • Contribute to infrastructure-as-code (Terraform/Atmos) for provisioning and managing cloud data infrastructure.

