Job Closed
This listing is no longer active.
Sarah Cannon Research Institute (SCRI) is one of the world’s leading oncology research organizations conducting community-based clinical trials. Focused on advancing therapies for patients over the last three decades, SCRI is a leader in drug development. In 2022, SCRI formed a joint venture with former US Oncology Research to expand clinical trial access across the country. It has conducted more than 850 first-in-human clinical trials since its inception and contributed to pivotal research that has led to the majority of new cancer therapies approved by the FDA in the past decade. SCRI’s research network brings together more than 1,300 physicians who are enrolling patients into clinical trials at more than 200 locations in 20+ states across the U.S.
Senior Data Engineer/Software Developer
Location
United States
Posted
65 days ago
Salary
$136K - $227K / year
Seniority
Senior
Job Description
Senior Data Engineer/Software Developer
McKesson
McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve – we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patients, our communities, and our people. If you want to be part of tomorrow’s health today, we want to hear from you. We are seeking a Senior Data Engineer / Software Developer to support and scale our AI‑driven data platforms. This role focuses on building robust AI pipelines, shared tooling, and engineering standards that enable consistent, high‑quality generation and consumption of advanced analytics outputs across the organization. The engineer will play a key role in strengthening the technical foundation that supports analytics and oncology insights. Key Responsibilities Business Drivers - Collaborate with data scientists, machine learning engineers, and analytics teams to provide technical direction for AI and advanced analytics platforms - Work closely with data warehousing, data engineering, and cloud platform teams to design optimal architectures for AI‑driven data solutions - Enable the scalable use of AI‑generated outputs (e.g., ML predictions, extracted signals, model outputs) in conjunction with structured data to support analytics and oncology insights - Partner with senior management and stakeholders to communicate AI system capabilities, implementation approaches, assumptions, and limitations in clear, non‑technical language Technical Responsibilities - Participate in the full lifecycle of AI and data platform solutions, including planning, design, implementation, deployment, monitoring, and ongoing maintenance - Design, build, and maintain production‑grade AI pipelines, shared frameworks, and supporting services in the cloud (e.g., AWS, GCP, Azure; Azure preferred) - Design, test, and maintain AI‑enabled applications and services using modern software engineering and testing methodologies - Perform code reviews and help define engineering and AI code standards to ensure high‑quality, scalable, and maintainable solutions - Develop and maintain scalable data and AI pipelines using Python and supporting technologies - Design and implement data architectures that support downstream analytics and access by McKesson analysts and AI data consumers Drive innovation - Develop reusable engineering solutions to support AI workloads, model execution, inference pipelines, and integration into downstream data products - Evaluate new AI‑related tools, frameworks, and platforms to improve scalability, reliability, and developer productivity prior to broader adoption - Experience supporting AI or machine learning solutions in healthcare, oncology, genomics, or medical data domains is preferred but not required Minimum Requirement - Degree or equivalent and typically requires 7+ years of relevant experience. Education - A degree in a quantitative field such as Statistics, Machine Learning, Mathematics, Computer Science, Economics, Epidemiology or any other related field - Master’s Degree or higher preferred Critical Skills - 3+ years of relevant experience in data engineering or software development roles supporting analytics or AI‑enabled solutions; healthcare experience preferred - Proficiency in Python and SQL, with demonstrated experience developing and maintaining reliable, production‑grade data pipelines and analytical datasets - Experience building and supporting internal tools or applications used for data validation, monitoring, review, or operational analytics workflows - Working knowledge of application integration patterns, including service‑based architectures and data access layers that support UI‑driven tools - Hands‑on experience using Databricks for data processing, analytics development, and collaboration with data science or analytics teams - Experience working within Microsoft Azure environments, applying standard engineering practices to deliver maintainable, well‑documented solutions Additional Skills - Familiarity with machine learning or AI concepts, including model lifecycles, inference workflows, and integration of model outputs into analytics or data products - Exposure to Natural Language Processing or other unstructured data workflows, such as text ingestion, extraction, or downstream signal consumption - Experience with NoSQL or semi‑structured data stores and alternative data persistence patterns - Experience with analytics visualization tools or reporting solutions, and familiarity with modern scripting or web technologies used to support internal tools We are proud to offer a competitive compensation package at McKesson as part of our Total Rewards. This is determined by several factors, including performance, experience and skills, equity, regular job market evaluations, and geographical markets. The pay range shown below is aligned with McKesson's pay philosophy, and pay will always be compliant with any applicable regulations. In addition to base pay, other compensation, such as an annual bonus or long-term incentive opportunities may be offered. For more information regarding benefits at McKesson, please click here. Our Base Pay Range for this position $136,300 - $227,100 McKesson has become aware of online recruiting-related scams in which individuals who are not affiliated with or authorized by McKesson are using McKesson’s (or affiliated entities, like CoverMyMeds or RxCrossroads) name in fraudulent emails, job postings or social media messages. In light of these scams, please bear the following in mind: McKesson Talent Advisors will never solicit money or credit card information in connection with a McKesson job application. McKesson Talent Advisors do not communicate with candidates via online chatrooms or using email accounts such as Gmail or Hotmail. Note that McKesson does rely on a virtual assistant (Gia) for certain recruiting-related communications with candidates. McKesson job postings are posted on our career site: careers.mckesson.com. McKesson is an Equal Opportunity Employer McKesson provides equal employment opportunities to applicants and employees, without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability, age, genetic information, or any other legally protected category. For additional information on McKesson’s full Equal Employment Opportunity policies, visit our Equal Employment Opportunity page. McKesson is committed to being an Equal Employment Opportunity Employer and offers opportunities to all job seekers including job seekers with disabilities. If you need a reasonable accommodation to assist with your job search or application for employment, please contact us by sending an email to (United States) Disability_Accommodation@McKesson.com or (Canada) Accessibility@mckesson.ca. Resumes or CVs submitted to this email box will not be accepted. Join us at McKesson!
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Engenheiro(a) de dados Sênior
iFoodNo iFood, acreditamos na força da diversidade para gerar #Inovação e atingir #Resultados incríveis, por isso, não fazemos distinção para candidatos com deficiência, gênero, orientação sexual, raça/etnia, idade, origem, constituição familiar e estética. Temos grupos compostos por foodlovers voluntários, onde falamos sobre Raça, Gênero, LGBTQI+ e PcD. Queremos ser a empresa onde pessoas escolham como lugar para se desenvolver e contribuir para a realização de sonhos, #AllTogether. Nós, FoodLovers, temos fome de inovação e resultado. Buscamos sempre fazer o nosso melhor, pensando "fora da caixa" e atuando com agilidade e responsabilidade! Temos fome de diversidade, conhecimento e compartilhamento. Trabalhamos em um ambiente de muita versatilidade. Sabe o que promove a nossa receita especial? As pessoas! Vem fazer parte disso🤝
Nosso Modo de Fazer no Time: Transforme sua carreira com o iFood! Somos uma empresa brasileira de tecnologia referência na América Latina. Por meio de soluções inovadoras, conectamos milhares de restaurantes a milhões de consumidores diariamente com uma média de 100 milhões de pedidos mensais. Além do delivery de comida, também somos Mercado, Farmácia e Pet. Temos também o iFood Pago, nossa Fintech, que engloba o iFood Benefícios, o vale alimentação e refeição do iFood e o próprio iFood Pago, o banco do restaurante. Junte-se a nós e faça parte de uma equipe que está sempre à frente com tecnologia de ponta e inovação constante. Você fará parte do time de Governança de Dados, com o desafio de melhorar a descoberta de dados, assim como toda a camada de metadados e ciclo de vida dos dados que temos, definindo e construindo o modo de trabalhar com dados aqui dentro, colaborando ativamente com times de engenharia, arquitetura e dados para garantir que tenhamos dados confiáveis, documentados, seguros e acessíveis. Esta é uma posição com componente técnico em desenvolvimento e foco em identificar oportunidades e aplicar boas práticas de governança em ambientes modernos de dados em nuvem. Seu Cardápio Diário: - Definirá políticas e processos de governança de dados (catalogação, linhagem, entre outros); - Ajudará a projetar e construir soluções que facilitem a aderência às políticas definidas; - Participará da definição e evolução do framework de governança, promovendo a cultura de dados entre os times; - Apoiará iniciativas de novas tecnologias e modos de trabalhar com dados para garantir que surjam com governança definida, ajudando domínios de dados a adotarem práticas de ownership e interoperabilidade; - Irá desenvolver monitores e indicadores de governança; - Colaborará com arquitetos e engenheiros para garantir que os pipelines de dados estejam em conformidade com as políticas definidas; - Traduzirá requisitos de governança em soluções técnicas alinhadas às boas práticas de engenharia e realidade do negócio. Ingredientes Que Buscamos: - Experiência prévia com governança de dados (Data Governance, Data Stewardship, Data Quality, Metadata Management etc.); - Capacidade de adequar a teoria de governança para a realidade da operação do dia a dia; - Experiência em engenharia de dados ou software (construção de pipelines de dados, micro serviços, git, testes, modularização e orquestração); - Sólido entendimento do ciclo de vida de dados e de princípios de arquitetura moderna (Data Lake, Lake house, Data Warehouse); - Conhecimento em cloud (AWS, GCP ou Azure); - Capacidade de escrever e interpretar queries em SQL e scripts em Python para análise, transformação e validação de dados; - Boa comunicação com públicos técnicos e não técnicos; habilidade de traduzir complexidade técnica em linguagem acessível; - Experiência com gestão de metadados, linhagem de dados, catalogação e classificação - Mindset colaborativo, resiliente e voltado à melhoria contínua; Para realçar o sabor: - Conhecimento ou certificações em frameworks de governança de dados (ex: DAMA-DMBOK, DCAM, EDM Council); - Experiência com ferramentas de monitoramento e visualização de dados; - Experiência com ferramentas de catálogo como DataHub e OpenMetadata; - Conhecimento em Data Warehouse, Bancos de dados e Kafka; - Conhecimentos de ferramentas como Airflow, Spark, Databricks, DBT ou similares; Buscamos uma pessoa apaixonada por inovação e tecnologia, que esteja sempre em busca de novos aprendizados e que goste de desafios. Se você se identifica com este perfil, adoraríamos conhecer você!
• Support the Bank’s “People First” focus and rules of engagement—maintaining a professional demeanor, working as an active member of the CNOB team, providing clients excellent service, always striving to make CNOB “A Better Place to Be” • Architect, build, and maintain data pipelines using Apache Airflow, ensuring seamless data movement across AWS S3 and Amazon Redshift. • Design and implement scalable data models using dbt Labs, transforming raw data into clean datasets optimized for analytical querying. • Facilitate the setup of new data sources, designing the architecture to securely and efficiently ingest data from third-party APIs, transactional databases, and external platforms into the data lake/warehouse. • Maintain a deep understanding of the underlying data model to troubleshoot bottlenecks, optimize Redshift query performance, and implement rigorous data quality testing and alerting. • Serve as a subject matter expert on data architecture, establishing best practices for code reviews, version control, and CI/CD workflows within the data engineering team.
Staff Data Engineer
VisaBased in Foster City, California, Visa is a global payments technology organization. Visa was founded in 1958, coinciding with Bank of America’s launch of the
Role Description We’re hiring a Staff Data Engineer to design and build robust data pipelines for our corporate Data Lake. In this role, you’ll own data products end-to-end in production, work autonomously on significant projects, and mentor junior engineers. You’ll be expected to make sound data engineering trade-offs under limited supervision, with strong hands-on experience in Spark, Databricks, Delta Lake, and Airflow. At Pismo, the Data Lake team is responsible for centralizing and organizing data into a single, trusted platform that supports decision-making across the company and for external clients. We work on challenges such as: - Scaling global data infrastructure - Delivering high-quality reporting - Enabling secure, self-service access to data Key responsibilities include: - Design and implement data ingestion and transformation pipelines (batch and near-real-time) using PySpark/SparkSQL on Databricks. - Own data pipelines end-to-end in production: freshness, correctness, availability, and SLA adherence. - Build and maintain Delta Lake tables following medallion architecture patterns (bronze/silver/gold). - Design and optimize Airflow DAGs (MWAA) for complex orchestration scenarios. - Implement and maintain data quality frameworks (Great Expectations or equivalent) as integrated pipeline gates. - Write advanced SQL for data modeling, transformation, and performance optimization. - Conduct thorough code reviews and mentor Analyst-level engineers through pairing and design guidance. - Investigate, diagnose, and resolve data quality incidents and pipeline failures independently. - Collaborate with Analytics, BI, and Product teams to design consumer-friendly datasets. - Contribute to CI/CD, testing standards, and data governance practices. This is a remote position. A remote position does not require job duties to be performed within proximity of a Visa office location. Remote positions may be required to be present at a Visa office with scheduled notice. #LI-Remote Qualifications - Apache Spark (PySpark, SparkSQL) — production experience - Databricks (jobs, workflows, cluster management, tuning) - Delta Lake (ACID tables, OPTIMIZE, VACUUM, schema evolution, MERGE) - Advanced SQL (window functions, CTEs, query optimization) - Apache Airflow / MWAA (DAG design, retries, backfills, SLAs) - Amazon S3 data lake design (partitioning, layout, lifecycle) - Data quality frameworks (Great Expectations or equivalent) - Data modeling (dimensional / Kimball, medallion layers) - Git/GitHub, CI/CD for data pipelines - Terraform - Python for automation and data processing Requirements - Desirable qualifications: - CDC patterns (DMS, incremental processing, MERGE upserts) - Streaming ingestion (Structured Streaming, Auto Loader) - AWS Glue Catalog / Unity Catalog - Metadata management (OpenMetadata) - BI integration (Superset, dashboarding) Company Description Founded by experienced entrepreneurs and engineers in 2016, Pismo is a technology company that provides a comprehensive processing platform for banking, card issuing, and financial market infrastructure and helps customers innovate and build the next generation of banking and payment solutions. Pismo joined Visa in 2024. Leveraging Visa’s solutions, our core platform, and an expanding suite of capabilities, Pismo addresses the technological challenges that large banks, marketplaces, and fintech companies face in migrating from legacy systems to more advanced technology in the market. Pismo’s cloud-based platform empowers firms to build and launch financial products rapidly, scaling as they grow to have a broader audience while keeping high security and availability standards. Pismo’s 500+ employees are located in more than 10 countries around the world.
Onebridge, a Marlabs Company, is a global AI and Data Analytics Consulting Firm that empowers organizations worldwide to drive better outcomes through data and technology. Since 2005, we have partnered with some of the largest healthcare, life sciences, financial services, and government entities across the globe. We have an exciting opportunity for a highly skilled Sr. Data Modeler to join our innovative and dynamic team. Sr. Data Modeler | About You As a Sr. Data Modeler, you are responsible for designing and governing high‑quality, scalable data models that bridge business needs with technical implementation. You bring deep experience translating complex, enterprise‑level requirements into conceptual, logical, and physical data models that support analytics, reporting, and operational workloads. You thrive at the intersection of data architecture, governance, and system optimization, ensuring data is secure, reliable, and fit for purpose. You collaborate closely with business, analytics, and engineering teams to align data strategy with organizational goals. You are detail‑oriented, strategic‑minded, and committed to building data foundations that support long‑term growth and compliance. Sr. Data Modeler | Day-to-Day - Design and maintain conceptual, logical, and physical data models that support enterprise data and analytics initiatives. - Analyze and optimize existing data models, databases, and pipelines to improve performance, scalability, and data usability. - Partner with business and technical stakeholders to translate requirements into standardized data structures and definitions. - Support data strategy and governance initiatives, including data quality standards, metadata management, and documentation. - Design and guide data migration and integration efforts, moving data from legacy systems to modern platforms. - Assist with implementing security, privacy, and compliance requirements, including access controls and regulatory standards such as GDPR or HIPAA. Sr. Data Modeler | Skills & Experience - 7+ years of experience in data modeling, data architecture, or enterprise data management roles. - Strong proficiency in SQL and hands‑on experience with data modeling tools such as ERwin, Visio, or similar platforms. - Experience working with cloud data platforms (AWS, Azure, or Google Cloud) alongside traditional RDBMS technologies (Oracle, SQL Server). - Strong understanding of data warehouse design, dimensional modeling, and ETL/ELT processes. - Experience supporting data governance, metadata management, and data quality initiatives in enterprise environments. - Familiarity with Big Data technologies and modern data processing architectures is preferred.




