Job Closed

This listing is no longer active.

Veeva logo
Veeva

Headquartered in Pleasanton, California, Veeva is a leading provider of cloud-based software and services for the life sciences industry. As an employer, Veeva

AI Data Engineer

Location

Canada

Posted

172 days ago

Salary

$85K - $225K / year

Seniority

Senior

Bachelor DegreeEnglishPython

Job Description

AI Data Engineer

Veeva

• Evaluation Strategy & Planning: Define and establish comprehensive evaluation strategies for new AI Agents. Prioritize the integrity and coverage of test data sets to reflect real-world usage and potential failure modes • LLM Output Integrity Assessment: Programmatically and manually evaluate the quality of LLM-generated content against predefined metrics (e.g., factual accuracy, contextual relevance, coherence, and safety standards) • Creating High-Fidelity Datasets: Design, curate, and generate diverse, high-quality test data sets, including challenging prompts and scenarios. Evaluate LLM outputs to proactively identify system biases, unsafe content, hallucinations, and critical edge cases • Automation of Evaluation Pipelines: Develop, implement, and maintain scalable automated evaluations to ensure efficient, continuous validation of agent behavior and prevent regressions with new features and model updates • Root Cause Analysis: Understand model behaviors and assist in the trace and root-cause analysis of identified defects or performance degradations • Reporting & Performance Metrics: Clearly document, track, and communicate performance metrics, validation results, and bug status to the broader development and product teams

Job Requirements

  • Data Integrity & Validation: A strong, specialized understanding of data quality principles, including methods for validating datasets against bias, integrity concerns, and quality standards. Ability to craft diverse and adversarial test data to uncover AI edge cases
  • Prompt Engineering & Model Expertise: Demonstrated skill in advanced prompt engineering techniques to create evaluation scenarios that test the AI's reasoning, action planning, and adherence to system instructions. Deep knowledge of LLM common failure modes (hallucination, incoherence, jailbreaking)
  • Automated Evaluation Implementation: Proficiency in designing and deploying automated evaluation pipelines to assess complex, agentic AI behaviors. Familiarity with quality metrics such as task success rate, semantic similarity, and sentiment analysis for output measurement
  • Debugging Agentic Systems: Must be comfortable with the specific challenges of debugging agentic systems, including tracing and interpreting an agent's internal reasoning, tool use, and action sequence to pinpoint failure points
  • Programming & Frameworks: Proficiency in Python for developing custom evaluation frameworks, writing scripts, and integrating pipelines with CI/CD systems. Familiarity with standard test automation tools (e.g., Pytest, modern web automation tools)
  • Bachelor's degree in Data Science, Machine Learning, Computer Science, or a related field, with experience in Gen AI / LLMs
  • High work ethic. Veeva is a hard-working company
  • High integrity and honesty. Veeva is a PBC and a “do the right thing” company. We expect that from all employees
  • Applicants must have the unrestricted right to work in the United States or Canada. Veeva will not provide sponsorship at this time.

Benefits

  • Medical, dental, vision, and basic life insurance
  • PTO and company-paid holidays
  • Retirement programs
  • 1% charitable giving program

Related Categories

Related Job Pages

More Data Engineer Jobs

NOUS LATAM logo

Mid-Level Data Engineer

NOUS LATAM

Connect with LATAM-based tech talent for your most challenging projects!

Data Engineer172 days ago
ContractRemoteTeam 1-10Since 2022H1B No Sponsor

• Design and implement efficient data pipelines using Python and AWS services (Lambda, S3, Glue, etc.) • Ensure data quality, reliability, and scalability across multiple sources and formats • Collaborate closely with data analysts, software engineers, and product teams to deliver actionable insights • Contribute to continuous improvement of our data infrastructure and best practices

Mexico
$2.5K - $3K / month
NOUS LATAM logo

Mid-Level Data Engineer

NOUS LATAM

Connect with LATAM-based tech talent for your most challenging projects!

Data Engineer172 days ago
ContractRemoteTeam 1-10Since 2022H1B No Sponsor

• Design and implement efficient data pipelines using Python and AWS services (Lambda, S3, Glue, etc.) • Ensure data quality, reliability, and scalability across multiple sources and formats • Collaborate closely with data analysts, software engineers, and product teams to deliver actionable insights • Contribute to continuous improvement of our data infrastructure and best practices

Peru
$2.5K - $3K / month
NOUS LATAM logo

Mid-Level Data Engineer

NOUS LATAM

Connect with LATAM-based tech talent for your most challenging projects!

Data Engineer172 days ago
ContractRemoteTeam 1-10Since 2022H1B No Sponsor

• Design and implement efficient data pipelines using Python and AWS services (Lambda, S3, Glue, etc.) • Ensure data quality, reliability, and scalability across multiple sources and formats • Collaborate closely with data analysts, software engineers, and product teams to deliver actionable insights • Contribute to continuous improvement of our data infrastructure and best practices

Colombia
$2.5K - $3K / month
ComPsych logo

ETL Data Engineer

ComPsych

The World’s Largest Provider of Mental Health Services and GuidanceResources® for Life.

Data Engineer172 days ago
OtherRemoteTeam 1,001-5,000Since 1984H1B Sponsor

• Design, implement, and continuously expand data pipelines by performing extraction, transformation, and loading (ETL) activities • Gather requirements and business process knowledge to transform data in ways that meet end-user needs • Create new SDLC processes as needed and maintain and improve existing processes • Ensure data architecture is scalable and maintainable • Collaborate with business stakeholders and colleagues to design and deliver accurate, high-quality data • Investigate data to identify potential issues within ETL pipelines, notify end users, and propose effective solutions • Prepare and maintain documentation for future reference • Additional duties as assigned

United States
$115K - $135K / year
Job Closed