Job Closed
This listing is no longer active.
Here you can create the extraordinary. Join us.
Staff MLOps Engineer
Location
Canada
Posted
19 days ago
Salary
0
Seniority
Lead
Job Description
Staff MLOps Engineer
NBCUniversal
• Collaborate with partner ML and annotation engineers and TPMs to specify infrastructure and training requirements. • Design and maintain robust CI/CD and CT (Continuous Training) pipelines for complex multimodal models. • Implement versioning and storage strategies for large-scale 2D/3D datasets to ensure reproducibility and high-throughput access. • Deploy and operate systems for monitoring model performance and detecting data drift in production environments.
Job Requirements
- Graduate degree (Master's or PhD) in Computer Science, Software Engineering, or a related field.
- 5+ years of experience as an MLOps Engineer in fast-paced applied machine learning environments.
- Proficient in Python, Git, and the Unix shell.
- Deep familiarity with Docker, Kubernetes, and workflow orchestrators (e.g., Airflow, Prefect, or Kubeflow).
- Experience with collaborative tools such as Jira/Confluence, Slack, and a Git server.
- Strong mathematical background preferred for understanding the resource demands of 3D data transformations.
- High attention to detail with respect to system reliability and data security.
- Ability to translate abstract ML requirements into concrete, scalable cloud or on-premises infrastructure.
- Prior experience working with complex multidisciplinary teams (e.g., robotics, smart grids, precision agriculture, game development, or aerospace).
- Must be legally authorized to work in Canada.
Benefits
- Health insurance
- Retirement plans
- Paid time off
- Flexible work arrangements
- Professional development
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
Graduate Fellow – AI, Machine Learning, Statistical Models
Sistema FibraPelo Futuro da Indústria | Pelo Futuro do Trabalho
• Prepare a report with implementation recommendations to improve vehicle prevention and recovery metrics. • Develop statistical, machine learning, and AI models for implementation by Toyota.
• Assess and decide on the current pipeline • Build the shared AI/ML platform • Oversee the full ML lifecycle • Own training infrastructure on Databricks and Unity Catalog • Build the serving layer • Build the observability for models • Set patterns for cost-effective training and serving • Act as a mentor • Drive AI tooling adoption
• Assess and decide on the current pipeline • Build the shared AI/ML platform • Oversee the full ML lifecycle • Own training infrastructure on Databricks and Unity Catalog • Model serving • Monitoring and drift • Cost and performance • Mentor and multiply • Drive AI tooling adoption
• Assess and decide on the current pipeline • Build the shared AI/ML platform • Oversee the full ML lifecycle • Own training infrastructure on Databricks and Unity Catalog • Model serving • Monitoring and drift • Cost and performance • Mentor and multiply • Drive AI tooling adoption


