MLOps AI Specialist, PyTorch

Machine Learning EngineerMachine Learning EngineerFull TimeRemoteMid LevelTeam 501-1,000Since 2002H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

2 days ago

Salary

$70 - $110 / hour

Seniority

Mid Level

Postgraduate Degree2 yrs expEnglishDistributed SystemsPyTorch

Job Description

MLOps AI Specialist, PyTorch

Weekday

• Join a leading AI lab's cutting-edge Generative AI team and play a key role in developing next-generation large language models. • Contribute to AI model training and evaluation initiatives by designing, solving, and reviewing advanced machine learning infrastructure and systems challenges. • Your expertise will help improve the quality of training data used to develop frontier AI systems. • Partner with research and engineering teams to identify and address knowledge gaps in MLOps, machine learning infrastructure, and model training systems. • Design challenging, real-world tasks focused on distributed training, ML frameworks, model optimization, and infrastructure engineering. • Develop accurate, well-structured solutions to complex MLOps and ML systems problems. • Evaluate technical tasks and solutions, providing detailed and actionable feedback. • Create evaluation frameworks and scoring rubrics for training pipeline architecture, distributed systems reasoning, performance optimization, and kernel-level programming. • Contribute domain expertise to improve AI model capabilities in machine learning engineering topics. • Collaborate with other subject matter experts to ensure consistency, quality, and technical accuracy across datasets and evaluations.

Job Requirements

  • 2+ years of professional experience in ML Infrastructure, MLOps, ML Systems Engineering, or a closely related field.
  • Strong hands-on experience building and operating production-scale machine learning systems.
  • Advanced proficiency with PyTorch, including model training, optimization, and deployment workflows.
  • Experience developing, tuning, or optimizing custom GPU kernels using Triton, Pallas, or similar frameworks.
  • Demonstrated career growth and increasing technical responsibility.
  • Ability to commit to a full-time, 40-hour-per-week schedule during standard business days.
  • Excellent written communication skills and the ability to clearly explain complex technical concepts and engineering decisions.

Related Job Pages

More Machine Learning Engineer Jobs

Applied Machine Learning Engineer

Terra Quantum

Quantum technologies have the potential to solve some of the world’s biggest challenges. There have been great advances in all areas of quantum technologies, and new fields of application are opened up every day. Hybrid computer systems that combine classic high-performance computing with quantum computers are already being used to develop solutions in sectors such as logistics, healthcare, finance, energy, automotive, and aerospace. At Terra Quantum we are building the world’s leading Quantum Technology company. We offer customers world-class quantum technology expertise organized as “quantum-as-a-service”: hybrid quantum algorithms, quantum compute, and quantum-enabled security solutions. Our purpose is to pioneer quantum technologies to change the world for good. Our vision is to lead the quantum revolution and be the trailblazer in technology solutions, shaping a better future for humankind to thrive in, and our mission is to unleash the power of quantum tech to deliver meaningful solutions today. Terra Quantum is an equal opportunities employer, committed to diversity, inclusion, and employee well-being.

Role Description The Applied Machine Learning Engineer will be a member of Terra Quantum's AI Applied Research team. This team builds and delivers end-to-end machine learning solutions for industrial clients across various domains. - Own the classical machine learning craftsmanship from data exploration to client delivery. - Incorporate a quantum layer as one architectural component of an otherwise classical pipeline. - Apply classical ML methods to make hybrid solutions perform reliably on real industrial data. - Drive excellence within the team by demonstrating commitment and passion for the mission. Responsibilities - Building and delivering industry machine learning solutions. - Designing end-to-end ML pipelines for client problems in: - Time series - Routing and planning - GenAI - Natural language processing - Computer vision - Predictive modelling - Choosing the right classical method based on data characteristics. - Treating the quantum layer as a constrained component of the model. - Performing classical machine learning craftsmanship in service of hybrid models. - Executing essential parts of an ML pipeline: - Data cleaning - Leakage protection - Cross-validation design - Baseline construction - Statistical significance testing - Designing feature representations that align with the quantum component. - Profiling and improving training stability in cooperation with the quantum research team. - Contributing to internal ML libraries and SDK for future client engagements. - Supporting research and applied product development. - Translating proposed quantum machine learning algorithms into testable implementations. - Helping evaluate the measurable benefit of a quantum layer on applied tasks. Qualifications - Completed a Master's degree in computer science, applied mathematics, data science, statistics, engineering, physics, or equivalent subject. - Hands-on experience with classical machine learning through coursework, internships, or research projects. - Strong command of Python and the standard data science stack (NumPy, pandas, scikit-learn). - Experience with at least one deep learning framework (PyTorch or TensorFlow). - Comfort with classical ML toolkit including tree-based methods, gradient boosting, and kernel methods. - Experience designing rigorous experiments and reporting results with appropriate uncertainty. - Software engineering fundamentals: version control with Git, testing, reproducible environments. - Curiosity about quantum computing and willingness to learn necessary quantum concepts. - Familiarity with at least one applied ML vertical (time series forecasting, NLP, computer vision, or industrial optimisation) is a plus. - Goal-oriented and analytical, with the ability to work independently and as part of a team. - Proficiency in written and spoken English. - Legal right to live and work in the European Union or Switzerland. Requirements - Applicants must have the legal right to live and work in the European Union or Switzerland. - Unfortunately, we are unable to offer visa sponsorship for this role. Benefits - Opportunity to work with leading minds in the field of Quantum Technologies. - Gain knowledge of cutting-edge technology developments in science & engineering. - Chance to be part of one of Europe’s leading technology firms. - Welcoming, friendly, and professional colleagues. - Personal development plan with clear goals for advancement. - Competitive salary. - Flexible working arrangements. - Diverse and supportive atmosphere encouraging innovation and initiative. Company Description Quantum technologies have the potential to solve some of the world’s biggest challenges. At Terra Quantum, we are building the world’s leading Quantum Technology company, offering world-class quantum technology expertise organized as “quantum-as-a-service.” - Hybrid quantum algorithms, quantum compute, and quantum-enabled security solutions. - Access to a unique technology platform through the proprietary quantum cloud. - Focus on solving real-world challenges in machine learning, optimization, and simulation. - Committed to pioneering quantum technologies for positive global change. - Equal opportunities employer, committed to diversity, inclusion, and employee well-being.

Europe
Full TimeRemoteTeam 1,001-5,000Since 2005H1B Sponsor

Role Description Smartsheet is hiring a Senior Machine Learning Operations Engineer to architect our machine learning production lifecycle. Your mission is to maintain and deploy ML models to a scalable, reliable, and secure production environment. You will design and maintain the infrastructure, automation, and monitoring systems that ensure our AI products are high-performing and cost-effective. You will report to our Director, Analytics Engineering & Data Governance and work from our Bangalore, India office. You Will: - Model and Pipeline Automation: - Automate the deployment and retraining of ML models, from training through to production inference, by building and managing complete CI/CD/CT (Continuous Training) pipelines, adhering to MLOps best practices. - Build, fine-tune, or use pre-trained LLMs, deep learning models or traditional machine learning models. - Evaluate and recommend AI or ML solutions for the product using any combination of vendor solutions and/or custom-built models. - Governance & Compliance: - Implement model versioning, lineage tracking, and auditing to ensure compliance with security and ethical standards. - Performance Monitoring: - Continuously monitor the health and performance of production machine learning models, proactively identifying and correcting model drift, staleness, and performance degradation. - Incorporate user feedback for iterative improvements and manage necessary model retraining cycles. - Cross-Functional Collaboration: - Act as the "glue" between Data Scientists (who build models) and Software Engineers (who consume them). - Partner effectively with software engineers, product managers and business functions to integrate the machine learning solutions across Smartsheet. - Architecture and Infrastructure Management: - Provision and manage scalable cloud infrastructure using Infrastructure as Code (IaC). - Provide architectural guidance and mentorship to a team consisting of ML engineers, data scientists and analytics engineers. - Distill complex ML concepts into easy-to-follow technical documentation. Qualifications - 5+ years of experience with creating, deploying and scaling machine learning solutions in a cloud environment (eg. AWS, GCP, Azure) and ability to use tools such as SageMaker, Glue, Lambda, Docker etc. to create ML models and data pipelines. - 7+ years of programming experience in languages used in AI/ML (eg python, scala etc). - 4+ years of experience in developing deep learning and traditional ML models using common frameworks like pytorch, tensorflow, huggingface, scikit-learn etc. - Strong applied data science skills - ability to recognize data patterns, understand how and when to use various machine learning approaches (eg. supervised/unsupervised learning, deep learning etc.), and evaluate the performance of ML algorithms. - Proven ability to remain up-to-date with the latest advancements in Generative AI approaches (eg. OpenAI, LangChain, Stable Diffusion APIs). - Experience developing, documenting, and supporting REST APIs. - A degree in Computer Science, Engineering, or a related field or equivalent practical experience. Benefits - At Smartsheet, your ideas are heard, your potential is supported, and your contributions have real impact. - You’ll have the freedom to explore, push boundaries, and grow beyond your role. - We welcome diverse perspectives and nontraditional paths—because we know that impact comes from individuals who care deeply and challenge thoughtfully. - When you’re doing work that stretches you, excites you, and connects you to something bigger, that’s magic at work. Equal Opportunity Employer Smartsheet is an Equal Opportunity (EEO) employer committed to fostering an inclusive environment with the best employees. It is our policy to provide equal employment opportunities to all qualified applicants in accordance with applicable laws in the US, UK, Australia, Germany, Costa Rica, Japan, Bulgaria, and India. All qualified applicants will receive consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, protected veteran or disabled status, or genetic information. If there are preparations we can make to help ensure you have a comfortable and positive interview experience, please let us know.

India
Sigma Software Group logo

Senior / Principal AI / ML Engineer, Product AI Enablement

Sigma Software Group

We support enterprises, product houses, and startups with custom software solutions development and IT consulting.

Full TimeRemoteTeam 1,001-5,000Since 2002H1B No Sponsor

• Lead the design and implementation of scalable AI-powered solutions across the product ecosystem • Automate manual workflows using AI-driven approaches, ensuring measurable efficiency gains • Architect and build semantic data models to support diverse AI use cases • Develop agent-based architectures for intelligent task automation • Build and maintain custom AI skills and capabilities tailored to product needs • Apply spec-driven development practices to ensure consistency and maintainability of AI features • Define and enforce best practices for AI integration and scalability • Collaborate closely with product and engineering leadership on AI strategy, roadmap, and execution • Evaluate emerging AI technologies and recommend adoption strategies

Mali
Full TimeRemoteTeam 10,001+Since 1967H1B Sponsor

Role Description We are seeking a strategic and data-driven Learning Measurement & Analytics Lead to establish and execute the organization’s learning measurement strategy. This role is responsible for ensuring learning and development programs are evaluated based on measurable business outcomes, enabling data-driven decision-making and demonstrating the value of learning investments across the organization. As a trusted advisor and subject matter expert, you will partner with Learning & Development leaders, business stakeholders, and executive teams to design evaluation methodologies, conduct advanced analyses, and translate complex findings into actionable business insights. This role focuses on measuring behavior change, business impact, and return on investment (ROI), moving beyond traditional learning metrics to demonstrate organizational value. What You’ll Do - Develop and lead the organization's learning measurement and analytics strategy, ensuring alignment with business objectives and performance outcomes - Define and maintain standards for evaluating learning programs, including data collection methods, measurement frameworks, and reporting requirements - Design and implement scalable evaluation methodologies to measure behavior change, business impact, and learning effectiveness using frameworks such as Kirkpatrick Levels 3 and 4 and Phillips ROI - Build and manage learning analytics infrastructure and measurement platforms using tools such as Performativ, Qualtrics, or similar solutions - Integrate learning data with operational, talent, and business performance metrics to provide a comprehensive view of learning effectiveness - Conduct ROI analyses and advanced evaluation studies to assess the value and impact of learning initiatives - Design and execute structured evaluations, pilot programs, controlled experiments, and outcome studies to support evidence-based decision-making - Define evaluation methodologies, sampling approaches, comparison groups, and success metrics to ensure analytical rigor and validity - Perform statistical analyses using tools such as SPSS, Python, R, or equivalent analytical platforms - Develop executive-level dashboards, scorecards, and reports that communicate learning outcomes, business impact, and value realization - Translate complex analytical findings into clear, concise recommendations and business insights for senior leaders and stakeholders - Establish consistent measurement language and reporting standards focused on outcomes, performance improvement, and business value - Partner with stakeholders during project planning to define success measures, evaluation strategies, and business impact goals - Provide consultation and guidance on measurement design, data collection strategies, and interpretation of analytical findings - Develop reusable tools, templates, frameworks, and best practices to standardize learning measurement across the organization - Conduct external benchmarking and research to compare learning effectiveness against industry standards and leading practices - Build organizational capability in learning analytics and measurement through coaching, knowledge sharing, and stakeholder education - Influence leaders and business partners on the effective use of data, evidence, and measurement practices to support decision-making Qualifications - Experience in Learning Analytics, People Analytics, Organizational Research, Industrial/Organizational Psychology, or a measurement-intensive Learning & Development role - Experience designing and executing advanced evaluation studies, business impact assessments, and ROI analyses - Strong expertise in learning measurement methodologies, including Kirkpatrick Levels 3 and 4, Phillips ROI, and outcome-based evaluation frameworks - Experience building and implementing analytics infrastructure, measurement platforms, and automated reporting solutions - Experience with data visualization and reporting tools such as Power BI, Tableau, or equivalent platforms - Strong knowledge of research design, statistical methodologies, experimental design, sampling strategies, and data interpretation - Experience integrating learning data with business performance, operational, and workforce metrics - Exceptional communication and storytelling skills, with the ability to translate complex analyses into executive-level insights and recommendations Benefits - Competitive salary and performance-based bonuses - Comprehensive benefits package - Career development and training opportunities - Flexible work arrangements - Dynamic and inclusive work culture within a globally renowned group - Private Health and Dental Insurance - Pension Plan - Meals tickets - Life Insurance Please note: CVs must be submitted in English. Applications in other languages will not be considered.

Brazil