Job Closed
This listing is no longer active.
We save lives through cell therapy.
Principal AI/ML Engineer
Location
United States
Posted
58 days ago
Salary
0
Seniority
Lead
Job Description
Principal AI/ML Engineer
NMDP
• Participate in all phases of the AI development lifecycle, including problem framing, data analysis, solution design, model or agent development, evaluation and testing, deployment, monitoring, iterative improvement, support. • Own the technical execution of sprint deliverables from design through deployment. • Drive daily engineering momentum: run stand-ups from a technical lens, surface blockers early, and resolve them before they become delays. • Make implementation-level decisions confidently and quickly within the established architecture. • Review pull requests with a focus on correctness, performance, security, and long-term maintainability. • Ensure engineering work is aligned to acceptance criteria and Definition of Done including eval thresholds for AI features. • Design, build, and maintain reusable, scalable AI/ML systems, including model pipelines, feature engineering workflows, and inference services. • Partner with technical and business teams to translate complex business problems into effective AI/ML solutions. • Provide effort estimation, dependency analysis, and technical risk assessment for initiatives, epics, and complex features. • Act as a face of the AI Engineering team to the rest of the organization. • Provide technical leadership and engineering guidance for AI/ML solutions, ensuring alignment with enterprise standards, security requirements, and ethical AI principles. • Lead technical design reviews, influence architectural decisions, and set best practices for AI/ML development, deployment, and lifecycle management. • Mentor and guide engineers and data scientists on AI/ML design patterns, model evaluation, performance optimization, and responsible AI practices. • Communicate complex technical concepts, tradeoffs, and outcomes clearly to both technical and non-technical stakeholders. • Ensure solutions meet regulatory compliance, security, and data governance requirements, including privacy-by-design and model risk management. • Act as a trusted technical advisor to engineering leadership, technical, and business stakeholders. • Identify and resolve cross-team technical dependencies proactively, before they block sprint delivery. • Translate architecture decisions from the AI Architect into concrete, sprint-ready engineering tasks. • Partner with BSAs to pressure-test requirements for technical feasibility and surface AI-specific constraints early. • Represent the team in ARB reviews, technical design sessions, and cross-functional working groups when needed.
Job Requirements
- Bachelor's degree in computer science, Engineering, Data Science, or related field preferred. Equivalent experiences may be substituted.
- 7+ years of experience in engineering or architecture roles with combined AI/ML experiences.
- Demonstrated experience of building, deploying, or supporting traditional ML models and GenAI/ Agentic AI solutions in real-world environments.
- Experience working within modern AI development lifecycles and Agile or iterative delivery models.
- Hands-on experience designing and building AI/ML solutions from prototype to production.
- Proven ability to drive technical delivery in an agile/sprint environment to keep engineering moving.
- Exposure to MLOps practices: model versioning, experiment tracking, deployment pipelines.
- Experience with MCP (Model Context Protocol), and Familiarity with A2A patterns, or emerging agentic AI frameworks.
- Strong Python development skills, including frameworks and libraries for ML, GenAI, and Agentic AI best practices.
- Deep understanding of software engineering, including modular design, testing, version control (Git), and CI/CD pipelines.
- Proven track record of building and running PoCs to validate architecture and feasibility.
- Experience working in agile environments, participating in sprints and cross-functional delivery.
- Ability to communicate technical concepts clearly to a wide range of stakeholders.
- Eagerness and ability to quickly learn and apply new AI/ML and automation technologies.
- Demonstrated commitment to learning and applying emerging technologies responsibly.
Benefits
- medical, dental, vision, life and disability, accident/critical illness/hospital, well-being, legal, identity theft and pet benefits.
- Retirement
- paid time off/holidays
- leave
- incentive plans
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
Senior Staff Machine Learning Engineer, Feed Relevance
RedditReddit is an online platform utilized by thousands of communities to connect and converse about a wide variety of topics, including TV and movie fan theories, s
Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 121 million daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit www.redditinc.com. We’re looking for a Senior Staff Machine Learning Engineer to join our Feed Relevance team, which is responsible for the end-to-end systems that power personalization and ranking for the main Reddit feeds. This is a critical role that blends hands-on development with technical leadership, focusing on building scalable, extensible, and highly performant personalization systems. You will be instrumental in defining the technical direction for the team, collaborating closely with Product and Engineering leadership to set the strategy, and working with other individual contributors to execute and scale our systems to serve tens of millions of users. This work will directly enable our Machine Learning Engineers (MLEs) and engineers to quickly iterate and continuously improve the user experience. You will also partner with Infra and Platform teams to inform the direction of our core infrastructure, building for the future of ML at Reddit. Responsibilities: - Deliver on technical initiatives that have significant company-wide impact - Set technical direction for the broader Relevance and Feeds teams at Reddit, able to identify opportunities and influence strategy across multiple orgs - Work with management on goal setting, planning, and de-risking critical projects - Mentor and grow Senior and Staff engineers - Create a strong healthy engineering culture Qualifications: - 10+ years of industry experience building systems for relevance driven products - Subject matter expert in relevance, recommendation, and ML systems; able to solve complex problems in these domains that few others can - Deep understanding of how to build sustainable software systems at a large scale engineering organization - Experience in influencing organizations on technical direction/best practices - Experience working with cross-functional teams such as design, product, business & data teams to deliver great experiences. - Strong focus on user experience, usability, scalability, reliability and quality. You are an undying advocate for the user, and you have a deep intuition for how people & machines interact with software at scale. - High empathy, excellent communication skills, and the ability to find compromise working across the entire engineering org Benefits: - Comprehensive Healthcare Benefits and Income Replacement Programs - 401k with Employer Match - Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support - Family Planning Support - Gender-Affirming Care - Mental Health & Coaching Benefits - Flexible Vacation & Paid Volunteer Time Off - Generous Paid Parental Leave #LI-remote, #LI-JS5 Pay Transparency: This job posting may span more than one career level. In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit https://www.redditinc.com/careers/. To provide greater transparency to candidates, we share base salary ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar stage growth companies. Final offer amounts are determined by multiple factors including, skills, depth of work experience and relevant licenses/credentials, and may vary from the amounts listed below. The base salary range for this position is: $266,000—$372,400 USD In select roles and locations, the interviews will be recorded, transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording, transcription and summarization prior to any scheduled interviews. During the interview, we will collect the following categories of personal information: Identifiers, Professional and Employment-Related Information, Sensory Information (audio/video recording), and any other categories of personal information you choose to share with us. We will use this information to evaluate your application for employment or an independent contractor role, as applicable. We will not sell your personal information or disclose it to any third party for their marketing purposes. We will delete any recording of your interview promptly after making a hiring decision. For more information about how we will handle your personal information, including our retention of it, please refer to our Candidate Privacy Policy for Potential Employees and Contractors. Reddit is proud to be an equal opportunity employer, and is committed to building a workforce representative of the diverse communities we serve. Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If, due to a disability, you need an accommodation during the interview process, please let your recruiter know.
• Diseñar, desarrollar y mantener soluciones de machine learning desde la concepción hasta su implementación en producción. • Colaborar con equipos de datos, producto e ingeniería para integrar modelos en productos reales. • Analizar grandes volúmenes de datos para resolver problemas complejos mediante modelos predictivos. • Optimizar modelos y sistemas existentes para mejorar eficiencia y rendimiento. • Implementar soluciones en la nube (AWS, GCP o Azure), maximizando su escalabilidad y costo-beneficio. • Documentar y presentar los desarrollos técnicos de manera clara y efectiva. • Impulsar mejoras continuas en prácticas de desarrollo y adopción de nuevas tecnologías.
Role Description This engagement is focused on building an internal AI platform that enables developers to ship AI-powered services efficiently. The scope includes: - Model connectivity - Prompt testing and evaluation - Monitoring/observability - The underlying AI infrastructure layer The objective is to improve DevEx and reduce time-to-market for AI features. Tasks include: - Build and operate the AI platform infrastructure enabling developers to ship LLM-based services faster. - Implement and maintain Kubernetes-based runtime environments (incl. AKS) for AI workloads. - Manage infrastructure as code with Terraform (modules, environments, CI/CD automation). - Support LLM workflows: RAG, agents, prompt experimentation, evaluations, and deployment patterns. - Integrate and operate tooling such as Azure AI Foundry, LiteLLM, Langfuse, MLflow. - Orchestrate pipelines using Kubeflow Pipelines and/or Argo Workflows (build, deploy, evaluate). - Improve platform reliability and observability (monitoring, logging, tracing, cost/perf signals). - Collaborate closely with developers to streamline DX (APIs, templates, docs, golden paths, automation). Qualifications - Strong hands-on experience with Kubernetes in production (preferably AKS). - Solid Terraform expertise (IaC best practices, multi-env setups). - Practical experience supporting ML/LLM workloads in a platform or DevOps/MLOps context. - Proficiency in Python for automation, scripting, and supporting APIs/evaluation tooling. - Understanding of CI/CD, release processes, and production-grade operations. - Ability to work under tight timelines and deliver pragmatically. Requirements - Experience building internal developer platforms or “paved roads” for engineering teams. - Familiarity with LLM evaluation frameworks, prompt testing workflows, and LLM observability. - Exposure to RAG architectures, vector databases, and agentic patterns. - Experience with Kubeflow, Argo, and ML lifecycle tooling. Benefits - Long-term B2B contract. - Join a team of 5, with 3 AI Platform Engineers being added. - Remote within Europe (preferred: Croatia, Poland, Portugal, Serbia). - European working hours. - Occasionally available for meetings up to 10:00 AM PST (US overlap).
Lead Machine Learning Engineer - Refer
Referrals OnlyThoughtworks is a dynamic and inclusive community of bright and supportive colleagues who are revolutionizing tech. As a leading technology consultancy, we’re pushing boundaries through our purposeful and impactful work. For 30+ years, we’ve delivered extraordinary impact together with our clients by helping them solve complex business problems with technology as the differentiator. Bring your brilliant expertise and commitment for continuous learning to Thoughtworks. Together, let’s be extraordinary.
Lead Machine Learning Engineers at Thoughtworks use modern architectures to develop end-to-end scalable machine learning systems and applications. They use their specialized depth and breadth of knowledge to impact the achievement of client, project or service objectives and advocate for ways of working to promote and deliver excellence. They operate within the framework of functional policies, navigate through intricate challenges and apply their proficiency to contribute to the success of high-stakes projects. Their leadership extends beyond technical prowess, encompassing strategic thinking and effective collaboration to drive innovation and deliver solutions that meet and exceed organizational goals. As a lead machine learning engineer on projects, you will be leading the design of technical solutions or perhaps overseeing a program inception to build a new system and/or application. Alongside hands-on coding, as a key influencer, you will shape the trajectory of machine learning engineering initiatives, playing a pivotal role in advancing the field and ensuring impactful outcomes for the broader objectives of the company. Job responsibilities - You will embrace a strategic mindset, contributing to the direction of machine learning (ML) initiatives and aligning technical solutions with broader organizational goals. - You will play a pivotal role in program inception, shaping the development of new systems and applications from idea to reality, overseeing technical feasibility and resource allocation. - You will leverage your deep understanding of modern architectures to lead the development of scalable and maintainable ML systems, ensuring optimal performance and efficiency. - You will translate client needs into technically feasible and impactful ML applications, driving solution design and deployment within complex, high-stakes projects. - You will own the development and maintenance of ML applications, including ML pipelines, model training and deployment, and monitoring and evaluation. - As a key influencer, you will champion Responsible AI and effective ways of working within the team, advocating for a culture of excellence and continuous improvement. - You will navigate intricate technical challenges with proficiency, employing your specialized knowledge to troubleshoot issues and guide the team towards successful resolutions. - You will stay at the forefront of the evolving field of machine learning, actively seeking out and implementing new technologies and advancements to ensure Thoughtworks remains a leader in innovation. - You will foster a collaborative environment, effectively leading your team through hands-on coding alongside mentorship and guidance, empowering individual growth and knowledge sharing. - You will measure and analyze the impact of ML initiatives, iteratively refining approaches and ensuring solutions deliver tangible value to clients and the organization. Job qualifications Technical Skills - You have experience in developing a technical vision and strategy, keeping it relevant and aligned to the business needs. - You can design and execute cross-functional requirements based on business priorities. - You have experience in writing clean, maintainable and testable code, demonstrating attention to refactoring and readability of the code using Python or Shell. - You have experience with distributed systems and scalable architectures to handle large-scale ML applications. - You have experience with building, deploying and maintaining ML systems using relevant ML techniques and platforms, i.e.: Scikit-learn, Tensorflow, MLFlow, Kubeflow, Pytorch. - You have experience with building, deploying and maintaining ML systems and experience with application of MLOps principles and CI/CD to ML. - You have experience in machine learning engineering and data science, are familiar with key ML concepts, algorithms and frameworks, and understand ML model lifecycles. - You have experience with designing and operating the infrastructure required to run different types of ML training and serving workloads, i.e.: on-premise vs. cloud infrastructure, infrastructure as code, monitoring, etc. - You have hands-on experience with on-premise and cloud services for building and deploying ML pipelines, i.e.: Azure, AWS, GCP or Databricks and associated ML managed services. Professional Skills - You understand the importance of stakeholder management and can easily liaise between clients and other key stakeholders throughout projects, ensuring buy-in and gaining trust along the way. - You are resilient in ambiguous situations and can adapt your role to approach challenges from multiple perspectives. - You don’t shy away from risks or conflicts, instead you take them on and skillfully manage them. - You are eager to coach, mentor and motivate others and you aspire to influence teammates to take positive action and accountability for their work. - You enjoy influencing others and always advocate for technical excellence while being open to change when needed. - You are a proven leader with a track record of encouraging teammates in their professional development and relationships. - Cultivating strong partnerships comes naturally to you; You understand the importance of relationship building and how it can bring new opportunities to our business. Other things to know Learning & Development There is no one-size-fits-all career path at Thoughtworks: however you want to develop your career is entirely up to you. But we also balance autonomy with the strength of our cultivation culture. This means your career is supported by interactive tools, numerous development programs and teammates who want to help you grow. We see value in helping each other be our best and that extends to empowering our employees in their career journeys. About Thoughtworks Thoughtworks is a dynamic and inclusive community of bright and supportive colleagues who are revolutionizing tech. As a leading technology consultancy, we’re pushing boundaries through our purposeful and impactful work. For 30+ years, we’ve delivered extraordinary impact together with our clients by helping them solve complex business problems with technology as the differentiator. Bring your brilliant expertise and commitment for continuous learning to Thoughtworks. Together, let’s be extraordinary. #LI-Remote Salary Benefits: https://www.thoughtworks.com/en-us/careers/benefits The annual salary range posted is subject to many factors and may vary depending on experience, geographic location, job responsibilities, performance, skills and/or training. Salary $189,000—$304,000 USD See here our AI policy.



