Job Closed
This listing is no longer active.
Remote first tech projects
Principal AI/ML Engineer
Location
Massachusetts
Posted
101 days ago
Salary
0
Seniority
Lead
Job Description
Principal AI/ML Engineer
Pragmatike
• Architect, build, and scale the end-to-end ML Ops pipeline, including training, fine-tuning, evaluation, rollout, and monitoring.
• Design reliable infrastructure for model deployment, versioning, reproducibility, and orchestration across cloud and on-prem GPU clusters.
• Optimize compute usage across distributed systems (Kubernetes, autoscaling, caching, GPU allocation, checkpointing workflows).
• Lead the implementation of observability for ML systems (monitor drift, performance, throughput, reliability, cost).
• Build automated workflows for dataset curation, labeling, feature pipelines, evaluation, and CI/CD for ML models.
• Collaborate with researchers to productionize models and accelerate training/inference pipelines.
• Establish ML Ops best practices, internal standards, and cross-team tooling.
• Mentor engineers and influence architectural direction across the entire AI platform.
Job Requirements
- Deep hands-on experience designing and operating production ML systems at scale (Staff/Principal-level expected).
- Strong background in ML Ops, distributed systems, and cloud infrastructure (AWS, GCP, or Azure).
- Proficiency with Python and familiarity with TypeScript or Go for platform integration.
- Expertise in ML frameworks and tooling: PyTorch, Transformers, vLLM, LLaMA-Factory, and Megatron-LM, plus a practical understanding of CUDA / GPU acceleration.
- Strong experience with containerization and orchestration (Docker, Kubernetes, Helm, autoscaling).
- Deep understanding of ML lifecycle workflows: training, fine-tuning, evaluation, inference, model registries.
- Ability to lead technical strategy, collaborate cross-functionally, and operate in fast-paced environments.
Benefits
- Competitive salary & equity options
- Sign-on bonus
- Health, Dental, and Vision
- 401k
Related Job Pages
More Machine Learning Engineer Jobs
Senior Machine Learning Engineer, Search
Insider One
The #1 platform that brings everything marketing and customer engagement teams need in one place, to become unstoppable.
• Design, build, and release AI products/features that solve real user problems
• Design and implement streaming/batch data pipelines to support training and inference
• Write production-ready code and move it to production with the help of cloud services and CI/CD techniques
• Continuously monitor and improve the quality and performance of the systems
• Learn by doing while owning real problems and demonstrating end-to-end ownership throughout the system
• Build resilient, performant pipelines and services in production
• Own system architecture, performance, observability, and scalability
• Collaborate across teams to turn ideas into impactful products
• Mentor engineers and help set technical direction
Machine Learning Engineer – Search
Insider One
The #1 platform that brings everything marketing and customer engagement teams need in one place, to become unstoppable.
• Design, build, and release AI products/features that solve real user problems
• Design and implement streaming/batch data pipelines to support training and inference
• Write production-ready code and move it to production with the help of cloud services and CI/CD techniques
• Continuously monitor and improve the quality and performance of the systems
• Learn by doing while owning real problems and demonstrating end-to-end ownership through the whole system
Staff Software Engineer, Machine Learning
Smarter Technologies
The Automation and Insights Platform for Healthcare Efficiency
• Innovate: Stay at the forefront of AI/ML advancements, continuously exploring new techniques, models, and tools. Your insights will help drive innovation and keep our platform at the cutting edge of technology.
• Collaborate: Work closely with core platform and automation engineers, as well as other stakeholders, to identify and address business needs through Machine Learning solutions. You will play a key role in bridging the gap between AI research and practical application.
• Integrate: Utilize APIs, pre-trained models, and other tools to implement AI-driven features across our platform. You will focus on integrating these capabilities into production systems, enhancing the functionality and performance of our solutions.
• Optimize: Fine-tune and optimize existing solutions for specific product requirements, ensuring that AI integrations are efficient, scalable, and reliable.
• Build, train, and refine machine learning and deep learning models using time-series, sensor, and behavioral data.
• Integrate data from wearables, fitness tracking platforms, and device APIs to create a clear story from movement, patterns, and activity signals.
• Develop and maintain data pipelines that support both batch and real-time analytics.
• Own model deployment in production environments: your models won’t live in notebooks; they’ll live in the world.
• Work closely with engineering teams to integrate ML models into mobile and web apps.
• Support logic for fraud, spoofing, and anomaly detection, ensuring data reflects real human activity.
• Make complex outputs easy to understand, not just for engineers but for product and business users too.



