Invoca, the AI-powered conversation intelligence platform for B2C revenue teams.
Senior ML Engineer
Location
California + 4 moreAll locations: California | Colorado | Illinois | New York | Texas
Posted
5 days ago
Salary
$152K - $228K / year
Seniority
Senior
Job Description
Senior ML Engineer
Invoca
• Lead End-to-End MLOps and Productionization: Architect, implement, and maintain CI/CD pipelines for ML artifacts — including model evaluation, versioning, and automated deployment. Serve as the primary SME for operational excellence across the Invoca ML stack. • Design and Optimize SLM/LLM Deployment: Own the full inference infrastructure: model serving on Triton Inference Server, Baseten, and Kubernetes-based GPU infrastructure. Profile and tune for low latency and high throughput, and build robust, scalable APIs for internal and external model access. • Fine-Tune Language Models: Apply parameter-efficient fine-tuning methods (LoRA, QLoRA, PEFT) to adapt transformer-based SLMs and LLMs for high-impact NLP applications in conversation intelligence. • Evolve ML Infrastructure: Contribute to model training infrastructure, data pipelines, and data lake foundations to keep the systems powering our models reliable and scalable. • Collaborate Across Teams: Partner closely with Data Scientists, Data Engineers, and Applied AI Engineers to build the foundational ML systems behind Invoca's agentic AI products. • Deliver Customer Value: Work with product and engineering to understand customer needs and ship ML solutions that make a measurable difference.
Job Requirements
- 5+ years of ML Engineering experience with a strong production focus
- Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy)
- Demonstrated track record deploying and maintaining transformer-based NLP models in production
- Hands-on experience fine-tuning SLMs/LLMs (LoRA, QLoRA, PEFT) and optimizing models via quantization, batching, and throughput tuning
- Proficiency with inference infrastructure: Triton, Baseten, vLLM, TGI, SageMaker, Vertex AI, or similar
- Experience building production-grade APIs that expose ML models to downstream consumers
- Familiarity with MLOps tooling, model monitoring, and eval platforms (Braintrust, MLflow, or equivalent)
- B.S. in Computer Science, Engineering, Statistics, or equivalent; advanced degree a plus
- Familiarity with RLHF or preference training is a bonus.
Benefits
- Flexible Time Off – We encourage a healthy work-life balance. Our flexible paid time off policy allows you to recharge and take time away as needed.
- Paid Holidays – Invoca provides 16 U.S. paid holidays, including a winter break, giving you ample opportunity to refresh and spend time with friends and family.
- Health Benefits – Our healthcare program includes medical, dental, and vision coverage, with multiple plan options so you can choose what works best for you and your family. Fertility assistance is also included.
- Retirement – Invoca offers a 401(k) plan through Fidelity with a company match of up to 4%.
- Stock Options – All employees are invited to share in Invoca’s success through stock options.
- Mental Health Program – Well-being support on a broad range of issues is available through our SpringHealth program.
- Paid Family Leave – Up to 6 weeks of 100% paid leave is provided for baby bonding, adoption, and caring for family members.
- Paid Medical Leave – Up to 12 weeks of 100% paid leave is provided for childbirth and medical needs.
- InVacation – As a thank-you to our long-term team members, we offer a bonus after 7 years of service.
- Wellness Subsidy – We provide a subsidy that can be applied toward gym memberships, fitness classes, and more.
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
Mid-level Data Analyst – Statistics, Machine Learning
MercafacilCriamos histórias que transformam o varejo. Vem com a gente!
• You will bring statistical rigor to business decisions. • As part of the BI team, your mission is to go beyond descriptive analysis by applying statistical and machine learning models to find patterns, validate hypotheses, and generate forecasts that guide real strategies. • Model complex problems using Python. • Work with large volumes of data in ClickHouse and BigQuery. • Collaborate with product and operations teams to ensure models deliver real value. • Conduct statistical analyses to answer business questions: hypothesis testing, correlation analysis, segmentation, and predictive modeling. • Perform exploratory data analysis (EDA) to understand the data before modeling, identifying patterns and inconsistencies. • Write complex SQL queries in ClickHouse and BigQuery to extract and prepare data for analysis and modeling. • Use Python for data manipulation, exploratory analysis, and model prototyping. • Communicate results and insights clearly to business teams — translating statistics into decisions. • Collaborate with the BI team to define metrics, ensure data quality, and establish analytical best practices. • Monitor models in production, tracking performance and proposing continuous improvements.
• Design, implement, and optimize core bidding algorithms and auction mechanisms • Architect and scale bid landscape forecasting capabilities • Analyze large-scale marketplace dynamics to uncover deep insights and deliver algorithmic improvements • Drive offline evaluations and online A/B experiments to validate model performance and deliver measurable business impact • Collaborate cross-functionally with product, infrastructure, and engineering teams
• Design, implement, and optimize core bidding algorithms and auction mechanisms • Architect and scale bid landscape forecasting capabilities • Analyze large-scale marketplace dynamics to uncover deep insights and deliver algorithmic improvements • Drive offline evaluations and online A/B experiments to validate model performance and deliver measurable business impact • Collaborate cross-functionally with product, infrastructure, and engineering teams
Machine Learning Engineer, Recommendations
LingokidsHelping families around the world raise amazing kids.
• Own the production recommendation infrastructure: maintain and improve the systems that serve personalized content to millions of users, ensuring reliability, low latency, and scalability as the catalog and user base grow. • Research and prototype advanced recommendation algorithms: explore newer approaches - deep learning-based models, contextual bandits, session-based recommendations, graph-based methods - evaluate their potential, and run controlled experiments to validate uplift before production. • Produce ML models and pipelines: take prototypes (from yourself or from the team's Data Scientist) and turn them into production-grade, monitored, maintainable features integrated into the live recommendation engine. • Design scalable infrastructure: anticipate bottlenecks and design systems that can handle larger catalogs, more complex segmentations, and higher traffic - including serving layer optimization, caching strategies, and pipeline orchestration. • Build and maintain data pipelines in DBT and Databricks, ensuring clean transformations, data quality, and robust experimentation frameworks that the team can rely on. • Monitor model health in production: define retraining strategies, detect drift, and ensure recommendation quality is measured and maintained over time. • Collaborate closely with the Data Scientist and Senior Analyst to translate statistical insights and business requirements into engineering decisions.



