Vectara

Vectara is The Trusted GenAI Platform for All Builders - Retrieval Augmented Generation-as-a-Service (RAGaaS).

Machine Learning (ML) Engineer

Machine Learning Engineer | Other | Remote | Senior | Team 11-50 | H1B No Sponsor

Location

United States

Posted

112 days ago

Salary

$0

Seniority

Senior

Bachelor Degree | 9 yrs exp | English | Pandas

Job Description

Vectara provides a scalable platform to deploy your enterprise AI agents and AI assistants with accuracy, security, and explainability like no other solution. Our enterprise RAG platform offers unparalleled accuracy, security, and explainability by leveraging the strongest models for retrieval, embedding, and reranking, an optimized LLM trained for quality, and advanced hallucination mitigation. We are the developers of the Hughes Hallucination Evaluation Model and Correction model, which are core to ensuring accuracy, quality, and responsible AI that is production ready. These innovations have been cited in the New York Times, Visual Capitalist, and many other leading publications. This platform has helped us win more than 100 enterprise clients, including large US military organizations and companies in financial services, healthcare, and manufacturing.

Our founding team includes industry veterans and experts in neural information retrieval and distributed systems from Google. Join us as we pursue our mission to help the world find meaning.

People at Vectara are passionate about ensuring customers take advantage of breakthroughs in applied Artificial Intelligence (AI) to solve real-world technology and business problems today. Our team is a group of unquestionable all-stars in their respective fields of computer science and business from Google, Cloudera, Splunk, MongoDB, Elastic, and more.

Job responsibilities

  • Design, prototype, research, and build AI systems for Vectara.
  • Train, evaluate, and deploy ML models in the domains of Natural Language Processing, Information Retrieval, AI Agents, Large Language Models (LLMs), and Multimodal Large Language Models (MMLLMs).
  • Improve the quality of Vectara's AI Agents and RAG-as-a-service platform, working on features like multilinguality, self-supervised learning, agentic behavior, and hallucination reduction.
  • Publish technical blogs, papers, and patents.

Preferred requirements

  • PhD in Computer Science/Engineering with 1+ years of industry experience.
  • Publications in prestigious venues such as ACL, NAACL, EMNLP, NeurIPS, ICML, and ICLR as a key author.
  • Experience as an ML engineer in an early-stage, high-growth environment.
  • Expertise in the following areas:
      • Embedding models, rerankers
      • Multimodal retrieval, question answering, and reasoning
      • Vector databases, BM25
      • Planning and reasoning in LLMs
      • Multilinguality in LLMs
      • NLG evaluation such as hallucination detection

Location requirements

We support remote applicants from all over the US, but candidates who can come to our Palo Alto office 2-3 days a week are preferred.

Equity and Salary Range

Salary is just one component of Vectara's employee compensation. Our full-time employees are also equity owners in the company, which, although not an immediate cash component, can have a positive impact on long-term total compensation for each participating employee. We would be remiss if we didn't highlight and celebrate our focus on engaging many of our employees in being economic co-owners of the business.

Vectara welcomes all. We value the collective wisdom of people from different backgrounds, experiences, abilities, and perspectives. We never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. Vectara has a positive and supportive culture; we look for people who are inventive and work to be a little better every single day. We seek to be smart, humble, hardworking and, above all, curious. After all, we are on a mission to find meaning.

Perks and Benefits

  • 100% paid Medical, Dental, and Vision coverage that begins on your first day.
  • Option of a Health Savings Account (HSA) or Flexible Spending Account (FSA).
  • Generous paid time off (PTO) plus paid sick time, holidays, and company rest days.
  • Professional development and training opportunities.
  • Company virtual happy hours, fun team building activities, and more.

Job Requirements

  • BS/MS in Computer Science, Statistics, Electrical/Computer Engineering, Mathematics, or a related field.
  • 5+/4+ years of professional work experience after BS/MS applying machine learning to real-world problems, and crafting scalable and effective ML/AI solutions.
  • Strong domain knowledge in at least one of the following: RAG, LLM, information retrieval, Multimodal LLMs.
  • Excellent programming skills in Python.
  • Proficiency in data/ML libraries such as pandas, transformers, and torch.
  • Familiarity with the technical details of deep learning concepts, such as Transformers, Retrieval-Augmented Generation (RAG), and mixture of experts (MoE).
  • Hands-on experience in training ML systems end-to-end from data curation to evaluation and deployment.

Related Job Pages

More Machine Learning Engineer Jobs

Senior Data Engineer - MLE - AI

IA na iFood

We are a Brazilian technology company and a reference in Latin America. Through innovative solutions, we connect thousands of restaurants to millions of consumers every day, averaging 100 million orders per month. Beyond food delivery, we also offer Grocery, Pharmacy, and Pet. We also have iFood Pago, our fintech, which encompasses iFood Benefícios (iFood's food and meal voucher) and iFood Pago itself, the bank for restaurants.

This description is a summary of our understanding of the job description. Click the 'Apply' button to find out more.

Role Description

  • Develop and operationalize machine learning models, ensuring the implementation and maintenance of efficient pipelines.
  • Deploy models and manage the model lifecycle (LCM) within the GenPlat platform.
  • Collaborate with the team to foster a strong AI and MLOps culture, promoting practices of innovation and continuous improvement.
  • Support the implementation of GenAI-based agents and tools, contributing to technical and organizational stability.
  • Actively participate in project planning and execution, ensuring alignment with the company's strategic objectives.

Qualifications

  • Experience in MLOps and model deployment, with a solid foundation in software engineering.
  • Knowledge of cloud infrastructure, preferably AWS, and of pipeline operations.
  • Familiarity with open-source solutions and experience in tech-first environments.
  • Clear communication and effective collaboration in multidisciplinary teams.
  • A mindset geared toward innovation and adaptation to rapid change.

Requirements

  • Knowledge of GenAI and its practical application in projects.
  • Ability to lead technical initiatives and foster an environment of continuous learning.
  • Experience increasing efficiency in model lifecycle management.
  • Ability to adapt quickly to new technologies and complex challenges.
  • Advanced English.

United States + 1 more
All locations: United States | Canada
Job Closed
Senior Open-Source Machine Learning Engineer, Computer Vision

Stride, Inc.

Stride, Inc., formerly known as K12 Inc., is a leading provider of personalized online education programs and services, including customized tutoring, online ed

  • Work mainly with existing open-source libraries, such as Transformers and Datasets, to boost the support for vision or multi-modal models and datasets.
  • Foster one of the most active machine learning communities, helping users contribute to and use the tools that you build.
  • Interact with researchers, ML practitioners, and data scientists on a daily basis through GitHub, our forums, or Slack.

New York
Job Closed
Senior Open-Source Machine Learning Engineer, Computer Vision - US Remote

Stride, Inc.

Stride, Inc., formerly known as K12 Inc., is a leading provider of personalized online education programs and services, including customized tutoring, online ed

At Hugging Face, we're on a journey to democratize good AI. We are building the fastest growing platform for AI builders, with over 11 million users who have collectively shared over 2M models, 700k datasets, and 600k apps. Our open-source libraries have more than 600k stars on GitHub.

About the Role

As an Open-Source ML engineer in Computer Vision, you will work mainly with existing open-source libraries, such as Transformers and Datasets, to boost the support for vision or multi-modal models and datasets. You will bring your computer vision expertise to provide the best computer-vision tool stack in the machine learning ecosystem and work with us to provide the best, simplest, and most intuitive computer-vision library in the industry. You'll get to foster one of the most active machine learning communities, helping users contribute to and use the tools that you build. You'll interact with researchers, ML practitioners, and data scientists on a daily basis through GitHub, our forums, or Slack.

About you

  • Deep expertise in computer vision: object detection, segmentation, generative models, or multimodal systems.
  • Strong open-source presence: you've contributed significantly to CV libraries (e.g., OpenCV, Detectron2, MMDetection, or Hugging Face's own transformers/diffusers) as a core contributor or maintainer.
  • Scalability mindset: experience optimizing models for production, deploying at scale, or improving inference efficiency.
  • Collaboration & mentorship: you enjoy working with cross-functional teams, reviewing PRs, and guiding junior contributors.
  • Alignment with our mission: you believe in democratizing AI and want to empower millions of builders with state-of-the-art tools.

If you love open-source, are passionate about the new development of Transformers models in computer vision, have experience building, optimizing, and training such models in PyTorch and/or TensorFlow and serving them in production, and want to contribute to one of the fastest-growing ML libraries, then we can't wait to see your application!

If you're interested in joining us but don't tick every box above, we still encourage you to apply! We're building a diverse team whose skills, experiences, and backgrounds complement one another. We're happy to consider where you might be able to make the biggest impact.

More about Hugging Face

We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where people feel respected and supported, regardless of who you are or where you come from. We believe this is foundational to building a great company and community. Hugging Face is an equal opportunity employer and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to grow continuously. We provide all employees with reimbursement for relevant conferences, training, and education.

We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer parental leave and flexible paid time off.

We support our employees wherever they are. While we have office spaces in NYC and Paris, we're very distributed and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.

We want our teammates to be shareholders. All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.

We support the community. We believe major scientific advancements are the result of collaboration across the field. Join a community supporting the ML/AI community.

New York
Job Closed
Senior Machine Learning Engineer, Data Mining

Motional

We're making driverless vehicles a safe, reliable, and accessible reality.

Other | Remote | Team 1,001-5,000 | Since 2020 | H1B Sponsor

  • Architect and Train Distilled Models: Design and implement teacher-student model frameworks for multimodal sensor data. Develop training pipelines for knowledge distillation. Ensure student models maintain high accuracy while drastically reducing inference latency and memory footprint.
  • Reinforcement Learning for Data Discovery: Build RL-based policy learning and reasoning systems for autonomous driving applications. Implement and scale RL training workflows (e.g., PPO, DQN, actor-critic methods) for simulation and real-world interaction. Explore reward shaping, environment modeling, and multi-agent RL where applicable.
  • Optimize Model Deployment for Real-Time Inference: Collaborate with backend engineers to deploy distilled and RL models into production. Optimize for latency, throughput, and hardware efficiency across GPU/CPU clusters. Implement model versioning, A/B testing, and monitoring for performance regressions.
  • Research and Integrate Agentic Systems: Explore and prototype agentic workflows for autonomous reasoning, chain-of-thought prompting, and goal-directed behavior. Integrate such systems into our broader autonomy stack as experimental or production components.
  • Drive Production Reliability: Establish patterns for graceful degradation, fault tolerance, and cost optimization. Operate Omnitag as a mission-critical data platform serving the entire ML organization, with a focus on reliability, debuggability, and operational excellence.
  • Mentor and Collaborate: Work closely with ML scientists, data engineers, and autonomy teams to translate research advances into scalable engineering solutions. Guide junior engineers in best practices for model training, evaluation, and deployment.

United States
$172K - $229K / year