Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine. By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of AI. Our global workforce includes over 7,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We’re poised for a period of explosive growth over the next few years.
AI/ML Research Engineer, LLM Post-Training & Evaluation
Location
United States
Posted
1 day ago
Salary
$80K - $175K / year
Seniority
Mid Level
Job Description
AI/ML Research Engineer, LLM Post-Training & Evaluation
Innodata Inc
Role Description Innodata is expanding its team of technical experts in LLM training, post-training, and evaluation systems. As an AI/ML Research Engineer, LLM Training & Evaluation, you will build and optimize the technical foundations that power model improvement for foundation model builders and leading labs. This role is ideal for someone who has hands-on experience fine-tuning and evaluating large language models (and ideally multimodal models), and who can bridge research and engineering in real-world customer environments. You will work closely with Language Data Scientists, Applied Research Scientists, data engineers, and client technical stakeholders to design and implement robust training/evaluation pipelines using both human-in-the-loop and AI-augmented methods. What You’ll Own - Design and implement the pipelines and tooling that connect data, evaluation, and post-training. - Help customers and internal teams move from evaluation findings to measurable model improvements. - Build fine-tuning workflows (e.g., supervised fine-tuning and preference-based optimization). - Integrate evaluation harnesses into model development loops. - Improve experiment reliability and throughput. - Support advanced evaluation scenarios such as long-context, cross-modal, and dynamic multi-turn interactions. - Contribute to Innodata’s internal R&D efforts, including benchmark datasets, evaluation frameworks, and reusable infrastructure for model assessment and post-training experimentation. - Lead or co-lead technically complex ML engineering projects from initial customer discussions through implementation and delivery. - Design, build, and improve LLM training and post-training pipelines, including data ingestion, preprocessing, fine-tuning, evaluation, and experiment tracking. - Implement and optimize evaluation systems for LLMs and multimodal models, including offline benchmarks and task-specific test harnesses. - Integrate human-in-the-loop and AI-augmented evaluation signals into model development workflows. - Build robust infrastructure and tooling for reproducible experimentation, metrics logging, and regression monitoring. - Diagnose model behavior and pipeline failures, including data issues, training instability, metric inconsistencies, and evaluation drift. - Collaborate with Language Data Scientists and Applied Research Scientists to translate evaluation frameworks into executable systems. - Work closely with customer technical stakeholders to understand goals, constraints, and success criteria; propose and implement technically sound solutions. - Contribute to internal research and platform development, including benchmark frameworks, evaluation tooling, and post-training workflow improvements. - Contribute to best practices and standards for LLM training, evaluation, and quality assurance across projects. - Mentor junior engineers and contribute to technical design reviews, documentation, and engineering rigor across the team. Qualifications - BS/MS/PhD in Computer Science, Machine Learning, AI, Applied Mathematics, or a related quantitative technical field (MS/PhD preferred). - 2-3 years of relevant industry or research engineering experience in ML/AI systems. - Hands-on experience with LLM training / fine-tuning / post-training, including at least one of: - supervised fine-tuning (SFT) - preference optimization (e.g., DPO or related methods) - RLHF / RLAIF-style workflows - task- or domain-adaptation of foundation models - Strong programming skills in Python and experience building production-quality ML code. - Experience with modern ML frameworks (e.g., PyTorch, JAX, TensorFlow) and model libraries/tooling (e.g., Hugging Face ecosystem, vLLM, distributed training stacks). - Experience designing and implementing evaluation pipelines for LLM/ML systems, including metrics computation, dataset handling, and experiment comparisons. - Strong understanding of data pipelines and ML systems engineering, including reproducibility, observability, and debugging. - Experience with large-scale distributed ML systems and performance optimization for training/evaluation workloads (GPU/accelerator environments preferred). - Experience with large-scale data processing and workflow orchestration in support of model training/evaluation. - Ability to collaborate directly with technical stakeholders including research scientists, ML engineers, data engineers, and customer technical leads. - Strong written and verbal communication skills, including the ability to explain complex technical tradeoffs to both technical and non-technical audiences. Technical Skills - ML / LLM Engineering - Experience training, fine-tuning, and evaluating transformer-based models. - Understanding of post-training workflows and model iteration loops. - Familiarity with inference-time considerations (latency, throughput, memory/performance tradeoffs) where relevant to evaluation or deployment. - Evaluation & Experimentation - Experience implementing automated evaluation pipelines and test harnesses. - Experience with experiment tracking, versioning, and reproducibility practices. - Ability to assess metric quality and ensure consistency across model comparisons. - Software / Data Engineering - Proficiency in Python and strong software engineering fundamentals. - Experience with data processing pipelines, storage formats, and scalable dataset workflows. - Familiarity with CI/CD, testing, and engineering quality practices for ML systems. Salary Range The expected salary range for this position is $80,000 – $175,000 USD per year, based on experience, skills, and qualifications. Please be aware of recruitment scams involving individuals or organizations falsely claiming to represent employers. Innodata will never ask for payment, banking details, or sensitive personal information during the application process. To learn more on how to recognize job scams, please visit the Federal Trade Commission’s guide at https://consumer.ftc.gov/articles/job-scams . If you believe you’ve been targeted by a recruitment scam, please report it to Innodata at verifyjoboffer@innodata.com and consider reporting it to the FTC at ReportFraud.ftc.gov .
Related Guides
Related Categories
Related Job Pages
More Research Engineer Jobs
Role Description As a Software Development Engineer in Test (SDET) at Ordermesh Inc, you will be a key force in evolving our automation-first quality culture across a complex microservices platform (Kafka, NoSQL, REST APIs, async workflows). The role is less about manual gatekeeping and more about building CI-integrated quality pipelines that let developers ship confidently and independently. It's for someone who wants to own the full automation strategy — from build-time testing to real-time quality metrics. What You’ll Do - Champion an automation-first mindset; manual testing should be the exception, not the rule. - Design, build, and scale automated test frameworks for APIs, UI, and end-to-end microservice validation using Node.js, Playwright, and related frameworks. - Develop load and resilience testing suites with Grafana k6 to benchmark and harden distributed systems. - Integrate test execution and quality gates deeply into GitHub Actions, ensuring every commit, PR, and deployment is validated by automation. - Collaborate closely with MCPs: Kafka event flows, service mesh routing (Istio), and inter-service contracts to design automated validation of message schemas, ACLs, and service dependencies. - Mock endpoints with services like microcks or postman to simulate responses. - Lead TDD adoption by embedding test scaffolds into developer workflows and enforcing test coverage standards across repositories. - Embed security testing and data validation checks into automation frameworks for proactive vulnerability detection. - Create test observability dashboards (via Grafana or Datadog) that expose quality metrics alongside performance and error budgets. - Perform exploratory testing to supplement automation with contextual discovery and edge-case validation. - Collaborate with developers, SREs, and product managers to drive a shared understanding of quality across environments. Qualifications - 5+ years of hands-on experience building and maintaining automated test frameworks for microservice and web applications. - Strong proficiency in Node.js, JavaScript/TypeScript, or equivalent modern language. - Demonstrated experience integrating tests into CI/CD systems; ideally GitHub Actions, Jenkins, or Azure DevOps. - Proven track record in load testing (Grafana k6) and performance analysis at scale. - Experience validating MCP integrations including: message brokers (Kafka), service meshes (Istio), and REST/gRPC endpoints. - Working knowledge of Playwright or similar browser automation frameworks. - Understanding of TDD, security testing, and DevSecOps principles. - Excellent debugging, observability, and root-cause analysis skills. - Bachelor's degree in Computer Science or equivalent practical experience. - Passion for driving automation-first culture and mentoring others in modern test engineering practices. Bonus Qualifications - Experience with Azure, Kubernetes, and containerized CI environments. - Familiarity with contract testing frameworks for validating MCP communication. - Experience with Grafana, Datadog, or similar platforms for system and test observability. - Familiarity with B2B, eCommerce, or fulfillment ecosystems.
Senior AI Research Engineer
NexthinkUnparalleled Visibility Into Issue Detection, Diagnosis, and Remediation
Company Description Nexthink is the leader in digital employee experience management software. The company provides IT leaders with unprecedented insight allowing them to see, diagnose and fix issues at scale impacting employees anywhere, with any application or network, before employees notice the issue. As the first solution to allow IT to progress from reactive problem solving to proactive optimization, Nexthink enables its more than 1,300 customers to provide better digital experiences to more than 18 million employees. Dual headquartered in Lausanne, Switzerland and Boston, Massachusetts, Nexthink has 9 offices worldwide. #LI-Hybrid Job Description We are looking for a Senior AI Research Engineer to join our research team and help shape the next generation of AI agents at Nexthink. We are looking for someone with a strong leadership mindset: someone who can take ownership of ambiguous problems, guide a small team, make pragmatic technical decisions, and turn early ideas into working prototypes. You will work in a fast-moving research environment where problems are often under-specified. You should be excited by ambiguity, when the goal is defined but the path is still unexplored. You should know how to explore, prototype, evaluate, and decide when something is "good enough" to move forward. You will collaborate closely with engineers, product leaders, researchers, and business stakeholders. This requires not only strong technical depth in AI, but also clear communication and the ability to adapt your message depending on the audience. As part of the AI team at Nexthink, you will: - Lead the design and development of AI agents and features, from vague problem definition to production deployment impacting millions of users. - Drive rapid prototyping and experimentation to validate ideas and unlock new opportunities. - Help guide the team's technical direction, priorities, and execution of rhythm. - Make fast, pragmatic decisions when trade-offs are unclear. - Support and mentor engineers while keeping the team focused on outcomes. - Maintain a strong sense of what matters: customer value, technical feasibility, speed of learning, and long-term product potential. - Explore new AI technologies, tools, models, and agentic patterns with curiosity and critical judgment. Discover our latest projects over here: https://nexthink.com/dex-frontier-labs Qualifications - BSc/MSc in AI, CS, Data Science, or a related field. - 3-7+ years of experience in software engineering and applied AI/ML (including impactful projects or research). - Strong builder mindset, and hands-on experience in AI engineering, ML, LLMs, agents, or applied AI systems. - Proven ability to lead technical work in ambiguous or fast-changing environments. - Pragmatic mindset: able to move fast without sacrificing what truly matters. - Ability to balance research exploration with concrete delivery. - Strong leadership skills, with experience in mentoring, technical ownership, or leading initiatives. - Clear and structured communicator, able to adapt communication style to engineers, executives, product teams, and customers. - High ownership, and strong ability to work collaboratively in a small, high-impact team. Nice to Have - Experience in enterprise-scale products. Additional Information We are the pioneers and trailblazers of a global IT Market Category (DEX) that is shaping the future of how the world works, giving our customers' IT Teams total digital visibility across their enterprise. Our innovative solutions integrate real-time analytics, automation, and employee feedback across all endpoints. This enables our IT teams to solve complex technical challenges, create ever more productive workplaces, and deliver happy, satisfied employees in the digital workplace. With over 1000 employees across 5 continents, Nexthink operates as One Team, connecting, collaborating and innovating to continuously grow. We call our employees 'Nexthinkers' and our commitment to diversity, inclusion, and equity is second to none. We currently have over 75 nationalities working with us, from all cultures and backgrounds, speaking many different languages. If you are looking for a change and like a nice atmosphere, lots of challenges, and having fun while working, this is a great opportunity for you! Check what we offer: - Permanent Contract and a competitive compensation package. - Beautiful office, conveniently located next to the Prilly-Malley train station - Hybrid work model balancing office and remote work, with a structured approach for new hires to foster connections and onboarding. - Flexible Hours and unlimited vacation (employees have unlimited paid time off on top of the 25 days of holidays we offer) plus 3 company-paid volunteer days. - Free access to a fitness centre inside the building. - Reimbursement of the half-fare travel card for public transport. - Reimbursement up to 50% of the cost of French classes. - Fresh fruit, cookies, and soft drinks as well. - Regular company and team events like Voluntary Days, Pizza talks, Team Building activities, hosting Meetups at the office and more! - Bonuses for referring successful hires after three months of continuous employment. - We offer a relocation package to people who are coming from another country. Please note that not all the benefits listed above are available for temporary, contract, and internship roles. To ensure you have the most up-to-date information, we recommend checking with your Recruitment Partner.
Research Engineer
Nantes UniversitéPour postuler à cette offre, l'envoi du CV et d'une lettre de motivation est obligatoire. Personnes à contacter : Jérôme JULLIEN – Jerome.Jullien@univ-nantes.fr
Role Description La personne recrutée participera à l’évaluation biologique de vésicules extracellulaires, dans le contexte de leur utilisation pour la médecine régénératrice du disque intervertébral. - Réaliser des expérimentations biologiques in vitro pour évaluer l’activité thérapeutique de vésicules extracellulaires. - Conduire des essais cellulaires permettant d’analyser l’activité anti-inflammatoire des vésicules extracellulaires formulées par les partenaires du projet. - Utiliser des techniques classiques de culture cellulaire, ainsi qu’à des explants de tissus spécifiques (disques intervertébraux ovin et bovins). - Réaliser l’analyse des effets biologiques en utilisant des techniques d’imagerie, de biochimie et de biologie moléculaire. Qualifications - Formation de niveau doctorat en biologie ou ingénierie biomédicale. - Solides connaissances en culture cellulaire, biologie moléculaire et techniques de caractérisation biologique. - Maîtrise de l’anglais scientifique (lecture, rédaction, communication). Requirements - Niveau 8 Doctorat/diplômes équivalents. Competencies - Connaissances en biologie cellulaire et moléculaire. - Connaissances du disque intervertébral. - Mettre en œuvre des cultures cellulaires, y compris explants et cultures d'organes, et réaliser des expérimentations biologiques. - Utiliser des techniques d'analyse moléculaire et biochimique (imagerie, analyses protéiques et géniques). - Rigueur scientifique et sens de l'organisation. - Capacité à travailler en équipe dans un environnement pluridisciplinaire. Location Localisation : 1, quai de Tourville 44035 Nantes Application Elements - Documents à transmettre : l'envoi du CV et d'une lettre de motivation est obligatoire. - Personnes à contacter : Catherine Le Visage : catherine.levisage@univ-nantes.fr
Research Technologist
Yale UniversityYale University is a prestigious, private, Ivy League research institution with roots dating back to the 17th century. Officially founded as Yale College in 171
Lead outreach to identify emerging research needs, provide expert guidance on software development and high-performance computing, and design instructional materials and workshops to support diverse research communities.

