Job Closed

This listing is no longer active.

ML Engineer – AI Platform Lead

AI EngineerMachine Learning EngineerFull TimeRemoteSeniorTeam 51-200H1B No SponsorCompany SiteLinkedIn

Location

Hong Kong

Posted

64 days ago

Salary

0

Seniority

Senior

Bachelor Degree5 yrs expEnglishAssemblyPythonPyTorch

Job Description

ML Engineer – AI Platform Lead

BTSE

• Deploy and optimise a large language model for production inference: quantisation, continuous batching, low-latency serving. • Build the RAG pipeline: document chunking, embedding generation, vector storage, cross-encoder reranking, and context assembly optimised for a 128K-token context window. • Build the context layer: per-tenant system prompts, dynamically retrieved few-shot exemplars, task routing (classifying incoming requests to the right prompt configuration). • Build defensive output parsing: structured JSON output from an unmodified base model with graceful fallbacks. • Design and implement the feedback collection pipeline: capturing user corrections and ratings, automatically generating training data candidates for future fine-tuning. • Design the custom model training workflow: tenant-scoped LoRA training on client-specific data, model evaluation, A/B testing, and isolated deployment. • Monitor and improve inference quality: parsing failure rates, citation accuracy, hallucination rates, latency — all tracked per tenant. • Iterate on prompts daily with the domain expert during the pilot phase.

Job Requirements

  • 5+ years ML engineering; 2+ years working with large language models in production.
  • Hands-on experience with LLM serving frameworks (vLLM, TGI, or equivalent).
  • Deep experience building RAG pipelines: chunking strategies, embedding models, vector databases, reranking.
  • Strong prompt engineering skills for production applications — you know how to make a base model produce consistent, structured, high-quality output.
  • Python: PyTorch, Transformers, FastAPI.
  • Familiar with LoRA/QLoRA fine-tuning workflows.
  • Experience building multi-tenant ML serving infrastructure.
  • Experience with financial or crypto AI applications.
  • Experience with cross-encoder reranking models (DeBERTa or similar).
  • Understanding of data isolation requirements for ML training pipelines.

Related Job Pages

More AI Engineer Jobs

Insight logo

Sr. AI Engineer

Insight

Now is the time to bring your expertise to Insight. We are not just a tech company; we are a people-first company. We believe that by unlocking the power of people and technology, we can accelerate transformation and achieve extraordinary results. Fortune 500 Solutions Integrator with deep expertise in cloud, data, AI, cybersecurity, and intelligent edge. Guiding organizations through complex digital decisions.

AI Engineer64 days ago
Full TimeRemoteTeam 10,001

Requisition Number: 104152 Senior Software Engineer – AI Solutions Location: You will have the flexibility to work fully remote from anywhere across the US. Insight at a Glance - 14,000+ engaged teammates globally - $8.2 billion in revenue in 2025 - Placed on Newsweek’s America’s Greatest Workplaces for 2025 Now is the time to bring your expertise to Insight. We are not just a tech company; we are a people-first company. We believe that by unlocking the power of people and technology, we can accelerate transformation and achieve extraordinary results. As a Fortune 500 Solutions Integrator with deep expertise in cloud, data, AI, cybersecurity, and intelligent edge, we guide organisations through complex digital decisions. About the role As a Senior Software Engineer (SSE), AI Solutions, you will play a hands-on role in building and delivering AI-powered solutions that support our clients’ transformation initiatives. You will work closely with AI Architects, Principal Engineers, and cross-functional teams to implement scalable, high-quality systems across AI, cloud, data, and software engineering. This role is ideal for experienced engineers who enjoy strong technical execution, contributing to solution-level design, and growing toward technical leadership while operating at the leading edge of applied AI. As a Senior Software Engineer, AI Solutions, you will: - Design and implement components for AI, ML, and data science initiatives. - Translate business and technical requirements into well-structured designs, code, and implementation tasks. - Contribute as a senior individual contributor on AI-focused workstreams, supporting high-quality execution and delivery outcomes. - Build, test, and deploy AI, ML, cloud, analytics, and software solutions in production environments. - Collaborate with AI Architects and senior engineers to align implementation decisions with broader architectural direction. - Contribute to technical excellence through code reviews, shared ownership of quality, and knowledge sharing. - Support discovery and delivery activities by providing technical input, estimates, and feasibility feedback. - Stay current with emerging AI technologies and apply them thoughtfully to client solutions. What we’re looking for - Bachelor’s degree in a technical field or equivalent hands-on engineering experience. - Solid experience in software engineering with delivery of production-grade systems. - Practical experience in one or more of the following areas, with working knowledge across others: - AI/ML & Agentic Systems - Implementing GenAI solutions and agent-based workflows - Integrating LLMs, embeddings, and retrieval systems - Supporting ML/LLMOps pipelines and experimentation frameworks - Cloud Engineering - Building cloud-native applications on Azure and/or AWS - Applying distributed systems and scalability patterns - Data Engineering & Analytics - Implementing data pipelines, vector stores, and analytics workloads - Working with structured and unstructured data sources - Software Engineering - Proficiency in Python, SQL, Java, C++, or similar languages - Experience with modern frameworks, DevOps practices, and CI/CD pipelines Preferred Certifications (or equivalent experience) - AWS: ML Specialty, AI Practitioner, ML Engineer, or Solutions Architect - Azure: AI Engineer Associate, Data Scientist Associate, Generative AI Fundamentals - Databricks: ML Associate, Data Engineer Associate, Generative AI Engineer Associate Who You Are - A capable technologist who communicates clearly with engineers, architects, and stakeholders. - Comfortable operating in delivery-focused environments with guidance from senior leaders. - Collaborative and effective working within cross-functional teams. - Motivated to grow through mentorship, feedback, and hands-on experience. - Driven to build real-world AI systems that deliver measurable business value. What you can expect We’re legendary for taking care of you, your family and to help you engage with your local community. We want you to enjoy a full, meaningful life and own your career at Insight. Some of our benefits include: - Freedom to work from another location—even an international destination—for up to 30 consecutive calendar days per year. - Full medical, dental, vision, and company 401k match. But what really sets us apart are our core values of Hunger, Heart, and Harmony, which guide everything we do, from building relationships with teammates, partners, and clients to making a positive impact in our communities. Join us today, your ambITious journey starts here. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process. At Insight, we celebrate diversity of skills and experience so even if you don’t feel like your skills are a perfect match - we still want to hear from you! Insight does not accept unsolicited resumes from recruiters or employment agencies. Unsolicited resumes will be treated as direct applications from the candidate, and recruiters or agencies who submit candidates for this position without a prior, written vendor agreement will not be eligible for any form of compensation, even if the candidate is hired. Salary Range - $120,000 - $135,000 The position described above provides a summary of some the job duties required and what it would be like to work at Insight. For a comprehensive list of physical demands and work environment for this position, click here. Insight is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, sexual orientation or any other characteristic protected by law. Posting Notes: Remote || Arizona (US-AZ) || United States (US) || Engineering || None || Remote ||

United States
$120K - $135K / year
Insight logo

Principal AI Engineer

Insight

Now is the time to bring your expertise to Insight. We are not just a tech company; we are a people-first company. We believe that by unlocking the power of people and technology, we can accelerate transformation and achieve extraordinary results. Fortune 500 Solutions Integrator with deep expertise in cloud, data, AI, cybersecurity, and intelligent edge. Guiding organizations through complex digital decisions.

AI Engineer64 days ago
Full TimeRemoteTeam 10,001

Requisition Number: 104198 Principal Software Engineer, AI Solutions Location: You will have the flexibility to work fully remote from anywhere across the US. Insight at a Glance - 14,000+ engaged teammates globally - $8.2 billion in revenue in 2025 - Placed on Newsweek’s America’s Greatest Workplaces for 2025 Now is the time to bring your expertise to Insight. We are not just a tech company; we are a people-first company. We believe that by unlocking the power of people and technology, we can accelerate transformation and achieve extraordinary results. As a Fortune 500 Solutions Integrator with deep expertise in cloud, data, AI, cybersecurity, and intelligent edge, we guide organisations through complex digital decisions. About the role As a Principal Software Engineer (PSE), AI Solutions, you will play a senior, hands-on role in building and delivering AI-powered solutions that support our clients’ transformation initiatives. You will work closely with AI Architects and cross‑functional teams to implement scalable, high-quality systems across AI, cloud, data, and software engineering. This role is ideal for experienced engineers who enjoy deep technical execution, solution-level design, and mentoring, and who want to operate at the leading edge of applied AI without owning enterprise-wide architecture or technical strategy. As a Principal Software Engineer, you will: - Design and implement solution-level components for AI, ML and data science initiatives. - Translate business and technical requirements into detailed designs, code, and implementation plans. - Serve as a technical lead for AI-focused workstreams, ensuring high-quality execution and delivery outcomes. - Build, test, and deploy AI, ML, cloud, analytics, and software solutions in production environments. - Collaborate closely with AI Architects to align implementation decisions with broader architectural intent. - Contribute to technical excellence through code reviews, mentoring engineers, and sharing best practices. - Support presales and discovery activities by contributing technical insight, estimates, and feasibility assessments. - Stay current with emerging AI technologies and help apply them pragmatically to client solutions. What we’re looking for - Bachelor’s degree in a technical field or equivalent hands-on engineering experience. - Strong experience in software engineering with demonstrated delivery of complex, production-grade systems. - Practical depth in at least one of the following areas, with working knowledge across others: - AI/ML & Agentic Systems - Implementing GenAI and agentic workflows - Integrating LLMs, embeddings, and retrieval systems - Supporting ML/LLMOps pipelines and experimentation frameworks - Cloud Engineering - Building cloud-native applications on Azure and/or AWS - Applying distributed systems and scalability patterns - Data Engineering & Analytics - Implementing data pipelines, vector stores, and analytics workloads - Working with structured and unstructured data sources - Software Engineering - Proficiency in Python, SQL, Java, C++, or similar languages - Experience with modern frameworks, DevOps practices, and CI/CD pipelines Preferred Certifications (or equivalent experience) - AWS: ML Specialty, AI Practitioner, ML Engineer, or Solutions Architect - Azure: AI Engineer Associate, Data Scientist Associate, Generative AI Fundamentals - Databricks: ML Associate, Data Engineer Associate, Generative AI Engineer Associate Who You Are - A strong technologist who communicates effectively with engineers, architects, and project stakeholders. - Comfortable operating in delivery-focused consulting environments with multiple priorities. - Collaborative by nature and effective working within cross-functional teams. - Experienced mentoring other engineers through example, code reviews, and technical guidance. - Motivated by building real-world AI systems that deliver measurable business value. What you can expect We’re legendary for taking care of you, your family and to help you engage with your local community. We want you to enjoy a full, meaningful life and own your career at Insight. Some of our benefits include: - Freedom to work from another location—even an international destination—for up to 30 consecutive calendar days per year. - Full medical, dental, vision, and company 401k match. But what really sets us apart are our core values of Hunger, Heart, and Harmony, which guide everything we do, from building relationships with teammates, partners, and clients to making a positive impact in our communities. Join us today, your ambITious journey starts here. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process. At Insight, we celebrate diversity of skills and experience so even if you don’t feel like your skills are a perfect match - we still want to hear from you! Insight does not accept unsolicited resumes from recruiters or employment agencies. Unsolicited resumes will be treated as direct applications from the candidate, and recruiters or agencies who submit candidates for this position without a prior, written vendor agreement will not be eligible for any form of compensation, even if the candidate is hired. Salary Range - $130,000 - $150,000 The position described above provides a summary of some the job duties required and what it would be like to work at Insight. For a comprehensive list of physical demands and work environment for this position, click here. Insight is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, sexual orientation or any other characteristic protected by law. Posting Notes: Chandler || Arizona (US-AZ) || United States (US) || IT Infrastructure & Support; Data & AI || None || US - Chandler,AZ ||

United States
$130K - $150K / year
Insight Global logo

AI Engineer I

Insight Global

Founded in 2001, Insight Global (IG) offers enhanced staffing, placement staffing, and temporary-to-permanent staffing services, including long-term and short-term job assignments.

AI Engineer64 days ago

• Contribute to the design and development of AI-powered platforms and systems, working alongside senior engineers on complex initiatives. • Own end-to-end delivery of complex projects, from problem discovery and system design through production rollout and iteration. • Build AI-driven features, intelligent automation, and agentic systems that enhance customer experiences and improve team productivity. • Embed AI capabilities into existing team workflows, products, and processes, ensuring solutions integrate naturally. • Integrate AI capabilities into customer-facing products, internal tools, and operational workflows. • Build full-stack components spanning backend services, APIs, data layers, and frontend interfaces. • Translate business and operational problems into maintainable, AI-enabled technical solutions with guidance from senior team members. • Apply systems thinking to ensure reliability, performance, and security across platforms. • Collaborate with engineers, teams, and stakeholders across the company to understand requirements and deliver impactful solutions. • Participate in production support and incident response for team-owned systems, learning operational best practices. • Engage in code reviews and knowledge sharing to continuously improve skills and contribute to team growth.

Mexico
$2.5K - $3.3K / month
Full TimeRemoteTeam 11-50Since 2018H1B No Sponsor

• Advanced Prompt Engineering: Designing complex, dynamic prompt templates with conditional logic and efficiently reusing information and context within prompts to maximize generation quality and reasoning. • Structured Outputs & Schemas: Implementing various response schemes (JSON mode, function calling, Zod/JSON schemas) to ensure AI outputs are predictable and ready for seamless integration into application logic. • Prompt Engineering & Evaluations: Building robust evaluation pipelines and using Langfuse to collect feedback and score the quality of responses in real time. • Tracing & Debugging: Performing deep debugging of complex LLM chains using Langfuse traces to identify bottlenecks and optimize for cost, latency, and context window usage. • AI A/B Testing: Running systematic experiments across different models via OpenRouter (e.g., comparing Claude 3.5 Sonnet vs. GPT-4o) and analyzing results based on quantitative metrics. • Data-Driven Decisions: Making deployment decisions for new prompts or models strictly based on quantitative benchmarks and trace data, rather than intuition. • Output Scoring & Analysis: Developing scoring systems to analyze the “Problem → Solution” chain and identify root causes of hallucinations or logic errors using Langfuse analytics. • Model Performance & Fine-Tuning: Regularly re-evaluating model performance as new architectures emerge and performing fine-tuning when necessary to meet specific domain requirements.

Georgia