Job Closed

This listing is no longer active.

Deep Learning Engineer – LLM, VLM Model Compression

Machine Learning EngineerMachine Learning EngineerFull TimeRemoteLeadTeam 10,001+Since 1993H1B SponsorCompany SiteLinkedIn

Location

Poland

Posted

87 days ago

Salary

zł292.5K - zł650K / year

Seniority

Lead

8 yrs expEnglishPythonPyTorchTensorFlow

Job Description

Deep Learning Engineer – LLM, VLM Model Compression

NVIDIA

• Design and implement a deep learning framework for compressing large language and vision-language models to deliver highly optimized, high-performance AI systems used worldwide. • Develop and integrate new algorithms for pruning, NAS, and distillation in collaboration with NVIDIA researchers and engineers. • Experiment with compressing the latest LLMs and VLMs, analyzing their performance and behavior across diverse workloads. • Collaborate with researchers and engineers across NVIDIA, providing guidance on improving the design, usability and performance of workloads. • Lead best-practices for building, testing, and releasing DL software.

Job Requirements

  • 8+ years of experience in Deep Learning and SW Development.
  • BSc, MS or PhD degree in Computer Science, Computer Architecture or related technical field.
  • Hands-on experience with LLM or VLM model training or inference.
  • Excellent Python programming skills.
  • Extensive knowledge of at least one DL Framework (PyTorch, TensorFlow, JAX, MxNet) with practical experience in PyTorch required.
  • Strong problem solving and analytical skills.
  • Algorithms and DL fundamentals.

Related Job Pages

More Machine Learning Engineer Jobs

Optum logo

Sr AI/ML Engineer - Hybrid in MN or DC - Remote elsewhere

Optum

Optum, part of the UnitedHealth Group family of businesses, is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together. At Optum, we support your well-being with an understanding team, extensive benefits and rewarding opportunities. By joining us, you’ll have the resources to drive system transformation while we help you take care of your future. We recognize the power of connection to drive change, improve efficiency and make a difference in health care. Join a team where your skills and ideas can make an impact and where collaboration is key to creating technology that produces healthier outcomes.

OtherRemoteTeam 160,000Since 2011

Requisition Number: 2343638 Transform Healthcare Through AI Innovation at Optum Optum is a global organization delivering care, powered by data and technology, to help millions of people live healthier lives. At Optum.ai, we are not just witnessing the AI transformation in healthcare-we are leading it. Our mission is clear: to simplify healthcare with AI, turning insight into action at a scale few organizations in the world can match. As part of the Optum.ai team, you'll work at the intersection of cutting-edge artificial intelligence and real-world healthcare impact. From reducing administrative burden for providers to anticipating patient needs and improving access to quality care, your work will help solve some of healthcare's most complex challenges-and directly improve health outcomes for millions of people. You'll collaborate with world-class talent across data science, engineering, product, and healthcare domains, backed by the reach and stability of Optum and UnitedHealth Group. Here, responsible innovation matters. So do comprehensive benefits, meaningful career growth, and the opportunity to make a tangible difference-advancing health equity and creating a simpler, more connected healthcare experience for everyone. This is more than a job. It's a chance to shape the future of healthcare through the transformative power of AI. Join us to start Caring. Connecting. Growing together. Optum AI is UnitedHealth Group's enterprise AI team. We are AI/ML scientists and engineers with deep expertise in AI/ML engineering for healthcare. We develop AI/ML solutions for the highest impact opportunities across UnitedHealth Group businesses including UnitedHealthcare, Optum Financial, Optum Health, Optum Insight, and Optum Rx. In addition to transforming the healthcare journey through responsible AI/ML innovation, our charter also includes developing and supporting an enterprise AI/ML development platform. As a Senior AI/ML Engineer, you will design and build advanced machine learning and generative AI solutions that power enterprise applications and decision systems. You will work on large language models, retrieval-augmented generation pipelines, and agent-based AI systems while ensuring production-grade reliability, performance, and safety. You'll enjoy the flexibility to work remotely* from anywhere within the U.S. as you take on some tough challenges. For all hires in the Minneapolis or Washington, D.C. area, you will be required to work in the office a minimum of four days per week. Primary Responsibilities: - Design and deploy generative AI and NLP solutions using transformer-based architectures for enterprise use cases such as search, summarization, extraction, conversational AI, and decision support - Develop and fine-tune large language model (LLM) applications using commercial and open-source models such as OpenAI, Anthropic/Claude, Gemini, LLaMA, or Mistral - Design and implement retrieval-augmented generation (RAG) pipelines including document ingestion, chunking strategies, embedding generation, and vector database integration - Build agentic AI systems including single-agent and multi-agent workflows capable of tool use, reasoning, planning, and autonomous task execution - Implement evaluation and monitoring frameworks for generative AI systems including hallucination detection, bias monitoring, and human-in-the-loop evaluation - Develop safety and governance controls for AI systems including prompt hardening, policy enforcement, and responsible AI guardrails - Productionize AI/ML pipelines using CI/CD, testing, monitoring, and observability best practices in cloud-native environments - Collaborate with product, platform, and data science teams to translate requirements into scalable AI solutions You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in. Required Qualifications: - 5+ years of experience in machine learning, AI engineering, or applied data science including development of production ML systems - 5+ years of experience developing Python-based ML systems using frameworks such as PyTorch, TensorFlow, or Hugging Face - 3+ years of experience building Generative AI or LLM applications including prompt engineering, model fine-tuning, and LLM API integrations - 2+ years of experience designing retrieval-augmented generation (RAG) systems and vector search pipelines - 1+ year of experience building agentic AI systems including tool-calling, planning frameworks, or multi-agent workflows - 1+ year of experience using AI-assisted development or 'vibe coding' tools such as Codex, Claude Code, Cursor, Windsurf, or similar tools Preferred Qualifications: - Experience with LLM orchestration frameworks such as LangChain, LlamaIndex, or Semantic Kernel - Experience optimizing LLM latency, cost, and performance in production environments - Experience deploying AI systems on cloud platforms such as AWS, Azure, or GCP - Experience working in regulated environments with responsible AI governance requirements *All employees working remotely will be required to adhere to UnitedHealth Group's Telecommuter Policy. Pay is based on several factors including but not limited to local labor markets, education, work experience, certifications, etc. In addition to your salary, we offer benefits such as, a comprehensive benefits package, incentive and recognition programs, equity stock purchase and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with us, you'll find a far-reaching choice of benefits and incentives. The salary for this role will range from $91,700 to $163,700 annually based on full-time employment. We comply with all minimum wage laws as applicable. Application Deadline: This will be posted for a minimum of 2 business days or until a sufficient candidate pool has been collected. Job posting may come down early due to volume of applicants. At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission. UnitedHealth Group is an Equal Employment Opportunity employer under applicable law and qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations. UnitedHealth Group is a drug - free workplace. Candidates are required to pass a drug test before beginning employment. #OptumTechPJ

Minnesota
$91.7K - $163.7K / year
Job Closed
Optum logo

AI/ML Engineer - Remote

Optum

Optum, part of the UnitedHealth Group family of businesses, is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together. At Optum, we support your well-being with an understanding team, extensive benefits and rewarding opportunities. By joining us, you’ll have the resources to drive system transformation while we help you take care of your future. We recognize the power of connection to drive change, improve efficiency and make a difference in health care. Join a team where your skills and ideas can make an impact and where collaboration is key to creating technology that produces healthier outcomes.

OtherRemoteTeam 160,000Since 2011

Requisition Number: 2345237 Optum Tech is a global leader in health care innovation. Our teams develop cutting-edge solutions that help people live healthier lives and help make the health system work better for everyone. From advanced data analytics and AI to cybersecurity, we use innovative approaches to solve some of health care's most complex challenges. Your contributions here have the potential to change lives. Ready to build the next breakthrough? Join us to start Caring. Connecting. Growing together. The Optum Technology Digital team is on a mission to disrupt the healthcare industry, transforming UHG into an industry-leading Consumer brand. We deliver hyper-personalized digital solutions that empower direct-to-consumer, digital-first experiences, educating, guiding, and empowering consumers to access the right care at the right time. Our mission is to revolutionize healthcare for patients and providers by delivering cutting-edge, personalized and conversational digital solutions. We're Consumer Obsessed, ensuring they receive exceptional support throughout their healthcare journeys. As we drive this transformation, we're revolutionizing customer interactions with the healthcare system, leveraging AI, cloud computing, and other disruptive technologies to tackle complex challenges. Serving UnitedHealth Group's digital technology needs, the Consumer Engineering team impacts millions of lives through UnitedHealthcare & Optum. The Optum Technology Chief Digital Office (CDO) Leadership team is transforming Optum to be an industry-leading Consumer brand. We are on a journey towards delivering a best-in-the-industry consumer experience to our patients and providers by delivering personalized digital solutions that support our consumers throughout their healthcare journeys. This team is transforming to meet the moment - to begin radically altering the way our customers engage with the healthcare system using modern tech to solve some of the most complex problems experienced along the way. Serving all of UnitedHealth Group's digital technology needs, the CDO team is responsible for driving outcomes across nearly 30 million+ human lives with UnitedHealthcare insurance, a number which puts UHC at the top of the pack as the largest managed care provider in the United States. You'll enjoy the flexibility to work remotely * from anywhere within the U.S. as you take on some tough challenges. For all hires in the Minneapolis or Washington, D.C. area, you will be required to work in the office a minimum of four days per week. Primary Responsibilities: - Work with automation and orchestration tools such as Kubernetes, GitHub Actions or other commercial PaaS offerings - Continually monitor industry developments in Cloud infrastructure developments, tools and products used in the cloud delivery model - Work with Software, Platform Engineering and Operations teams on the development and delivery of operational ready platforms - Work with log aggregation tools like Splunk and ELK - Design automated, resilient, scalable platform solutions - Participate in development of automated delivery work flows using cloud automation and orchestration tools, Unix shell scripting and other deployment tools - Assess and interpret customer needs and requirements - Involved in solving moderately complex problems and/or conduct moderately complex analysis - Deliver the process and standards for developing new and exciting services for our customers - Develop integrations between vendor tool APIs and automation frameworks - Provide explanation and information to others on difficult issues - Coach, provide feedback and guide others - Identify/quantify scope and impact of business changes on systems - Support, design and improve on monitoring, alerting and tooling efforts - Maintain awareness of current technology assets, and the applicability and capability of each - Ability and willingness to augment current expertise with new open source or targeted vendor technologies You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in. Required Qualifications: - Bachelor's degree in Computer Science, Data Science, or related field - 3+ years of Python or Go Lang development - 2+ years of experience in AI/ML or Platform engineering - 1+ years of experience with Agile methodology - 1+ years of experience with Elasticsearch, Kafka, Prometheus, Grafana, Kubernetes - Familiarity with MCP, A2A, and AG-UI Protocols Preferred Qualifications: - Experience in full life cycle activities - Experience working across fully automated stacks in a CI/CD ecosystem - Experience with distributed data stores at scale - Experience in agile environments (using tools like Jira, GitHub Projects, CA Agile Central) - Experience working effectively across multiple functional areas in a matrixed environment - Agile DevOps delivery model experience - Health care industry experience *All employees working remotely will be required to adhere to UnitedHealth Group's Telecommuter Policy Pay is based on several factors including but not limited to local labor markets, education, work experience, certifications, etc. In addition to your salary, we offer benefits such as, a comprehensive benefits package, incentive and recognition programs, equity stock purchase and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with us, you'll find a far-reaching choice of benefits and incentives. The salary for this role will range from $98,547 to $175,978 annually based on full-time employment. We comply with all minimum wage laws as applicable. Application Deadline: This will be posted for a minimum of 2 business days or until a sufficient candidate pool has been collected. Job posting may come down early due to volume of applicants. At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission. UnitedHealth Group is an Equal Employment Opportunity employer under applicable law and qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations. UnitedHealth Group is a drug-free workplace. Candidates are required to pass a drug test before beginning employment.

Minnesota
$98.5K - $176.0K / year
Job Closed
SupportNinja logo

AI/ML Engineer

SupportNinja

Showing the world a better way to outsource.

Full TimeRemoteTeam 1,001-5,000Since 2015H1B No Sponsor

• Develop and implement comprehensive data solutions utilizing AI and machine learning. • Contribute to the development and maintenance of AI-powered chatbots (e.g., Wonderchat). • Assist in improving chatbot performance, including accuracy, response time, and user experience. • Conduct experiments and analyze results to optimize prompt effectiveness. • Participate in the design and implementation of conversational AI solutions. • Develop and refine prompts for various AI applications, such as WritingAssist and DigitalQA. • Develop and implement predictive models for various business use cases. • Analyze data, build models, and evaluate their performance. • Prepare and clean datasets for model training and evaluation. • Engineer relevant features to improve model accuracy and performance. • Stay abreast of the latest advancements in AI/ML and explore new technologies and techniques. • Conduct research and experiments to identify and evaluate new AI/ML solutions for business challenges. • Collaborate effectively with cross-functional teams, including data scientists, engineers, and business analysts. • Communicate technical concepts and findings clearly and concisely.

Philippines
Job Closed
uSoftware logo

Senior MLOps Engineer

uSoftware

smino is a fast‑growing SaaS platform used by architects, planners, and construction companies to manage projects from planning to handover. The product supports seamless communication, documentation, and task management across all stakeholders in a construction project. The platform is collaborative, mobile, and designed to streamline workflows in a traditionally complex industry.

Role Description We are looking for a Senior / Strong Middle MLOps Engineer to own ML infrastructure, model deployment, and data pipelines across the platform. This is a hands-on role at the intersection of MLOps, DevOps, and Data Engineering, focused on scaling AI systems and bringing models into stable, cost-efficient production. Responsibilities - Build and maintain scalable ML infrastructure for training and inference - Deploy and optimize ML models (batch and real-time) - Work with embeddings pipelines and vector databases - Optimize performance and cost of model deployments - Manage Kubernetes environments (EKS/GKE or similar) - Implement Infrastructure as Code (Terraform) - Build and maintain ETL/ELT pipelines - Optimize database performance (Postgres, large-scale data) - Improve CI/CD pipelines and deployment workflows - Implement monitoring and observability (Prometheus, Grafana) - Collaborate with AI engineers to productionize models Qualifications - 3–5+ years in MLOps / DevOps / Data Engineering - Strong Python skills - Hands-on Kubernetes experience - Experience with AWS or similar cloud - Experience deploying ML models to production - Solid CI/CD and Docker experience - Strong SQL and database experience (PostgreSQL) - Experience with Terraform or other IaC tools Requirements - Experience with large-scale inference or embeddings pipelines - Performance and cost optimization of ML systems - Experience with Airflow, MLFlow, Spark, or DBT - Experience with vector DBs and RAG systems - Exposure to LLM-based systems (LangChain, OpenAI, etc.) Benefits - Experience with AI-first or agent-based platforms - Experience with multi-tenant SaaS architectures - Multi-cloud experience (AWS + GCP) Key Competencies - Strong ownership and hands-on mindset - Ability to work across MLOps, DevOps, and Data domains - Focus on performance, scalability, and cost optimization - Comfortable working in fast-paced startup environment

United States
Job Closed