In Tandem

AI Engineer

AI Engineer | Machine Learning Engineer | Full Time | Remote | Senior | Team 51-200 | H1B No Sponsor

Location

Minnesota

Posted

6 days ago

Salary

$100K - $135K / year

Seniority

Senior

Bachelor Degree | 5 yrs exp | English | AWS | Docker | Python

Job Description

Run and optimize our self-hosted inference stack
• Run the inference serving layer on our own GPU hardware: choose and tune the serving stack (vLLM, SGLang, TensorRT-LLM) for high throughput and low latency.
• Optimize aggressively: tensor parallelism, quantization (FP8, AWQ, GPTQ), KV-cache and prefix caching, continuous batching, speculative decoding, concurrency tuning.
• Serve multiple models and features off shared hardware: multi-LoRA, routing, and request scheduling that balances internal workloads against latency-sensitive product traffic.

Keep our AI fast, efficient, and observable
• Make our AI workloads efficient: improve latency, throughput, and GPU utilization so we get the most out of what we run.
• Build the visibility: instrument performance and usage across our AI surfaces so there's clear data on how everything is running.
• Surface the technical tradeoffs (performance, latency, efficiency) so the people making the calls have what they need to make them.

Build AI features and proactive agents
• Ship the in-app agent layer that helps families coordinate: proactive nudges, smart suggestions, agents that summarize, draft, schedule, and act for busy parents.
• Build the substrate underneath: tools, memory, orchestration, guardrails, and evaluation harnesses, integrated cleanly with production APIs alongside our architecture team.
• Work in nimble pairs with feature owners, standing up whatever's needed to test an idea, including a vibe-coded UI when that's the fastest path to a real customer. Ship rough, learn fast, harden what works.

Job Requirements

5+ years shipping production software, including meaningful applied AI or ML work.
Demonstrated experience running and optimizing self-hosted LLMs on dedicated multi-GPU hardware: a serving stack (vLLM, SGLang, or TensorRT-LLM) and the optimization that comes with it (tensor parallelism, quantization, batching, KV cache).
A track record of optimizing inference performance and efficiency (latency, throughput, GPU utilization).
Strong Python and engineering fundamentals, with the full-stack range to stand up a quick UI and a genuine desire to work on app-layer features, not only infra.
Hands-on with agent frameworks (Claude Agent SDK, LangGraph, or similar), LLM APIs, embeddings, and RAG.
Comfortable with AWS and the devops this role owns: Docker, CI/CD, monitoring, and observability.
Experience building internal tooling or platforms others depend on. Bonus for Slack apps, MCP, or agent orchestration at team scale.

Benefits

Medical: In Tandem pays 100% of the premium for employees AND 99% for all additional family members
401k: Up to a 4% match with immediate vesting
Paid leave for all new parents
Learning & Development stipend for employees
Paid Time Off: 11 Holidays + Winter Break (3 Days) + Volunteer Time Off (1 Day) + Floating Holiday (1 Day)
Personal Time Off: 15 days for 0-1 years of employment, 20 days for 1-3 years of employment
Supportive and flexible working environment – work from anywhere!

More AI Engineer Jobs

Senior Agentic AI Engineer

Thryv

AI Engineer | 6 days ago

Full Time | Remote | Team 1,001-5,000 | H1B Sponsor

Role Description

This role is responsible for designing, developing, implementing, troubleshooting, and optimizing scalable, high-performance software and product applications. Leveraging industry best practices, the Sr. Software & Product Engineer delivers robust, customer-focused solutions that accelerate product innovation.

- Assesses and defines software and product requirements, establishing the specifications and standards that guide scalable, high-quality development.
- Executes coding, debugging, testing, and troubleshooting across the full development lifecycle, incorporating AI-assisted and agentic development approaches to maintain quality and delivery efficiency.
- Develops and advances software and product capabilities that integrate with design systems, infrastructure, databases, and cloud-based platforms, all with the goal of maximizing operational efficiency.
- Evaluates application requirements and architects database solutions that ensure scalability, performance, and data integrity.
- Serves as a subject matter expert in AI-assisted development practices; acts as a resource for the engineering team on the effective and disciplined application of AI coding tools, informs development standards around AI use, and reinforces quality expectations through code reviews.

Qualifications

- Bachelor's degree (or international equivalent) or equivalent experience, required.
- 5+ years of related experience, required.
- 2+ years of Agentic Engineering, required.
- Experience with Claude Code, Cursor, or a comparable LLM-based tool.
- 5+ years of Python, required.
- Experience with Git/GitHub, JIRA, Confluence, CircleCI, required.
- Experience developing in Agile, SCRUM, or similar iterative methodologies, required.
- Experience in fast-growing companies or entrepreneurial environments, required.
- 9+ years of related experience, preferred.
- Demonstrated knowledge of component-based frontend architecture and modern frontend development principles, enabling scalable, modular front-end development.
- Skilled in frontend build tools and development pipeline practices.
- Advanced knowledge of software testing methodologies enabling robust, reliable test coverage.
- Expert-level knowledge of AI-assisted development tools and agentic coding workflows, with the ability to apply engineering judgment to evaluate, refine, and integrate AI-generated code into production software delivery.
- Technical proficiency to translate detailed business requirements into actionable technical specifications and determine the most effective implementation approach using a wide range of tools and technologies.
- Industry knowledge of current software engineering practices and emerging development methodologies.
- Ability to serve as a technical resource for the engineering team on AI-assisted development practices, guide peer adoption of effective AI tool use, and reinforce quality standards for AI-generated code.
- Ability to influence and contribute to architectural design.
- Ability to travel less than 5% of the time.
- Must be 18 years of age or older.
- Must successfully complete pre-employment screening process, as required.
- Must successfully complete any required training or orientation courses, as needed.

Benefits

- Work from anywhere – Thryv is a Remote First company!
- Competitive medical, dental, and vision plans, plus a wellness program with added incentives.
- 401(k) savings plan with company match and employee stock purchase plan.
- Continuing education benefits with tuition assistance programs.
- One week of paid time off at the end of the year, in addition to our standard paid time off policy.

Worldwide

$120K - $140K / year

Analytics & AI Architect

Infomineo

Leading the business of "Brainshoring" by outsourcing activities like Research, Analytics, Design, and Language Services

AI Engineer | 6 days ago

Full Time | Remote | Team 201-500 | H1B No Sponsor

Lead the technical architecture of Data, Analytics and AI solutions for our clients, covering the full lifecycle from design to deployment:
• Design end-to-end data architectures: data lakes, lakehouses, warehouses, and streaming pipelines.
• Define standards for data modeling, storage, ingestion, and transformation across client engagements.
• Architect MLOps and AI deployment infrastructure (model registries, CI/CD for ML, monitoring).
• Lead technical decisions on cloud platforms (Azure, AWS, GCP) and open-source tooling.
• Define best practices and reusable frameworks for data engineers, analysts, and data scientists.
• Act as a technical mentor and reviewer for cross-functional project teams.
• Bridge the gap between data analysts, data engineers, and AI/ML engineers on complex projects.
• Contribute to internal knowledge base, toolkits, and delivery accelerators.
• Lead architecture workshops and discovery sessions with client stakeholders.
• Translate business requirements into scalable, robust technical blueprints.
• Present architecture decisions to both technical teams and executive audiences.
• Support pre-sales and proposal efforts with technical scoping and solution design.
• Provide internal training and knowledge-sharing sessions with the team.
• Support the Head of Practice on business development and internal capability initiatives.

Airflow | Amazon Redshift | AWS | Azure | BigQuery | Cloud | Docker | Google Cloud Platform | Hadoop | Kafka | Kubernetes | PySpark | Python | Scala | Spark | SQL | Tableau

Egypt

Mid/Senior AI Engineer, Networking

CodiLime

A strategic partner for technology-driven companies | Network engineering | Software engineering

AI Engineer | 6 days ago

Contract | Remote | Team 201-500 | Since 2011 | H1B No Sponsor

• Developing MCP-like tools that expose network device APIs and CLI commands with clear descriptions, structured inputs/outputs, validation logic, and error handling
• Managing tool metadata and supporting semantic search over available tools using a vector database
• Creating golden user queries, expected answers, and query variations for specific tools, intents, and network-operation scenarios
• Building automated tests to verify correct tool selection, tool parameterization, output structure, and end-to-end agent responses
• Designing evaluation workflows combining deterministic checks, human review, and LLM-as-a-judge techniques
• Refining prompts, tool descriptions, schemas, and agent workflows while monitoring regressions when new tools or changes are introduced
• Developing production-quality Python code and tests

Python

Poland

zł16K - zł25K / month

Senior AI Engineer

Futurice

Empowering the world to act.

AI Engineer | 6 days ago

Full Time | Remote | Team 501-1,000 | Since 2000 | H1B No Sponsor

• Design and build cutting-edge AI applications – from MCP servers and voice-to-voice real-time apps to RAG bots and multi-agent systems.
• Advise clients on AI strategy – join workshops, contribute to proposals, and help clients understand what's possible and what's right for them.
• Build demos and PoCs – fast, purposeful, and client-ready without over-engineering.
• Mentor junior engineers – share knowledge, review code thoughtfully, and help grow the team around you.
• Shape our AI practice – contribute to shared tools, frameworks, and ways of working.

AWS | Azure | Cloud | Google Cloud Platform | Python

United Kingdom

£60K - £90K / year
