Job Closed

This listing is no longer active.

Periodic Labs

From bits to atoms.

LLM Inference Engineer

EngineerEngineerFull Time Remote SeniorTeam 11-50Since 2025Company Site LinkedIn

Location

California

Posted

95 days ago

Salary

Seniority

Senior

English

Job Description

• Integrate, optimize, and operate large-scale inference systems to power AI scientific research • Build and maintain high-performance serving infrastructure that delivers low-latency, high-throughput access to large language models across thousands of GPUs • Work closely with researchers and engineers to integrate cutting-edge inference into large-scale reinforcement learning workloads • Build tools and directly support frontier-scale experiments to make Periodic Labs the world’s best AI + science lab • Make contributions to open-source LLM inference software

Job Requirements

Optimizing inference for the largest open-source model
High-performance model serving frameworks such as TensorRT-LLM, vLLM, SGLang
Distributed inference techniques (tensor/expert/pipeline parallelism, speculative decoding, KV cache management)
Optimizing GPU utilization and latency for reinforcement learning

Related Categories

Engineer

Related Job Pages

Engineer Jobs in California Remote Full-time Jobs (US)More Remote Jobs

More Engineer Jobs

Senior AI Engineer

SysMap Solutions

#sejaSysMap #SysMap #soulSysMap

Engineer95 days ago

Full Time RemoteTeam 1,001-5,000Since 1999

Company Site LinkedIn

• Design, develop, and implement production AI/ML solutions using Vertex AI and GCP. • Build data pipelines and machine learning models integrated with BigQuery. • Implement solutions using LLMs (Large Language Models), including Claude and other generative AI platforms. • Use AI-assisted coding platforms to increase productivity and code quality. • Establish and disseminate MLOps best practices, model governance, and responsible AI principles. • Collaborate with medical and business teams to translate clinical needs into technical solutions. • Ensure compliance with LGPD (Brazilian data protection law), healthcare regulations, and information security standards. • Perform code reviews and provide technical mentorship to junior and mid-level engineers. • Document architectures, processes, and technical decisions.

BigQuery Cloud Google Cloud Platform Pandas Python PyTorch Scikit-Learn SQL Tensorflow

View details: Senior AI Engineer

Brazil

Apply

Job Closed

Interview Engineer

Interview Pen

High-quality content, community, & tools to empower technologists looking to succeed in upscaling their careers.

Engineer95 days ago

Contract RemoteTeam 1-10H1B No Sponsor

Company Site LinkedIn

• Facilitate an interview through Karat's platform • Provide input on the candidate's performance, coding style, communication skills, knowledge question answers, and coding approach • Collaborate with Karat to test content, processes, and products

View details: Interview Engineer

India

Apply

Job Closed

Data Engineer – Senior/Specialist

Flow Traders

Leading global liquidity provider and market maker. Driving transparency and efficiency in global financial markets.

Engineer95 days ago

Full Time RemoteTeam 501-1,000Since 2004

Company Site LinkedIn

• Develop and maintain high-reliability, high-performance data pipelines; • Design cloud data architectures (data lake, streaming, batch processing); • Ensure data quality, consistency, reliability, and governance; • Automate data ingestion, transformation, and loading workflows (ETL/ELT); • Work with large volumes of structured and semi-structured data; • Implement monitoring, alerting, and observability for pipelines; • Collaborate with backend and product teams to make data efficiently available; • Apply best practices for data versioning and data testing; • Optimize cost and performance of data systems in cloud environments.

Airflow AWS Cloud DynamoDB ETL Google Cloud Platform MS SQL Server NoSQL PostgreSQL Python Spark SQL

View details: Data Engineer – Senior/Specialist

Brazil

Apply

Job Closed

4G/5G Optimization Engineer

remopt

conectar para transformar

Engineer95 days ago

Full Time RemoteTeam 201-500Since 1998

Company Site LinkedIn

• Lead end-to-end optimization of new Nokia wireless networks introduction in the UK, focusing on capacity and architecture design, network performance improvements, and cluster & feature acceptances • Conduct root cause analyses to identify performance issues and implement effective solutions post-launch • Collaborate with a dynamic team of experts, fostering an environment of continuous learning and knowledge sharing • Engage directly with customers to provide consultative insights on 4G and 5G technologies and best practices • Utilize cutting-edge tools and methodologies to tackle complex network performance challenges • Drive customer satisfaction through enhanced network performance and innovative service delivery

View details: 4G/5G Optimization Engineer

United Kingdom

Apply

LLM Inference Engineer

Job Description

Job Requirements

Related Guides

Related Categories

Related Job Pages

More Engineer Jobs

Senior AI Engineer

Interview Engineer

Data Engineer – Senior/Specialist

4G/5G Optimization Engineer