Meet the future of sustainable, self-driving delivery.
Lead Engineer, Reinforcement Learning – Scenario Generation
Location
California
Posted
175 days ago
Salary
$190K - $230K / year
Seniority
Senior
Job Description
Lead Engineer, Reinforcement Learning – Scenario Generation
Serve Robotics
• Develop RL algorithms that can help with terrain intelligence and social navigation behaviors. • Design, build, and optimize large-scale RL training pipelines (distributed compute, GPU clusters, containerized workflows). • Implement curriculum learning, domain randomization, and multi-agent RL strategies. • Optimize RL model performance, sample efficiency, and stability across thousands to millions of simulation steps. • Build automated tools for experiment orchestration, rollout collection, and metrics visualization. • Develop procedural generation pipelines for synthetic environments, agents, and dynamic behaviors. • Build tools to generate long-tail scenarios, sudden appearance of objects, traffic behaviors, rare events, and environmental variations. • Create systems for configuration, validation, and scoring of generated scenarios. • Collaborate with autonomy, ML, and safety teams to map real-world failures into repeatable synthetic simulation cases. • Design APIs to connect RL agents, scenario generators, planners, and environment simulators. • Debug and optimize simulation performance (real-time speed, determinism, reproducibility). • Work with 3D assets, traffic models, mapping systems (e.g., Isaac Sim, CARLA, Unity, Gazebo). • Partner with autonomy, data, and modeling teams to define training objectives and scenario requirements. • Translate real-world logs and edge cases into parameterized procedural content. • Document tools, frameworks, and workflows for internal users.
Job Requirements
- Master’s degree in Robotics, AI, Computer Science, Mathematics, or a related field.
- 7+ years of professional experience with shipping transformer based AI models handling complex navigation or manipulation tasks in AV or robotics solutions at scale in the real world.
- 3+ years technical leadership/architecture experience
- Strong experience with Reinforcement Learning (PPO, SAC, A3C, DQN, multi-agent RL, or equivalents).
- Hands-on experience with distributed training frameworks (Ray RLlib, Accelerate, PyTorch Distributed, Kubernetes, or similar).
- Proficiency in Python and C++ for performance-critical simulation or graphics pipelines.
- Experience building or modifying simulation environments (Isaac Sim, Unity, Unreal, CARLA, Gazebo, MuJoCo or custom engines).
- Experience with procedural generation (noise functions, rule-based systems, agent scripts, behavior trees).
- Experience with GPU compute, containers, and cloud infrastructure.
Benefits
- Offers Equity
Related Guides
Related Job Pages
More Full-stack Engineer Jobs
Principal Engineer – AI and Intelligence Systems
WISEcodePrecision nutrition data made simple, usable, human.
• Translate product ideas, domain concepts, and company vision into intelligent systems that reason over food, ingredients, nutrition, and user preferences. • Design and implement AI-powered features end-to-end, including model selection, orchestration, inference pipelines, confidence handling, and integration with product workflows. • Guide teams through decisions on model tradeoffs, data quality, evaluation strategies, observability, explainability, and long-term system evolution. • Influence without authority — shaping engineering culture, mentoring senior engineers, and raising the organizational bar for how intelligence is built and trusted. • Lead solution design for high-complexity, AI-driven initiatives and deliver production-ready implementations when needed. • Define patterns and standards for safe, observable, and explainable AI usage across teams. • Diagnose and resolve issues across data inputs, model behavior, system integration, and user-facing outcomes. • Mentor senior engineers and elevate engineering culture around intelligent system design. • Partner closely with product, data, CTO, and platform teams to shape roadmap feasibility and long-term intelligence strategy.
• Build deployment and integration tooling to streamline the installation and configuration of our C2 software across different customer environments and hardware configurations • Debug complex distributed systems issues in the field, analyzing logs, network traffic, and system performance to resolve software failures quickly • Develop automated test frameworks and scripts to validate software functionality, performance, and integration with third-party systems • Write code daily to solve customer-specific integration challenges, create monitoring solutions, and improve deployment reliability • Collaborate remotely and on-site with customers to understand their technical requirements and translate them into software solutions • Travel to test sites (up to 75%) to ensure software deployments succeed, troubleshoot issues that arise, and gather requirements for engineering improvements • Operate drones during testing (Part 107 license required) to validate software behavior, not as a primary pilot but as an engineer testing code
• Architect a frontend for a Command & Control (C2) system for heterogeneous aircraft swarms • Build distributed, collaborative real-time infrastructure for controlling large numbers of aircraft in real-time with intermittent communications • Develop and test new user interface concepts for swarming UAVs for representing data like vehicle telemetry, sensor data, scheduling, crew resources, data link management and more • Own key new full stack features from design to deployment • Ship great software early & often with excellent CICD, QA, DevSecOps
Senior Software Engineer, Platform IAM
CriblCribl, the Data Engine for IT and Security, empowers organizations to transform their data strategy.
• Help define and refine the software development practices that make our team effective • Write clean, maintainable, and testable code with an eye towards observability and resilience • Coach and empower the less senior engineers on your team to help them level up and own larger epics • Work with cross-functional team members and stakeholders to decide on the vision and scope of your product area • Work with Management, Product Engineering, and Operations to plan and forecast quarterly goals that include a mix of tech debt, bug fixes, and new features • Champion the entire software development lifecycle from requirements and design to testing, deployment, and production support/monitoring • Take on new adventures across the full-stack as we continue to push Cribl Cloud forward • Share on-call and take part in team that embraces a model of service ownership • This position will require stand-by, on-call, or off-hours duties


