Runware logo
Runware

Generative media in the blink of an API.

Staff Software Engineer - Inference & Performance

Full-stack EngineerSoftware EngineerFull TimeRemoteLeadTeam 11-50Since 2023H1B No SponsorCompany SiteLinkedIn

Location

United Kingdom

Posted

11 days ago

Salary

0

Seniority

Lead

Job Description

Staff Software Engineer - Inference & Performance

Runware

Role Description We’re looking for a Staff Engineer to take technical ownership of latency, throughput, and reliability across Runware’s AI inference platform. This is a senior technical leadership role for someone who obsesses over performance at scale, from request ingress through GPU execution to result delivery, and who can consistently turn ambitious targets such as sub-one-second inference into production reality. As a Staff Engineer, you will define and drive the architecture, standards, and execution needed to make Runware one of the fastest and most reliable inference platforms in the market. You will work deeply across backend services, distributed systems, GPU workloads, and infrastructure, partnering closely with product, ML, and platform teams. This role is ideal for someone who enjoys operating at the intersection of systems design, performance engineering, and real-world scale, and who wants clear ownership over outcomes that matter directly to customers. - Own end-to-end inference performance across the platform, with clear responsibility for latency, throughput, and reliability targets - Lead the architecture and design of core inference systems, including request routing, async execution, queuing, GPU scheduling, and result delivery - Drive the platform toward sub-1 second inference where feasible, identifying bottlenecks across networking, services, storage, and GPU execution - Make high-impact architectural decisions with performance, scalability, and operational simplicity as first-class concerns - Partner with ML and model teams to ensure models are production-ready from a performance perspective (cold starts, batching, memory usage, concurrency) - Define performance budgets, SLAs, and success metrics, and ensure they are measured, visible, and actively improved - Lead deep-dive investigations into latency spikes, throughput degradation, and system-level performance issues - Influence and mentor engineers across teams on performance engineering, distributed systems thinking, and operational excellence - Improve tooling, observability, and profiling capabilities to make performance issues easier to detect and reason about - Advocate for pragmatic engineering best practices around testing, benchmarking, rollouts, and documentation Qualifications - Excellent experience in software engineering, with a strong focus on backend and systems development (PHP, Python, Go, Rust, or similar) - Proven experience building and operating high-performance, low-latency distributed systems in production - Deep understanding of asynchronous processing, queues, concurrency models, and back pressure - Strong intuition for performance trade-offs across CPU, GPU, networking, storage, and application layers - Experience making and defending critical architectural decisions in complex systems - Hands-on experience troubleshooting real production issues under load (latency, saturation, cascading failures) - Familiarity with modern cloud infrastructure, CI/CD, and observability stacks (metrics, tracing, profiling) - Ability to communicate clearly and influence across teams in a remote-first environment - Strong mentorship mindset and a desire to raise the technical bar across the organisation Requirements - Experience working on AI/ML inference platforms, GPU-backed workloads, or performance-critical compute systems - Knowledge of model optimisation techniques (batching, quantisation, warm-starts, memory management) - Experience with infrastructure-as-code and DevOps practices - Background in startups or fast-paced environments where speed, ownership, and pragmatism matter - Prior ownership of latency or throughput SLOs at scale Benefits - Generous paid time off – vacation, sick days, public holidays - Meaningful stock options – share in the upside you create - Remote-first setup – work from home anywhere we can employ you - Flexible hours – own your schedule outside core collaboration blocks - Family leave – paid maternity, paternity, and caregiver time - Company retreats – twice-yearly gatherings in inspiring locations

Related Job Pages

More Full-stack Engineer Jobs

Nagarro logo

Associate Staff Engineer, Salesforce Health Cloud

Nagarro

Nagarro (Frankfurt: NA9) is a leader in digital product engineering and drives technology-led business breakthroughs.

Full TimeRemoteTeam 10,001+Since 1996H1B Sponsor

Role Description We're looking for a key player in designing and developing Salesforce Health Cloud solutions, focusing on: - Care plans - Patient profiles - Assessments - Care teams - Health timelines Responsibilities include: - Writing and reviewing great quality code. - Understanding functional requirements thoroughly and analyzing the client’s needs in the context of the project. - Acting as a trusted advisor by addressing functional and technical queries, recommending optimal solution approaches, and mentoring junior developers. - Supporting the planning and design of new solutions by aligning business requirements with Salesforce Health Cloud architecture and best practices. - Envisioning the overall solution for defined functional and non-functional requirements, and defining technologies, patterns, and frameworks to realize it. - Determining and implementing design methodologies and tool sets. - Enabling application development by coordinating requirements, schedules, and activities. - Leading/supporting UAT and production rollouts. - Creating, understanding, and validating WBS and estimated effort for given module/task, and justifying it. - Addressing issues promptly and responding positively to setbacks and challenges with a mindset of continuous improvement. - Giving constructive feedback to team members and setting clear expectations. - Delivering scalable and high-quality solutions that adhere to performance, security, and compliance standards, including HIPAA requirements where applicable. - Helping the team troubleshoot and resolve complex bugs. - Coming up with solutions to any issue raised during code/design review and justifying the decisions taken. Qualifications - Bachelor’s or master’s degree in computer science, Information Technology, or a related field. Requirements - Total Experience: 5+ years - Strong hands-on experience with Apex, Lightning Web Components (LWC), Flows, and Salesforce configuration best practices. - Updated on Salesforce Health Cloud capabilities, Salesforce releases, and healthcare industry standards such as FHIR, HL7, and EHR integrations. - Ability to collaborate closely with business stakeholders to translate healthcare requirements into scalable Salesforce technical designs. - Experience in communicating effectively with users, other technical teams, and management to collect requirements, describe software product features, and technical designs. - Passionate about building great solutions. - Mentoring team members to meet client needs and holding them accountable for high standards of delivery. - Ability to understand and relate technology integration scenarios and apply these learnings in complex troubleshooting scenarios.

Southern Asia
Job Closed
Zero Hash logo

Senior Engineer, Trading

Zero Hash

Financial infrastructure for the future

Full TimeRemoteTeam 51-200H1B Sponsor

• Support the company's' vital business by contributing to the design and development of software in an event-drive microservices environment • Develop microservices in Golang • Work with platform engineers to setup new services • Respond to production issues and alerts • When necessary, communicate directly with client technical teams

New York
Tang+Company logo

Software Engineer

Tang+Company

Comprehensive occupational health and safety services

Full TimeRemoteTeam 501-1,000Since 1977H1B No Sponsor

• Design, develop, test, and maintain scalable software applications and web-based systems. • Analyze business and technical requirements to create effective software solutions. • Collaborate with stakeholders, product teams, and fellow engineers to define project scope and technical specifications. • Participate in all phases of the software development lifecycle, including planning, development, testing, deployment, and ongoing support. • Create and maintain technical documentation, system diagrams, process flows, and code comments. • Conduct system analysis and recommend improvements to enhance performance, scalability, and efficiency. • Troubleshoot and resolve software defects, production issues, and system performance concerns. • Utilize system monitoring tools and automated testing frameworks to ensure application reliability and quality. • Perform code reviews and contribute to engineering best practices, coding standards, and continuous improvement initiatives. • Mentor and support other developers through knowledge sharing, coaching, and technical collaboration. • Stay current with emerging technologies, development tools, and industry best practices.

California
Full TimeRemoteTeam 51-200Since 2007H1B No Sponsor

• Serve as the primary engineer on Ask Aya, leading iterations, feature development, and integrations • Build and maintain a rigorous evaluation harness • Contribute as a senior engineer on the broader Membership Center platform • Collaborate with product managers, designers, data, organizers, and policy staff

United States
$130K - $140K / year