Pinpoint is the ATS that makes complex hiring simpler.
Product Reliability Engineer
Location
United Kingdom
Posted
2 days ago
Salary
£60K - £70K / year
Seniority
Senior
Job Description
Pinpoint Applicant Tracking System
- Own the full lifecycle of issues: triage, diagnose, fix, and prevent high-impact problems across the product. Roughly half your time is reactive (the escalation queue) and half is proactive (stopping the next ticket before it's raised)
- Build internal tooling that makes other teams self-sufficient, especially our Technical Success team (part of R&D, there to resolve technical complexity on behalf of every customer-facing team). The goal: they get what they need without waiting on engineering. That covers bulk operations, config changes, diagnostics, and automation of anything manual and painful
- Build a world-class feedback mechanism back to the product squads. You'll proactively and visibly feed what you're seeing (product pain, recurring issues, what users are actually reporting) back to roadmap teams and PMs, so it's reliably ingested and the same problems stop coming back
- Make the application meaningfully faster and more performant: instrumentation, logging, monitoring, and hands-on performance work. (This matters even more right now as we scale our infrastructure function)
- Dent the backlog: fix root causes, not symptoms, and remove repeat issues so net throughput keeps improving cycle over cycle
- Reach for AI to make all of the above faster and better: embed AI into the triage and feedback loop, automate toil, and unlock the context buried in eight years of codebase and docs
- Work a light weekday escalation rotation, and ship to production independently within your first year
- The scope of this team is growing fast: as the tooling gets easier to build, what we can take on keeps expanding. There's a lot of room here for someone who wants it
Job Requirements
- 3+ years building full-stack production web applications, including hands-on Ruby on Rails and React with TypeScript
- You've owned production issues end to end before (finding, fixing, and preventing recurrence), ideally in a developer support, reliability, internal tooling, or enablement capacity
- Instinctively AI-native. You reach for Claude Code (or similar) not just to write code but to explore ideas, understand unfamiliar codebases, and embed AI into nearly everything you do
- High agency and extreme ownership: first-principles thinking, extreme resourcefulness, a strong bias for action
- An exceptional communicator who defaults to keeping people in the loop: progress updates, what you've found, where you're stuck. People rarely have to chase you
- Hungry for growth: you don't need a defined progression ladder; you demonstrate it by solving hard problems, taking work off your manager's plate, and growing fast
- Motivated by making teams more effective, not just by shipping your own features
- Startup or scale-up B2B SaaS background, ideally at a similar stage and size
Benefits
- Comprehensive healthcare – Excellent medical, dental, & vision coverage for you and your family
- Unlimited holidays – Take the time you need to rest and recharge
- Mental health support – Unlimited, immediate access to professional counseling via Spill
- Retirement contributions – 401k or pension contributions depending on your location
- Remote-first – Work where you’re most productive, with flexibility and trust as the default
- Equity with real upside – Share in the long-term value you help create
- Fully paid parental leave – Up to 16 weeks of paid leave for new parents
- Learning budget – Annual funds for courses, books, or anything that supports your growth