Assured is a claims automation insurtech backed by leading Silicon Valley investors.
Staff Site Reliability Engineer
Location
United States
Posted
78 days ago
Salary
$180K - $210K / year
Seniority
Lead
Job Description
- Provision infrastructure and tooling required to deliver the Assured platform, from scaling existing products to launching novel infrastructure to support entirely new products and initiatives
- Create automated tooling to configure and maintain our platform and services with an eye for consistency and repeatability
- Build sustainable methods for monitoring, managing, and scaling our platforms and services
- Work with various teams to improve observability in different product areas
- Identify, recommend, and implement strategies for complying with security regulations and industry best practices
- Handle incident support and on-call rotation
- Lead, mentor, and learn from other engineers on our growing team
Job Requirements
- You can autonomously build and scale high-quality products and environments from early stages to maturity
- You are inclined to automate, but can discern when automation isn’t the best solution and present alternatives
- You take the initiative to identify and solve important problems, coordinating with others on cross-cutting technical issues
- You make others better through code reviews, thorough documentation, technical guidance, and mentoring
- You are persistent in the face of roadblocks, dispatching them efficiently, pulling in others as necessary
- You have experience designing, implementing, and maintaining highly available and scalable database solutions, ideally with PostgreSQL.
- You are self-motivated, requiring minimal direction/oversight
- You have a passion for implementing and supporting observability platforms, SLOs, monitoring and alerting
- You have experience working in a scaling environment with Terraform, AWS and Kubernetes
- You have a strong engineering background and a history of designing systems
Benefits
- Competitive Compensation: Competitive salary and equity packages for all employees
- Healthcare Plan: Platinum medical, dental, and vision
- Free life insurance: Including long-term disability & short-term disability
- Unlimited PTO: Uncapped vacation days & paid holidays
- Family Leave: Maternity & paternity
- 401(k) Contribution: Assured contributes 3% of your income, even if you don't contribute
- WFH Benefits: Lunch on us 2x/week, monthly phone stipend & other home office perks
- Health FSAs & HSAs: Pre-tax accounts for out-of-pocket medical expenses
- Team events & Offsites: We're remote, but we regularly get together
More DevOps Engineer Jobs
Principal Engineer – Release Engineering
Fastly
Founded in 2001, Fastly is a privately-held internet company offering the Fastly Edge Cloud platform, a content delivery network that helps digital businesses s
- Design, build, and operate release tooling across building, packaging, signing, artifact management, and deploying software.
- Drive initiatives that make our engineers happier and more productive by reducing lead time for changes.
- Collaborate with development and SRE teams to develop policies, standards, guidelines, governance, and related guidance for CI/CD operations.
- Support developers with build automation, merge resolution, CI, test automation, and deployment, based on tool usage, policies, and standards.
- Troubleshoot issues along the CI/CD pipeline.
- Participate in the on-call support rotation.
- Ensure the reliability and availability of production services by applying SRE practices
- Design and implement monitoring and observability with Prometheus, Grafana, and ELK
- Manage microservice architectures, applying resilience patterns (circuit breaker, bulkheading, service discovery)
- Develop and maintain automations and services in Java (8/11) with good testing practices
- Administer and optimize containers and deployments on Docker and Kubernetes
- Carry out performance testing, capacity analysis, and continuous performance improvement
- Participate in on-call rotations and the resolution of critical incidents, including post-mortems
- Apply Chaos Engineering to validate system resilience
- Implement AIOps practices to improve automated incident detection and response
- Collaborate within a multidisciplinary squad, contributing technical vision and coordinating with development, QA, and product
Site Reliability Engineer
Dayforce
Dayforce is a global HCM platform offering a comprehensive array of services encompassing payroll, HR, benefits, workforce management, talent, and analytics. With the mission of "m
Dayforce is a global human capital management (HCM) company headquartered in Toronto, Ontario, and Minneapolis, Minnesota, with operations across North America, Europe, Middle East, Africa (EMEA), and the Asia Pacific Japan (APJ) region. Our award-winning Cloud HCM platform offers a unified solution database and continuous calculation engine, driving efficiency, productivity and compliance for the global workforce. Our brand promise, Makes Work Life Better™, reflects our commitment to employees, customers, partners and communities globally.
Location: For this role, we are open to remote work and can hire anywhere in Great Britain.
About the Opportunity
Site Reliability Engineers at Dayforce bridge the gap between software engineering and operations, enabling faster feature delivery while maintaining high standards of reliability, scalability, and availability. You’ll be joining Dayforce’s growing SRE team, where we apply established SRE principles and engagement models to support our mission-critical platforms. Our focus is on building reliable systems through automation, strong engineering practices, and proactive monitoring, reducing toil and preventing issues before they impact customers.
In this role, you’ll help design and maintain internal tooling that monitors, alerts, and in some cases autonomously remediates our environments. You’ll collaborate closely with development teams, contribute to platform reliability initiatives, and play a key role in incident response and continuous improvement.
For the first two months, this role will align with North American working hours to support onboarding and knowledge transfer. After this period, you’ll move to standard UK working hours.
What you’ll get to do
- Develop a strong understanding of Dayforce’s cloud infrastructure and application ecosystem.
- Support onboarding of new services, ensuring they meet Production Readiness Review standards.
- Drive reliability improvements, automate operational tasks, and reduce manual toil.
- Participate in incident response, root cause analysis, and remediation activities.
- Create and maintain runbooks and reusable operational components.
- Contribute to our inner-source SRE repositories and shared tooling.
- Build trusted relationships across engineering, product, and operations teams.
- Take part in a shared PagerDuty on-call rotation.
Skills and Experience we Value
- A proactive, curious engineer with a strong desire to learn and improve systems.
- 2–4 years’ experience in SRE, Systems Administration, Infrastructure, or Software Engineering roles.
- Experience with at least one object-oriented programming language (C# or Java preferred).
- Experience with at least one scripting language (Python or PowerShell preferred).
- Experience working with relational databases and query languages (MSSQL/T-SQL or PostgreSQL/PL-SQL preferred).
- Strong communication and collaboration skills.
- Familiarity with containerisation, Kubernetes, or Terraform is a plus, but not required.
What’s in it for you
Dayforce is fueled by the diversity of our talented employees. We are an equal opportunity employer and consider and embrace ALL individuals and what makes them unique. We believe our employees should be happy and healthy, with peace of mind and a sense of fulfillment. We encourage individuals to apply based on their passions.
Dayforce encourages personal and professional growth. We offer excellent time away from work programs, comprehensive wellness initiatives and recognition through competitive pay and benefits. With a commitment to community impact, including volunteer days and our charity, Dayforce Cares, we provide opportunities for you to thrive both in your career and personal life. Our focus is not just on your job but on supporting you to be the best version of yourself.
Fraudulent Recruiting
Beware of fraudulent recruiting. Legitimate Dayforce contacts will use an @dayforce.com email address. We do not request money, checks, equipment orders, or sensitive personal data during the recruitment process. If you have been asked for any of the above, or believe you have been contacted by someone posing as a Dayforce employee, please refer to our fraudulent recruiting statement found here: https://www.dayforce.com/be-aware-of-recruiting-fraud
Dayforce actively monitors all job applications to ensure authenticity. Submissions determined to be fraudulent or misleading will be declined from the recruitment process.
- Designs, develops, tests, and implements DevOps and MLOps solutions supporting CI/CD pipelines, infrastructure automation, and AI/ML lifecycle management.
- Leads platform and foundational initiatives spanning multiple applications and environments.
- Defines and applies standards, patterns, and best practices to ensure stability, scalability, and security.
- Identifies and resolves complex system and performance issues impacting delivery and operations.
- Serves as a leader within the DevOps team, providing guidance on design decisions and implementation approaches.
- Leads architecture and design discussions, communicating solutions clearly to peers and partners.
- Contributes to the definition of future-state platform capabilities and shared technical roadmaps.
- Mentors and supports less-experienced engineers through knowledge sharing and technical guidance.
- Provides support for complex production issues and escalations within area of expertise.
- Supports release activities and ensures operational readiness for new and existing platforms.
- Develops and executes validation and testing strategies for platform changes.
- Balances operational support with delivery of new capabilities across multiple concurrent initiatives.
- Collaborates with engineering, data science, infrastructure, and security teams to deliver integrated solutions.
- Participates in and contributes to design reviews, planning sessions, and cross-functional meetings.
- Communicates technical concepts effectively to technical and non-technical stakeholders.
- Documents design, standards, and decisions in a clear and consistent manner.
- All other duties as assigned.



