Job Closed
This listing is no longer active.
Circle helps businesses and developers harness the power of stablecoins for payments and internet commerce worldwide.
Senior/Staff Site Reliability Engineer
Location
Arizona | California | Utah
Posted
131 days ago
Salary
$152.5K - $205K / year
Seniority
Senior
Job Description
Senior/Staff Site Reliability Engineer
Circle
- Build and maintain production infrastructure estate
- Empower agile development teams with a high-performance CI/CD pipeline
- Design, maintain, and secure cloud infrastructure using Infrastructure-as-Code tools
- Automate operational tasks using Go, Python, and serverless solutions
- Manage and monitor Kubernetes clusters for multiple production workloads
- Ensure system reliability and security by participating in on-call rotations
- Plan, test, and implement disaster recovery strategies
- Leverage AI-powered solutions for managing infrastructure and optimizing performance
- Mentor and support team growth
Job Requirements
- 4+ years in DevOps or SRE roles
- 3+ years in CI/CD platform development and microservices support
- Strong observability, problem-solving, and performance optimization skills in complex, distributed systems
- Hands-on experience with Blue-Green, Canary, and A/B Testing deployment strategies for services and databases
- Understanding of multi-region and multi-cloud architectures
- Proficiency in Go, Python, and Shell
- Proficiency in using AI tools to support daily activities
- Excellent communication skills
- Experience with Kubernetes clusters at scale, containerization, and Helm charts
- Familiarity with database technologies (PostgreSQL, Redis, OpenSearch)
Benefits
- Health insurance
- Retirement plans
- Paid time off
- Flexible work arrangements
- Professional development
More DevOps Engineer Jobs
Integration Platform Reliability Engineer
Dealer Tire
We’re more than tires and parts. We’re a team on a mission to revolutionize the automobile dealer channel.
- Develop and maintain platform documentation, training materials and systems documentation.
- Work with infrastructure engineers to maintain the stability and reliability of the integration platforms.
- Monitor the performance and availability of the integration platforms, recommend, and implement remediations to issues.
- Follow Dealer Tire and affiliate policies regarding change control and incident management.
- Keep integration platform toolsets patched and up to date according to the appropriate policies.
- Assist Integration Specialists with platform inter-operability issues.
- Work with infrastructure teams to support the infrastructure roadmap.
- Work with Enterprise Information Security to maintain the appropriate security posture based upon policy and contract requirements.
- Assist in designing and implementing the Integration roadmap.
- Recommend, design, develop and implement small and medium scale automations.
- Help in the development of junior team members.
Project Management
- Work closely with functional groups to assure project tasks are completed accurately, timely and with quality.
- Document project tasks for scope boundaries, high-level requirements, and other aspects as needed.
- Work with functional staff to develop training materials for Information Technology systems.
Business Requirements
- Document high-level and detailed business requirements related to required platform capabilities.
- Communicate business requirements to the Information Technology staff.
- Collaborate on Information Technology projects with various functional areas and staff to clarify business requirements, answer questions and resolve problems throughout projects and on-going support.
- Collaborate with team members on the development and on-going use of audits and controls to monitor the daily transmission and collection of EDI/Integration files.
Implementation Management/Change Management
- Assist users and Integration Specialists in the review of business requirements.
- Coordinate work with various functional staff to develop user acceptance test objectives, test plans and test scripts to verify and ensure that business requirements are achieved.
- Perform system tests and user acceptance tests of Information Technology system changes as agreed with various functional staff in test plans.
- Maintain technical analysis documents, technical designs, test scripts and test results to verify Integration Platform requirements are understood and addressed completely, correctly, and consistently.
- Execute unit and integrated system test reviews as appropriate with Information Technology staff.
- Apply established mechanisms for tracking business requirements throughout the life of Information Technology projects.
- Use mechanisms to verify all requirements have been addressed completely, correctly, and consistently.
- Work with various functional and IT staff to apply established change management procedures for Information Technology projects to control scope.
- Facilitate documentation and approval of requirements changes of either a business or technical nature, during Information Technology projects, using change management procedures.
- Perform production implementations, some of which may require after hour activities to complete.
Other Responsibilities
- Timely and accurate time and status reporting.
- Adherence to security policies.
- Learn and leverage knowledge to develop mappings on tools utilized by the team.
Senior Site Reliability Engineer
CoderPad
CoderPad is the leading technical interview platform for all engineering and software development teams.
- Design, operate, and evolve production infrastructure across AWS, GCP, Heroku, and Kubernetes.
- Own and improve monitoring, alerting, and SLOs for customer-facing services.
- Lead and participate in incident response, postmortems, and long-term remediation.
- Build and maintain infrastructure-as-code, CI/CD pipelines, and automation (Terraform, GitLab CI, Kubernetes tooling).
- Drive scalability, performance, and resilience across a real-time SaaS platform.
- Ensure security, patching, and operational hygiene across all environments.
- Partner with product and engineering teams to enable safe, fast, and reliable releases.
- Actively contribute to cost visibility and cloud optimization.
- Architect & Scale Infrastructure: Design and implement multi-cluster, multi-region Kubernetes deployments using EKS, GKE, and AKS. Build infrastructure that scales across regions and cloud providers.
- Own Production Systems: Take end-to-end ownership of production infrastructure. Drive incident response, postmortems, and improvements to prevent recurrence.
- Infrastructure as Code at Scale: Build and maintain Terraform modules for complex infrastructure patterns. Manage thousands of configuration files across clusters, regions, and environments using GitOps principles.
- GitOps & Deployment Excellence: Design and optimize ArgoCD ApplicationSets and Helm chart architectures. Build deployment pipelines that enable safe, automated releases across hundreds of microservices.
- Performance & Reliability Engineering: Analyze system performance, identify bottlenecks, and implement optimizations. Improve SLOs through capacity planning, autoscaling, and architectural improvements.
- Observability & Monitoring: Build and enhance monitoring, alerting, and observability using Prometheus, Grafana, Loki, and custom tooling. Drive visibility into complex distributed systems.
- Security & Compliance: Implement security controls, compliance frameworks, and best practices across cloud infrastructure. Design secure multi-tenant architectures.
- Technical Leadership: Mentor engineers, establish best practices, and drive technical decisions. Collaborate with platform, SRE, and product teams to deliver reliable infrastructure.
Senior Deployment Engineer
Karat
Karat is the world leader in technical interviewing and pioneer of the Interviewing Cloud.
- Serve as the principal technical advisor to enterprise clients, establishing yourself as the authoritative voice on Karat's solutions and building high-level trust relationships.
- Partner with Software Engineers globally to thoroughly analyze their hiring processes and performance requirements; ensure precise solution alignment and Karat product delivery as the lead technical expert in Customer Operations and GTM.
- Work strategically with the Company's GTM team throughout the entire customer lifecycle.
- Presenting Karat's technical solution to prospects as the subject matter expert.
- Architecting and implementing the initial Karat interview framework for each new enterprise client.
- Conducting regular strategic reviews with customers to ensure alignment with business objectives and optimal performance.
- Designing and delivering executive-level training sessions for client stakeholders.
- Analyze complex performance data and calibrate assessment metrics in collaboration with Karat’s Content and Data teams; translate findings into actionable strategic recommendations that strengthen client partnerships.
- Drive continuous improvement of the Customer Operations and GTM teams' internal processes by identifying innovative opportunities to deliver additional value to our enterprise clients.




