Job Closed
This listing is no longer active.
CoderPad is the leading technical interview platform for all engineering and software development teams.
Senior Site Reliability Engineer
Location
United States
Posted
126 days ago
Salary
$170K - $180K / year
Seniority
Senior
Job Description
Senior Site Reliability Engineer
CoderPad
• Design, operate, and evolve production infrastructure across AWS, GCP, Heroku, and Kubernetes. • Own and improve monitoring, alerting, and SLOs for customer-facing services. • Lead and participate in incident response, postmortems, and long-term remediation. • Build and maintain infrastructure-as-code, CI/CD pipelines, and automation (Terraform, GitLab CI, Kubernetes tooling). • Drive scalability, performance, and resilience across a real-time SaaS platform. • Ensure security, patching, and operational hygiene across all environments. • Partner with product and engineering teams to enable safe, fast, and reliable releases. • Actively contribute to cost visibility and cloud optimization.
Job Requirements
- 5+ years of experience in SRE, DevOps, Platform Engineering, or Cloud Infrastructure roles.
- Strong experience with AWS and GCP, including networking, IAM, compute, and managed services.
- Hands-on experience running Kubernetes in production.
- Strong knowledge of Terraform or equivalent infrastructure-as-code tooling.
- Experience with CI/CD systems (e.g., GitLab CI or similar).
- Solid understanding of observability (metrics, logs, traces) using tools like Datadog, Prometheus, Grafana, or similar.
- Proficiency in at least one programming or scripting language (Go, Python, Node.js, or similar).
- Strong Linux and Bash skills.
- Experience operating high-availability, customer-facing SaaS platforms.
Benefits
- Meaningful work with high impact for a well-loved product
- Competitive, market-rate salaries
- Stock options with a 4-year vesting schedule
- Medical, dental, and vision insurance (90% covered for employees and dependents)
- Flexible Spending Account (FSA)
- 401K with profit sharing
- Unlimited paid time off with an expectation of taking 3 weeks annually in addition to 20 company holidays
- Remote-friendly environment with monthly WFH stipend
- Parental leave (primary: 16 weeks; secondary: 12 weeks)
- Short- and long-term disability and life insurance coverage
- Choice of laptop computer
- Internal mobility and growth opportunities
- And more…
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Staff Cloud Operations Engineer
Extreme NetworksExtreme Networks is a San Jose, California-based networking solutions company that is driven by software and is able to help deliver stronger and more quality connections with empl
• Architect & Scale Infrastructure: Design and implement multi-cluster, multi-region Kubernetes deployments using EKS, GKE, and AKS. Build infrastructure that scales across regions and cloud providers. • Own Production Systems: Take end-to-end ownership of production infrastructure. Drive incident response, postmortems, and improvements to prevent recurrence. • Infrastructure as Code at Scale: Build and maintain Terraform modules for complex infrastructure patterns. Manage thousands of configuration files across clusters, regions, and environments using GitOps principles. • GitOps & Deployment Excellence: Design and optimize ArgoCD ApplicationSets and Helm chart architectures. Build deployment pipelines that enable safe, automated releases across hundreds of microservices. • Performance & Reliability Engineering: Analyze system performance, identify bottlenecks, and implement optimizations. Improve SLOs through capacity planning, autoscaling, and architectural improvements. • Observability & Monitoring: Build and enhance monitoring, alerting, and observability using Prometheus, Grafana, Loki, and custom tooling. Drive visibility into complex distributed systems. • Security & Compliance: Implement security controls, compliance frameworks, and best practices across cloud infrastructure. Design secure multi-tenant architectures. • Technical Leadership: Mentor engineers, establish best practices, and drive technical decisions. Collaborate with platform, SRE, and product teams to deliver reliable infrastructure.
Senior Deployment Engineer
KaratKarat is the world leader in technical interviewing and pioneer of the Interviewing Cloud.
• Serve as the principal technical advisor to enterprise clients, establishing yourself as the authoritative voice on Karat's solutions and building high-level trust relationships. • Partner with Software Engineers globally to thoroughly analyze their hiring processes and performance requirements; ensure precise solution alignment and Karat product delivery as the lead technical expert in Customer Operations and GTM. • Work strategically with the Company's GTM team throughout the entire customer lifecycle. • Presenting Karat's technical solution to prospects as the subject matter expert. • Architecting and implementing the initial Karat interview framework for each new enterprise client. • Conducting regular strategic reviews with customers to ensure alignment with business objectives and optimal performance. • Designing and delivering executive-level training sessions for client stakeholders. • Analyze complex performance data and calibrate assessment metrics in collaboration with Karat’s Content and Data teams; translate findings into actionable strategic recommendations that strengthen client partnerships. • Drive continuous improvement of the Customer Operations and GTM teams' internal processes by identifying innovative opportunities to deliver additional value to our enterprise clients.
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description We’re hiring a hands-on senior DevOps engineer to take full ownership of our infrastructure, security posture, and deployment pipelines. This is not a “keep the lights on” role. We need someone who: - fixes what’s broken - simplifies what’s overcomplicated - hardens security - builds systems we can trust You’ll work closely with engineering leadership and application developers, and you’ll be expected to operate independently with a high bar for reliability. Qualifications - 5+ years of hands-on DevOps / infrastructure experience - Deep experience with AWS - Strong experience with Terraform or similar IaC tools - CI/CD experience (GitHub Actions, GitLab CI, or similar) - Experience running production systems with real traffic - Strong understanding of cloud security best practices - Comfortable being the sole DevOps owner Requirements - Experience with ECS or EKS - Multi-account AWS setups - SOC 2 / ISO / PCI exposure - Production migrations (shared hosting → cloud) - Experience cleaning up inherited, messy infrastructure Benefits - You must overlap meaningfully with US working hours - You communicate clearly and proactively - You take ownership - problems don’t linger - You prefer simple, robust solutions over clever ones - You are comfortable saying “this is risky” and backing it up This Role Is Not a Fit If: - You are part of an agency or want to work part-time - You need constant direction or ticket-by-ticket management - You avoid security responsibility - You’re unavailable during US hours - You prefer theoretical architecture over practical reliability
DevOps Engineer
DatasiteSince 1968, Datasite has been a leader in providing Software-as-a-Service (SaaS) solutions for the merger and acquisition (M&A) industry. A privately-held computer software company
• Work closely with the DevOps team with the mission to operate applications that not only provide users with stability and 24/7 availability, but also make our customer's life easier through agility and control. • You will handle generic processes such as packaging, deployments and rollbacks as well as configuration and key management so that the team can ship even more efficiently (new) features. • Due to the fast growth of the company you will evaluate the pertinence and suitability of current and past solutions since our challenges evolve rapidly.



