Thumbtack

We help people care for their home from top to bottom — and empower small businesses nationwide to grow.

Senior Software Engineer, Site Reliability Engineering

DevOps Engineer | Full Time | Remote | Senior | Team 1,001-5,000 | H1B Sponsor

Location

Canada

Posted

3 days ago

Salary

C$180.2K - C$233.2K / year

Seniority

Senior

Job Description

Role Description

Thumbtack's Site Reliability Engineering team focuses on creating and maintaining a reliable, secure, and scalable platform vital for a seamless user experience. As a key contributor, you will design and support resilient systems, prioritizing high performance, availability, and throughput, with a focus on minimizing service disruptions, downtime, and latency. SRE impacts Thumbtack’s ecosystem across the entire stack, from Linux systems to applications that drive the customer experience.

The Site Reliability team is responsible for a broad set of technologies and systems with expectations to collaborate across the business. We are expected to develop and enhance existing capabilities while ensuring scalability, reliability, and resiliency of infrastructure and software.

You’ll work with engineering teams across product development, developer experience, and backend infrastructure to collaboratively build Thumbtack’s ecosystem of platform services that have the right impact at the right time. Thumbtack values its cross-functional collaborative culture, and you’d be positioned to contribute to the future direction and success of the engineering platform that serves as the engine of our applications.

What you’ll do

- Design, create, and maintain software and systems to improve the availability, scalability, and efficiency of Thumbtack's services
- Set the architectural direction of infrastructure and platform services while supporting the engineering organization
- Design and implement tools and processes used for deployment, change, service, and infrastructure management
- Troubleshoot and debug critical systems throughout the SDLC
- Contribute to the evolution and performance of capabilities we provide to engineering as a platform organization
- Plan capacity and forecast demand, anticipating performance bottlenecks
- Participate in rotating on-call duties

Qualifications

- Extensive fluency in AWS and Linux
- Ability to effectively read, write, and debug code in programming languages such as, but not limited to, Python, Go, PHP, and JavaScript
- Expertise in designing, analyzing, and troubleshooting large-scale distributed systems across web technologies such as DNS, TLS, HTTP/S, and TCP/IP
- Ability to decompose complex problems while understanding the tradeoffs necessary to deliver impact
- 5 years of experience managing infrastructure and systems
- Demonstrable knowledge of instrumenting, operating, and observing a distributed system of microservices in a production cloud environment
- Ability to communicate clearly and effectively to cross-functional partners of various technical levels
- Passion for reducing toil and improving developer experience

Expected Salary Ranges

- For candidates living in Ontario and British Columbia, the expected salary range for the role is currently $180,200.00 - $233,200.00
- Actual offered salaries will vary and will be based on various factors, such as calibrated job level, qualifications, skills, competencies, and proficiency for the role

Company Description

Thumbtack helps millions of people confidently care for their homes. Thumbtack is the one app you need to take care of and improve your home, from personalized guidance to AI tools and a best-in-class hiring experience. Every day in every county of the U.S., people turn to Thumbtack to complete urgent repairs, seasonal maintenance, and bigger improvements.

More DevOps Engineer Jobs

DevOpsSec Engineer

RAPIDFORT

Remove 95% of CVEs automatically with no code change

DevOps Engineer | 3 days ago
Full Time | Remote | Team 51-200 | Since 2020 | H1B Sponsor

• Design and maintain Kubernetes-based infrastructure, including cluster provisioning, RBAC configuration, network policy, and workload management
• Package and deploy applications using Helm charts; maintain chart repositories and manage release lifecycle across environments
• Implement and enforce policy controls using Istio service mesh, OPA Gatekeeper, Kyverno, and related Kubernetes admission controllers
• Build and maintain CI/CD pipelines using GitLab CI, GitHub Actions, Jenkins, or equivalent tooling; integrate automated security scanning and compliance gates
• Deploy and operate workloads on AWS GovCloud and Azure Government; architect for high availability, disaster recovery, and cross-region compliance requirements
• Manage and harden container images; integrate with Iron Bank, Platform One, and other DoW-approved registry sources
• Configure and maintain observability stacks including Prometheus, Grafana, and Datadog; develop alerting, dashboards, and SLO frameworks
• Participate in ATO processes, support STIG/CIS compliance scanning, and contribute to System Security Plans (SSPs) and documentation artifacts
• Collaborate with development, security, and program teams to establish and refine DevSecOps practices across the software delivery lifecycle
• Support air-gapped and classified environment deployments; design solutions for offline image transfer, registry mirroring, and artifact management
• Coordinate with government platform teams and managed service providers to integrate and sustain vendor tooling within approved DoD software factories

United States
$110K - $140K / year
Job Closed

Senior DevOps Engineer

Absorb Software

Founded in 2003 and headquartered in Calgary, Canada, Absorb Software provides cloud-based learning management systems to support organizational training and development. Serving o

DevOps Engineer | 3 days ago

• Own and operate the production environment, maintaining a holistic view of system health, availability, and performance
• Design, build, and maintain infrastructure and platform systems using automation-first principles
• Proactively create monitors and observability strategies (e.g., Sumo Logic, New Relic, Prometheus) to prevent issues before they occur
• Apply AI tools (Claude, Cursor) to enhance infrastructure operations, including debugging, monitoring, and performance optimization at the system level
• Measure, analyze, and optimize system performance to continuously improve uptime and user experience
• Provide operational support for large, distributed systems, including participation in on-call and incident response
• Diagnose and resolve complex production issues, and mentor others in debugging and troubleshooting approaches
• Partner with engineering teams to improve reliability through testing, release processes, and system design
• Contribute to and influence system architecture, ensuring scalability, resilience, and long-term maintainability
• Participate in capacity planning, platform management, and infrastructure roadmap discussions
• Drive automation and continuous improvement across infrastructure and deployment workflows
• Balance speed and reliability through clear service level objectives (SLOs)

Canada

Principal DevOps Engineer

Headspace

Headspace is on a mission to improve the health and happiness of the world through meditation and mindfulness. The company works by offering a range of online wellness resources to

DevOps Engineer | 3 days ago

• Define the long-term technical vision for our cloud platform, advising leadership on architectural strategy, investment priorities, and systemic risk
• Set organizational standards for cloud reliability, observability, and operational excellence, including SLO/SLI frameworks, incident management practices, and the platform tooling that underpins them
• Serve as the senior solutions engineering authority for partner teams: translating cross-functional requirements into platform strategy and driving prioritization of developer experience investments
• Own the developer experience roadmap, identifying and closing systemic gaps in self-service infrastructure, CI/CD workflows, and operational visibility across engineering
• Proactively surface systemic risks in our AWS infrastructure, IaC practices, and delivery pipelines, and drive organizational action before issues become incidents
• Translate complex infrastructure risk and platform strategy into clear, business-aligned narratives for Director-level and executive stakeholders to drive resourcing and prioritization decisions
• Mentor Staff (T4) and Senior (T3) engineers and model the culture of engineering rigor, documentation, and cross-functional accountability expected across the organization

United States
$162K - $225K / year

Lead Deployment Engineer

Mashgin

World's fastest AI powered Touchless self-checkout ecosystem. YC W15.

DevOps Engineer | 3 days ago
Full Time | Remote | Team 11-50 | Since 2015 | H1B No Sponsor

• Mentor and develop team members at all levels, providing coaching, honest feedback, and the kind of direct conversations that help people grow professionally and technically.
• Become the go-to expert on Mashgin's hardware, software, and deployment systems, a reliable resource for the team, customers, and internal partners alike.
• Own and improve standard operating procedures, identifying what's working, what isn't, and what needs to be built from scratch.
• Serve as a clear and reliable conduit between the team and cross-functional partners including Product, Engineering, and vendors, surfacing recurring issues and keeping the right people informed.
• Travel up to 25% of the time to support new location launches and assist customers across the country.

All locations: Arizona | California | Colorado | Illinois | Texas
$112K - $130K / year