Daxko logo
Daxko

Daxko is dedicated to pursuing and hiring a diverse workforce. We are committed to diversity in the broadest sense, including thought and perspective, age, ability, nationality, ethnicity, orientation, and gender. The skills, perspectives, ideas, and experiences of all of our team members contribute to the vitality and success of our purpose and values. We truly care for our team members, and this is reflected through our offices, and benefits, and great perks. These perks are only for our full-time team members.

SRE Manager

DevOps EngineerDevOps EngineerFull TimeRemoteLeadTeam 501-1,000Since 1998

Location

United States

Posted

50 days ago

Salary

0

Seniority

Lead

No structured requirement data.

Job Description

SRE Manager

Daxko

We’re looking for a Manager of Site Reliability Engineering (SRE) who is passionate about building resilient systems and leading teams that keep critical services running smoothly. In this role, you’ll guide a team responsible for the reliability, performance, and operational health of our production environments. You’ll partner closely with engineering leaders to ensure our systems remain secure, scalable, and available for the organizations and communities who depend on them. What You’ll Do As the Manager of Site Reliability Engineering, you will lead a team responsible for the operational reliability of Daxko’s production platforms. Your work will focus on creating stable, high-performing systems while empowering your team to continuously improve how we operate and support our products. You will also: - Lead and support a team responsible for the reliability and performance of production systems, which includes: - Setting clear performance expectations and goals for team members - Providing ongoing coaching and real-time feedback - Ensuring team members have the training and resources they need to succeed - Coordinating on-call rotations and operational coverage - Supporting the team during critical incidents and outages - Managing team staffing, including hiring and headcount planning - Prioritize and coordinate work across operational initiatives, deployments, upgrades, and infrastructure improvements - Ensure high levels of system uptime, data integrity, and operational stability - Partner with Engineering Leads to align platform operations with product development needs - Maintain business continuity across all production assets - Monitor system health, performance, and capacity to proactively identify and resolve issues - Serve as a technical escalation point for complex infrastructure or platform challenges - Provide regular reporting on system availability, response times, and capacity trends - Ensure operations meet security, compliance, and regulatory requirements - Support and coordinate the team’s on-call rotation and incident response processes - Continuously improve operational practices through automation, tooling, and monitoring Technologies You’ll Work With Our platform relies on modern infrastructure and cloud technologies. Strong experience with several of the following areas is important: - Linux-based systems - Web server technologies (NGINX, PHP, Traefik, F5) - Virtualization platforms such as VMware - Cloud platforms including AWS and Azure - Containerization and orchestration (Docker, Kubernetes, Dynos) - Messaging and caching technologies (Redis, RabbitMQ) - A strong security mindset and experience implementing infrastructure security controls are essential. What You Bring You’re a thoughtful technical leader who enjoys solving complex operational challenges and helping engineers grow. We’re looking for someone who brings: - Strong analytical and problem-solving skills - Clear communication and collaboration skills - Experience leading teams in fast-moving technical environments - The ability to balance multiple priorities and make thoughtful decisions under pressure - Strong organizational and time management skills - A customer-focused mindset and commitment to system reliability - Bachelor’s degree in a technical discipline or equivalent professional experience - 3–5 years of experience leading or managing globally distributed engineering teams - 3–5 years of experience in a Site Reliability Engineering or similar infrastructure-focused role Preferred Experience - Experience serving as a technical lead on infrastructure or platform teams - Experience with modern observability and monitoring tools, such as OpenTelemetry, Instana, LogicMonitor, PagerDuty, or OpsGenie - Experience with infrastructure and automation tooling such as GitLab CI, Jenkins, Chef, Terraform, Elasticsearch, Kubernetes, or Rancher - Scripting experience in Ruby, Python, or Bash - Familiarity with SOC, PCI, or GDPR compliance standards - Experience working with issue tracking and collaboration tools such as the Atlassian suite - Experience supporting or developing applications built with Java, PHP, or Node - Experience automating operational processes and repetitive tasks Daxko is dedicated to pursuing and hiring a diverse workforce. We are committed to diversity in the broadest sense, including thought and perspective, age, ability, nationality, ethnicity, orientation, and gender. The skills, perspectives, ideas, and experiences of all of our team members contribute to the vitality and success of our purpose and values. We truly care for our team members, and this is reflected through our offices, and benefits, and great perks. These perks are only for our full-time team members. Some of our favorites include: 🏝 Flexible paid time off ⚕️ Affordable health, dental, and vision insurance options 💪 Monthly fitness reimbursement 🤑 401(k) matching 🍼 New-Parent Paid Leave 👖 Casual work environments 🏡 Flexible work - remote & hybrid All your information will be kept confidential according to EEO guidelines. #LI-Remote

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Postscript logo

Senior DevOps Engineer

Postscript

SMS marketing platform for ecommerce companies. Helping Shopify stores drive 30x ROI with text message marketing.

DevOps Engineer50 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

• Design, implement, and maintain infrastructure solutions on AWS, utilizing tools such as Terraform for Infrastructure as Code (IaC) and, ideally, Terraform Cloud. • Deploy and manage containerized applications using ECS, with EKS experience being a bonus. • Work closely with engineering teams to understand project requirements, ensuring seamless implementation of pre-designed infrastructure and deployment patterns. • Set up and manage CI/CD pipelines to automate and streamline the software delivery process, with a preference for GitHub Actions or similar tools. • Continuously monitor the health of the infrastructure, identifying and resolving issues to optimize performance, scalability, and security. • Leverage Python for scripting, automation, and other development tasks to enhance infrastructure and deployment processes. • Create and maintain comprehensive documentation of all infrastructure components, processes, and deployment workflows to facilitate knowledge sharing and continuity.

United States
$161K - $189K / year
DevOps Engineer50 days ago
ContractRemoteTeam 11-50Since 2021H1B Sponsor

• Build scalable API services. • Integrate Clerk Authentication for identity, RBAC, token validation, and authorization. • Implement core backend for human-led AI workflows. • Support multi-tenant architecture, secure session management, and API hardening. • Deploy and manage Azure infrastructure (Functions, App Services, AKS, Key Vault). • Build and maintain CI/CD pipelines (GitHub Actions or Azure DevOps). • Implement Infrastructure-as-Code using Terraform or Bicep. • Configure monitoring, logging, observability, and uptime automation. • Optimize cloud cost, performance, and reliability.

United States
DevOps Engineer50 days ago
ContractRemoteTeam 11-50Since 2021H1B Sponsor

• Design, deploy, and manage Azure infrastructure (App Services, Functions, AKS, Key Vault). • Implement Infrastructure-as-Code using Terraform.. • Ensure high availability, fault tolerance, and secure system boundaries. • Optimize cloud cost, performance, and reliability. • Build and maintain CI/CD pipelines using Azure DevOps or GitHub Actions. • Automate testing, deployments, and environment promotion. • Improve developer experience and release velocity across teams. • Support backend services (FastAPI, Python) with deployment, scaling, and runtime optimization. • Assist with API observability, logging, and performance tuning. • Collaborate closely with backend engineers to unblock delivery and reduce operational overhead. • Implement secure networking, secrets management, and identity controls. • Configure monitoring, alerting, and incident response workflows. • Enforce best practices around containerization (Docker) and orchestration (AKS).

United States
Inmetrics logo

Junior SRE Analyst

Inmetrics

We make a difference, solve outstanding problems and make the digital transformation of our clients possible.

DevOps Engineer50 days ago
Full TimeRemoteTeam 501-1,000Since 2002H1B No Sponsor

• Implementation and maintenance of strategies for system reliability and availability • Ensure the efficiency and effectiveness of the company's technology operations • Perform diagnostics and troubleshooting in production environments • Implement process automation and system monitoring • Contribute to the continuous improvement of IT service quality

Brazil
Job Closed