Avaya logo
Avaya

Avaya is an Equal Opportunity employer and a U.S. Federal Contractor. Our commitment to equality is a core value of Avaya. All qualified applicants and employees receive equal treatment without consideration for race, religion, sex, age, sexual orientation, gender identity, national origin, disability, status as a protected veteran or any other protected characteristic. In general, positions at Avaya require the ability to communicate and use office technology effectively. Physical requirements may vary by assigned work location. This job brief/description is subject to change. Nothing in this job description restricts Avaya's right to alter the duties and responsibilities of this position at any time for any reason.

Site Reliability Engineer, Azure – DevSecOps – IaC – Governance – Observability

DevOps EngineerDevOps EngineerOtherRemoteSeniorTeam 5,001-10,000H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

81 days ago

Salary

$129K - $143K / year

Seniority

Senior

Job Description

Site Reliability Engineer, Azure – DevSecOps – IaC – Governance – Observability

Avaya

• Serve as a key member of the 24×7 on-call rotation, responding to and managing incidents across production and pre-production environments. • Lead incident bridges, coordinate root cause analysis (RCA), and ensure post-incident reviews drive systemic improvements. • Maintain clear communication with cross-functional teams and leadership during major incidents. • Build, tune, and maintain observability dashboards (Azure Monitor, GCP Operations Suite, Prometheus, Grafana, Datadog, Log Analytics). • Perform deep-dive troubleshooting of application and service-level issues using distributed tracing and log analysis (Grafana, Datadog) to pinpoint root causes beyond infrastructure. • Define SLOs, SLIs, and error budgets to proactively identify and mitigate reliability risks before customer impact. • Integrate AI-Ops tools for anomaly detection, predictive alerting, and automated incident correlation. • Continuously enhance alert quality, reduce false positives, and automate runbooks for faster recovery.

Job Requirements

  • 5+ years in Site Reliability, DevOps, Cloud Operations, or Customer support roles.
  • Demonstrated experience in application-level troubleshooting by analyzing logs and traces to identify bugs, performance bottlenecks, and error conditions.
  • Expertise in Azure and GCP cloud operations and distributed system reliability.
  • Understanding of Terraform, Ansible, and CI/CD pipelines (Jenkins, GitHub Actions).
  • Experience with observability and AI-Ops tools (Azure Monitor, GCP Operations Suite, Grafana, Prometheus, Datadog, etc.).
  • Solid grasp of incident management frameworks (P1–P3 handling, RCA, PIRs, on-call rotations).
  • Excellent analytical, troubleshooting, and communication skills.

Benefits

  • performance-related bonus
  • benefits

Related Categories

Related Job Pages

More DevOps Engineer Jobs

F5 logo

Engineer II

F5

We secure every app.

DevOps Engineer81 days ago
OtherRemoteTeam 5,001-10,000H1B Sponsor

At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation. Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. F5, Inc. seeks Engineer II in Seattle, WA: Job Duties: Design, implement, and maintain CI/CD pipelines for F5 products. Collaborate with cross-functional teams, including product development, QA, and operations, to ensure efficient and reliable software delivery. Troubleshoot and resolve issues related to CI/CD pipelines and infrastructure. Monitor and analyze CI/CD metrics and KPIs to identify areas for improvement. Stay up-to-date with the latest DevOps tools and technologies and recommend improvements to the CI/CD process. Automate manual tasks and processes to improve efficiency and reduce errors. Conduct code reviews and provide feedback to developers on best practices for CI/CD. Provide on-call support for CI/CD issues and incidents. Document and maintain CI/CD processes and procedures. Participate in continuous improvement initiatives to enhance the overall DevOps process. Full-time telecommuting is an option. Minimum Requirements: Master’s degree (or its foreign degree equivalent) in Computer Science, Engineering (any field) or a related quantitative discipline and two (2) years of experience in the field of software engineering, or two (2) years of experience in the job offered. Special Skill Requirements: (1) Test automation frameworks; (2) JavaScript/Typescript; (3) Python; (4) NoSQL; (5) Azure DevOps; (6) CI/CD; (7) mature version control practices; (8) Azure Cloud; (9) Oracle, SL Server, and Snowflake; and (10) Selenium or Junit. Any suitable combination of education, training and experience is acceptable. Full-time telecommuting is an option. Salary: $149,240 - $156,000 per annum. Benefits: F5 offers competitive pay, 401k, and other benefits: https://www.f5.com/company/careers/benefits. Submit a resume with references using the apply button on this posting or by email at: PeopleOps@f5.com at Req.# L24-156607 #LI-DNI You may also be offered incentive compensation, bonus, restricted stock units, and benefits. More details about F5’s benefits can be found at the following link: https://www.f5.com/company/careers/benefits. F5 reserves the right to change or terminate any benefit plan without notice. Please note that F5 only contacts candidates through F5 email address (ending with @f5.com) or auto email notification from Workday (ending with f5.com or @myworkday.com). Equal Employment Opportunity It is the policy of F5 to provide equal employment opportunities to all employees and employment applicants without regard to unlawful considerations of race, religion, color, national origin, sex, sexual orientation, gender identity or expression, age, sensory, physical, or mental disability, marital status, veteran or military status, genetic information, or any other classification protected by applicable local, state, or federal laws. This policy applies to all aspects of employment, including, but not limited to, hiring, job assignment, compensation, promotion, benefits, training, discipline, and termination. F5 offers a variety of reasonable accommodations for candidates. Requesting an accommodation is completely voluntary. F5 will assess the need for accommodations in the application process separately from those that may be needed to perform the job. Request by contacting accommodations@f5.com.

United States
$149K - $156K / year
Job Closed
Luxury Presence logo

Staff DevOps Engineer – Developer Infrastructure

Luxury Presence

Do it all with Luxury Presence. Build your brand, expand your network, & close more deals.

DevOps Engineer81 days ago
OtherRemoteTeam 201-500Since 2016H1B Sponsor

• Design and operate ephemeral, pre-warmed development environments that agents and engineers can spin up on demand. • Own the infrastructure-level gates that prove a deploy is safe before it reaches production. • Build containerized mock services (generated from OpenAPI specs) so contributors can validate integration code against realistic third-party dependencies locally. • Build the review tooling, metrics dashboards, and operational controls that make our pipelines observable and improvable.

United States
$200K - $230K / year
Avaya logo

Site Reliability Engineer (SRE) - Azure | DevSecOps | IaC | Governance | Observability

Avaya

Avaya is an Equal Opportunity employer and a U.S. Federal Contractor. Our commitment to equality is a core value of Avaya. All qualified applicants and employees receive equal treatment without consideration for race, religion, sex, age, sexual orientation, gender identity, national origin, disability, status as a protected veteran or any other protected characteristic. In general, positions at Avaya require the ability to communicate and use office technology effectively. Physical requirements may vary by assigned work location. This job brief/description is subject to change. Nothing in this job description restricts Avaya's right to alter the duties and responsibilities of this position at any time for any reason.

DevOps Engineer81 days ago
OtherRemoteTeam 5,001-10,000H1B Sponsor

About Avaya Avaya is an enterprise software leader that helps the world’s largest organizations and government agencies forge unbreakable connections. The Avaya Infinity™ platform unifies fragmented customer experiences, connecting the channels, insights, technologies, and workflows that together create enduring customer and employee relationships. We believe success is built through strong connections – with each other, with our work, and with our mission. At Avaya, you'll find a community that values your contributions and supports your growth every step of the way. Learn more at https://www.avaya.com Description We are seeking a Site Reliability Engineer (SRE) who will drive stability, reliability, and performance across our Azure and GCP-based platforms. This role blends operational excellence, proactive incident management, and strong collaboration with DevOps, Cloud, and Security teams. The ideal candidate will have hands-on experience with multi-cloud environments (Azure and GCP), IaC (Terraform/Ansible), CI/CD (Jenkins/GitHub Actions), and modern observability and AI-Ops systems. The engineer will also contribute to governance, cost optimization, and automation strategies that reduce toil and prevent issues before they occur. A key aspect of this role is the ability to perform deep-dive troubleshooting of application performance and errors by analyzing logs and traces in platforms like Grafana and Datadog. This position includes 24×7 support coverage (rotational) and requires strong ownership in managing major incidents, RCA processes, and continuous service improvements. Key Responsibilities Reliability & Incident Management - Serve as a key member of the 24×7 on-call rotation, responding to and managing incidents across production and pre-production environments. - Lead incident bridges, coordinate root cause analysis (RCA), and ensure post-incident reviews drive systemic improvements. - Maintain clear communication with cross-functional teams and leadership during major incidents. Monitoring, AI-Ops, Alerts & Prevention - Build, tune, and maintain observability dashboards (Azure Monitor, GCP Operations Suite, Prometheus, Grafana, Datadog, Log Analytics). - Perform deep-dive troubleshooting of application and service-level issues using distributed tracing and log analysis (Grafana, Datadog) to pinpoint root causes beyond infrastructure. - Define SLOs, SLIs, and error budgets to proactively identify and mitigate reliability risks before customer impact. - Integrate AI-Ops tools for anomaly detection, predictive alerting, and automated incident correlation. - Continuously enhance alert quality, reduce false positives, and automate runbooks for faster recovery. - Analyze trends to prevent recurring issues and support teams in resilience engineering. Requirements Required Skills & Experience - 5+ years in Site Reliability, DevOps, Cloud Operations, or Customer support roles. - Demonstrated experience in application-level troubleshooting by analyzing logs and traces to identify bugs, performance bottlenecks, and error conditions. - Expertise in Azure and GCP cloud operations and distributed system reliability. - Understanding of Terraform, Ansible, and CI/CD pipelines (Jenkins, GitHub Actions). - Experience with observability and AI-Ops tools (Azure Monitor, GCP Operations Suite, Grafana, Prometheus, Datadog, etc.). - Solid grasp of incident management frameworks (P1–P3 handling, RCA, PIRs, on-call rotations). - Excellent analytical, troubleshooting, and communication skills. Desired Behaviours - Proactive Prevention: Identifies and resolves risks before they escalate into incidents. - AI-Driven Mindset: Applies AI and automation to improve reliability and reduce human intervention. - Accountability: Owns service reliability and communicates with clarity. - Collaboration: Works seamlessly with platform, DevOps, and product teams. - Efficiency: Focuses on automation to reduce manual effort and improve MTTR. - Continuous Improvement: Learns from failures, iterates processes, and enhances documentation. The pay range for this opportunity is from $129,00 to $143,000 + performance-related bonus + benefits. This range represents the anticipated low and high end of the salary for this position. This role is also eligible to receive an annual bonus that aligns with individual and company performance. Actual salaries will vary and are based on factors such as a candidate’s qualifications, skills, competencies. Footer Applicants must be currently authorized to work in the United States without the need for visa sponsorship now or in the future. Avaya is an Equal Opportunity employer and a U.S. Federal Contractor. Our commitment to equality is a core value of Avaya. All qualified applicants and employees receive equal treatment without consideration for race, religion, sex, age, sexual orientation, gender identity, national origin, disability, status as a protected veteran or any other protected characteristic. In general, positions at Avaya require the ability to communicate and use office technology effectively. Physical requirements may vary by assigned work location. This job brief/description is subject to change. Nothing in this job description restricts Avaya right to alter the duties and responsibilities of this position at any time for any reason.

United States
$129K - $143K / year
InComm Payments logo

DevOps Engineer II

InComm Payments

Quando você pensar na InComm Payments, pense em tecnologia inovadora de pagamentos. Fomos fundados há mais de 30 anos e continuamos a ser pioneiros na indústria de pagamentos (FinTech). Desde a nossa criação estamos em continuo crescimento e somos uma equipe de mais de 3.000 funcionários em mais de 34 países ao redor do mundo. Possuímos mais de 400 patentes técnicas globais e uma rede que inclui mais de 525.000 pontos de distribuição no varejo que apontam para nossa experiência no setor. A InComm Payments está altamente focada em nosso pessoal e em seu crescimento, e trabalhamos duro para tornar a sua carreira significativa e gratificante. Valorizamos a inovação, a qualidade, a paixão, a integridade e a responsabilidade em tudo o que fazemos e procuramos pessoas excelentes para se juntarem à nossa equipa à medida que avançamos em direção a um futuro muito brilhante. Antecipamos o desenvolvimento de futuros líderes para nossas equipes no Brasil!

DevOps Engineer81 days ago
OtherRemoteTeam 1,001-5,000

Overview When you think of InComm Payments, think of Innovative Payments Technology. We were founded over 30 years ago and continue to be a pioneer in the payment (FinTech) industry. Since our inception, we have grown to be a team of over 3,000 employees in 35 countries around the world. We own over 400 global technical patents and a network that includes over 525,000 points of retail distribution that points to our industry expertise. We are significantly growing our Engineering and IT teams in Brazil and are focused on finding talent for various financial technology (Fintech) engineering, database, development, and testing teams. InComm Payments is highly focused on our people and their growth, and we work hard to make a career at InComm Payments meaningful and rewarding. We value innovation, quality, passion, integrity and responsibility in all that we do, and we are looking for great people to join our team as we move forward towards a very bright future. We anticipate developing future leaders for our teams in Brazil! Benefits include health and dental insurance, meal and restaurant vouchers, fixed monthly stipend for internet and mobile expenses, InComm hardware/software, and annual bonuses! All positions are CLT. You can learn more about InComm Payments by visiting our Website or connecting with us on LinkedIn, YouTube, Twitter, Facebook, or Instagram. About This Opportunity As a DevOps Engineer II, you will be responsible for implementing and improving CI/CD tooling to drive total automation across environments. This team will be supporting the enterprise as a shared service for development teams to utilize in automating build and deployment pipelines. You will collaborate with development teams to enhance services through rigorous testing and release procedures and create sustainable systems and services through automation. Responsibilities - Implement and improve CI/CD tooling for automated releases. - Collaborate with development teams to enhance services through testing and release procedures. - Create sustainable systems and services through automation. - Lead initiatives to monitor DevOps build and deploy pipelines. - Triage user requests and manage work queue with appropriate prioritization. - Emphasizing automation, recommend solutions and implement processes, procedures and best practice guidelines for code, build, test and deployments. - Participate in after-hours on-call rotation. Qualifications - Bachelor’s degree in computer science or a related technical discipline. - Proficient in Groovy, YAML, Bash, PowerShell and Python programming. - Proficient in CI/CD tools, such as Jenkins, GitHub Actions, and Azure DevOps. - Experience with Configuration Management and tools like SonarQube, Artifactory, GitHub, Selenium, and Postman. - Extensive experience in Linux and Windows system administration. - Proficient in container orchestration using Kubernetes, with deep expertise in Docker-based containerization for microservices architecture and cloud-native deployments. - Experience with orchestration tools like Ansible and Terraform. - Hands-on experience deploying and managing scalable applications in Microsoft Azure. - Experience in the complete lifecycle of projects, including version control, build management, unit testing, and issue tracking software. - Understanding of software development best practices. - Proactive in identifying problems, areas for improvement, and performance bottlenecks. InComm provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity or national origin, citizenship, veteran’s status, age, disability status, genetics or any other category protected by federal, state, or local law. *This position is eligible for the Employee Referral Bonus Program-Tier 3 #LI-WS1 #LI-Remote

United States + 1 moreAll locations: United States | Canada