Nagarro (Frankfurt: NA9) is a leader in digital product engineering and drives technology-led business breakthroughs.
Staff Engineer – DevOps, Observability Engineer
Location
United States
Posted
15 hours ago
Salary
0
Seniority
Lead
Job Description
Staff Engineer – DevOps, Observability Engineer
Nagarro
• Analyze and optimize existing New Relic dashboards, telemetry, and monitoring setup for the TRAIT application. • Review and refine PagerDuty alert triggers, escalation policies, and incident workflows to ensure only actionable events generate alerts. • Identify obsolete dashboards, alerts, and monitoring components and optimize them based on current operational requirements. • Support DevOps operational activities including monitoring production environments, incident response, root cause analysis, and reliability improvements. • Collaborate with development, infrastructure, and support teams to improve application observability and operational health. • Assist in automation and monitoring integration within CI/CD and cloud environments. • Recommend and implement observability best practices for logging, metrics, tracing, and alerting.
Job Requirements
- Strong hands-on experience with New Relic including APM, telemetry, dashboards, alerting, and observability optimization.
- Experience in configuring and managing PagerDuty alerts, on-call workflows, escalation policies, and incident management processes.
- Good experience in DevOps/SRE operations including production monitoring, troubleshooting, and operational support.
- Experience with cloud and DevOps tools such as Amazon Web Services / Microsoft Azure, CI/CD pipelines, Linux, scripting, or infrastructure monitoring.
Benefits
- Employees can work remotely
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Engineer - AWS DevOps
ZensarAt Zensar, we’re “experience-led everything”. We are committed to conceptualizing, designing, engineering, marketing, and managing digital solutions and experiences for over 130 leading enterprises. We are a company driven by a bold purpose: Together, we shape experiences for better futures. Whether for our clients, our people, or the world around us, this belief powers everything we do. At the heart of our culture is ONE with Client - a set of four core values that reflect who we are and how we work: One Zensar, Nurturing, Empowering, and Client Focus. Part of the $4.8 billion RPG Group, we’re a community of 10,000+ innovators across 30+ global locations, including Milpitas, Seattle, Princeton, Cape Town, London, Zurich, Singapore, and Mexico City. We believe the best work happens when individuality is celebrated, growth is encouraged, and well-being is prioritized. We are an equal employment opportunity (EEO) and affirmative action employer, committed to creating an inclusive workplace. All qualified applicants will be considered without regard to race, creed, color, ancestry, religion, sex, national origin, citizenship, age, sexual orientation, gender identity, disability, marital status, family medical leave status, or protected veteran status.
Role Description As part of normal business duties, the candidate will be expected to share the management of the team’s ticket queues, platform support, monitoring and any future project assignments. The candidate will also be expected to participate in an on-call rota to support London and Bermuda working hours and out of hours for business-critical application services. Qualifications - 10+ years of experience working in DevOps, SRE, Platforms infrastructure roles. - Strong experience with Infrastructure automation using Infrastructure as Code (IaC) and configuration using Terraform, Azure Resource Manager (ARM templates), YAML, PowerShell and Bash. - Experience in building YAML CI/CD pipeline using Azure DevOps and use of GIT, including build of .NET applications, deployments to environments and automated workflows. - Experience working with core Azure compute, storage, networking resources and a wide knowledge of the available Azure platforms across IaaS, PaaS and SaaS, including Azure app services, virtual machines, Azure policies, Azure Entra. - Strong experience and support of server operating systems, including Windows Server and Unix. - Experience in scripting languages like Azure PowerShell, Azure CLI, Bash, Python and REST. - Beneficial to have - Microsoft Certified: Azure Administrator Associate / Microsoft Certified: Azure Developer Associate. Company Description At Zensar, we’re “experience-led everything.” We are committed to conceptualizing, designing, engineering, marketing, and managing digital solutions and experiences for over 130 leading enterprises. We are a company driven by a bold purpose: Together, we shape experiences for better futures. Whether for our clients, our people, or the world around us, this belief powers everything we do. - At the heart of our culture is ONE with Client - a set of four core values that reflect who we are and how we work: One Zensar, Nurturing, Empowering, and Client Focus. - Part of the $4.8 billion RPG Group, we’re a community of 10,000+ innovators across 30+ global locations, including Milpitas, Seattle, Princeton, Cape Town, London, Zurich, Singapore, and Mexico City. - We believe the best work happens when individuality is celebrated, growth is encouraged, and well-being is prioritized. - We are an equal employment opportunity (EEO) and affirmative action employer, committed to creating an inclusive workplace.
Mid-Level Dev Ops Engineer
Hunt StWe help Aussie companies find top 3% remote talent in the Philippines & Nepal for a single finder's fee.
Role Description We are seeking a Mid-Level DevOps Engineer with strong Microsoft 365 expertise to support and enhance our cloud and collaboration environments. This role will play a critical part in maintaining and optimising our M365 ecosystem, supporting AWS-based infrastructure, and leading key platform migration initiatives over the next 12 months. The ideal candidate is technically strong, self-motivated, and comfortable working in a collaborative environment while taking ownership of projects and operational improvements. Key Responsibilities - Microsoft 365 Administration & Support - Administer and manage Microsoft 365 environment including users, groups, enterprise applications, licensing, SSO, and SCIM provisioning - Maintain identity and access management processes - Provide ongoing operational support across the M365 ecosystem - Ensure security, compliance, and best practices are implemented - DevOps & Cloud Infrastructure - Support and optimise AWS-based infrastructure and services - Improve operational efficiency of an AWS-based AI platform - Maintain CI/CD and repository environments - Assist in system performance monitoring, reliability, and automation initiatives - Platform Migration & Consolidation Projects (Next 12 Months) - Migrate Bitwarden from self-hosted to SaaS - Lead migration from Mattermost to Microsoft Teams - Transition from Maxotel to Microsoft Teams Phone - Migrate GitLab repositories to Bitbucket - Perform configuration cleanup and optimisation of Nginx and WordPress environments - Collaboration & Documentation - Work closely with internal stakeholders and technical teams - Provide regular reporting and updates on projects and operational improvements - Maintain clear technical documentation and process records Qualifications - Approximately 5+ years of experience as a DevOps Engineer - Strong Microsoft 365 systems administration experience (users, groups, SSO, SCIM, enterprise applications, licensing, etc.) - Hands-on AWS experience - Strong troubleshooting and systems optimisation skills - Ability to work independently while being a collaborative team player - Self-motivated, proactive, and quick learner Preferred / Nice to Have - Experience with similar platforms to Maxotel and Bitwarden - Atlassian suite experience (JIRA, JSM, Confluence, Bitbucket) - Experience with VoIP and MS Teams Phone environments - Experience supporting Nginx and WordPress environments - Exposure to SaaS migration and cloud consolidation projects - Have read “The Phoenix Project” by Gene Kim Work Arrangement & Expectations This is a remote role that will be set up as an independent contractor engagement. To ensure alignment and transparency, successful candidates will be expected to: - Disclose any existing ongoing roles or client work - Reflect this engagement on their LinkedIn profile (clearly marked as “Independent Contractor”)
Senior Site Reliability Engineer
Tempo SoftwareAdaptive SPM for AI-Accelerated Innovation | Modular Solutions, Compounding Value | 30,000+ Customers
• Design, implement, and maintain our infrastructure using best practices • Create and support CI/CD pipelines • Deploy enterprise-scale projects on AWS • Work with latest technologies like Kubernetes • Automate key processes, including build, release, and monitoring (alerting and observability), for both infrastructure and products • Design and execute technical solutions that improve speed and quality • Monitor system performance and troubleshoot issues • Participate in the on-call rotation to support our applications • Collaborate with team members and other staff • Ensure security and compliance requirements are met
• Design and implement tools and technologies to provision and configure an enterprise software system hosted in Kubernetes • Design and implement architecture and networking solutions within Azure • Automate/script existing processes using languages such as python and bash • Provision, configure, and maintain cloud resources using Terraform and Terragrunt • Actively monitor the application environment and respond to incidents • Conduct root cause analysis for production incidents • Oversee cloud infrastructure and recommend any potential improvements to technology or process • Contribute to team planning to solve engineering challenges • Be a dependable and highly skilled development resource for peers through education and review • Have a broad awareness of related projects and industry trends, and encourage innovative practices among peers • Provide detailed feedback and suggestions to team code reviews • Document and demonstrate solutions through written documentation, diagrams, and readable code



