Genesys

Orchestrating billions of remarkable experiences in more than 100 countries – through cloud, digital and AI technology.

Senior Operations Reliability Engineer

DevOps Engineer | Full Time | Remote | Senior | Team 5,001-10,000 | Since 1990 | H1B Sponsor

Location

United Kingdom

Posted

3 days ago

Seniority

Senior

Job Description

Role Description

As a Senior Operations Reliability Engineer specializing in Enterprise Platforms and Tools, you will own the operational reliability, health, and lifecycle management of enterprise productivity and collaboration platforms. This role combines hands-on platform administration with day-to-day operational ownership and governance of enterprise SaaS tools such as Jira, Confluence, Figma, Lucid, and other SaaS-related platforms.

In addition to serving as a senior escalation point, you will:

- Improve monitoring accuracy
- Reduce alert noise
- Validate automation workflows
- Contribute to AIOps tuning and observability standards
- Help transition enterprise tool operations from reactive issue handling toward proactive, automation-driven reliability practices

Responsibilities

General Reliability Operations

- Monitor observability and AIOps platforms to detect anomalies, performance degradation, and emerging issues across enterprise systems.
- Perform advanced incident triage and event correlation to identify root cause and reduce duplicate or misrouted incidents.
- Lead or contribute to post-incident reviews, identifying systemic fixes and automation opportunities.
- Validate automated remediation workflows prior to production adoption.
- Identify recurring manual tasks and translate them into automation requirements or scripted improvements.
- Improve alert signal quality by refining thresholds, suppression logic, and event correlation rules.
- Ensure platform telemetry, SaaS health signals, and configuration data align with monitoring and CMDB standards.
- Collaborate with Cloud, IAM, Network, Security, and ServiceNow teams to improve enterprise service reliability.

Enterprise Tools Ownership & Operational Management

- Own day-to-day operational health and administration of enterprise SaaS platforms (e.g., Jira, Confluence, Figma, Lucid, monitoring tools, and similar productivity platforms).
- Monitor vendor service health dashboards and integrate SaaS outage signals into internal observability and AIOps workflows.
- Lead user-impact communications during enterprise tool outages or service degradations in partnership with IT Communications and ServiceNow teams.
- Review vendor release notes and roadmap updates; assess feature changes, security updates, and deprecations.
- Plan and coordinate controlled feature rollouts, configuration updates, and tenant-level optimizations.
- Provide guidance and education to end users on new features, configuration changes, and best practices.
- Manage licensing, usage monitoring, and cost optimization for enterprise tools.
- Partner with Security and IAM teams to ensure access governance and compliance standards are maintained.
- Improve monitoring coverage for enterprise tools by integrating telemetry and health signals into AIOps platforms.
- Document operational standards, support models, and escalation paths for each owned platform.

Enterprise Platform Responsibilities

- Diagnose and remediate integration issues between enterprise platforms and supporting systems.
- Validate patching and upgrade activities to ensure minimal service disruption.
- Participate in resilience validation exercises, including failover and recovery testing.
- Provide mentorship and knowledge-sharing to junior reliability engineers.
- Support operational reliability of Microsoft Power Platform components (Power Apps, Power Automate, Power BI), including:
  - Monitoring flow failures
  - Troubleshooting environment-level issues
  - Supporting connector configuration
  - Assisting with environment governance and data loss prevention policies

Automation & AIOps Contributions

- Develop and maintain automation scripts (PowerShell, Python) to reduce repetitive operational effort.
- Contribute to ServiceNow and Power Automate workflow improvements tied to enterprise tool incidents.
- Partner with teams to refine automated remediation logic.
- Improve enterprise tool signal quality by integrating vendor health data and usage telemetry into AIOps systems.
- Support tuning of alert correlation and anomaly detection models for enterprise services.
- Track improvements in MTTR, alert noise reduction, automation coverage, and platform uptime.

Qualifications

- Bachelor’s degree in Computer Science, Information Technology, or related field; equivalent experience considered.
- 5+ years of experience in enterprise platform operations, SaaS administration, or infrastructure support roles.
- Hands-on experience administering enterprise tools such as Jira, Confluence, Figma, Lucid, or similar SaaS platforms.
- Experience with SQL Server and IIS/Apache administration is an asset.
- Experience managing SaaS service health, vendor communications, and feature rollouts.
- Proficiency in PowerShell or equivalent scripting for automation tasks.
- Solid understanding of monitoring, observability, and event management practices.
- Familiarity with ITIL principles and ServiceNow workflows.
- Strong troubleshooting and analytical skills.
- Effective communication skills, including experience communicating user-facing outages or changes.
- Motivation to deepen expertise in automation, AIOps, and reliability engineering.

Preferred Qualifications

- Experience integrating SaaS platforms with identity providers (Okta, Entra ID).
- Familiarity with CI/CD pipelines or automation-driven configuration management.
- Exposure to cloud platforms (AWS or Azure).

Requirements

- Participation in a shared, rotational on-call schedule is required.

Company Description

Genesys empowers organizations of all sizes to improve loyalty and business outcomes by creating the best experiences for their customers and employees. Through Genesys Cloud, the AI-powered Experience Orchestration platform, organizations can accelerate growth by delivering empathetic, personalized experiences at scale to drive customer loyalty, workforce engagement, efficiency and operational improvements. We employ more than 6,000 people across the globe who embrace empathy and cultivate collaboration to succeed. And, while we offer great benefits and perks like larger tech companies, our employees have the independence to make a larger impact on the company and take ownership of their work.

More DevOps Engineer Jobs


LLM Inference Deployment Engineer

EnCharge AI

Where the future of AI compute is being defined and built, to unlock new levels of machine intelligence.

DevOps Engineer | 3 days ago
Full Time | Remote | Team 11-50 | Since 2022 | H1B Sponsor

• Deploy and optimize LLMs (GPT, LLaMA, Mistral, Falcon, etc.) post-training from libraries like HuggingFace.
• Utilize inference runtimes such as ONNX Runtime, vLLM for efficient execution.
• Optimize batching, caching, and tensor parallelism to improve LLM scalability in real-time applications.
• Develop and maintain high-performance inference pipelines using Docker, Kubernetes, and other inference servers.

United States
$180K - $240K / year
Full Time | Remote | Team 501-1,000 | H1B Sponsor

• Collaborate with and support our creative, tight-knit development team.
• Design, deploy, and operate Loadsmart's critical systems while balancing reliability, cost, and agility.
• Play a key role in driving reliability projects with engineering teams.
• Utilize your intuitive problem-solving skills and contagious positive attitude to tackle challenging and exciting issues, inspiring those around you.
• Collect metrics and understand their business impact, encouraging the team to do the same.
• Perform troubleshooting and root-cause analysis of system operation issues.
• Be accountable for the platform's Service Level Agreements and Objectives.
• Provide infrastructure support during off-hours as needed.
• Take ownership of software infrastructure projects.
• Seek, give, and receive constructive feedback through code and specification reviews.

Brazil

Senior DevOps Engineer – Financial

Truelogic Software

Premium boutique software development company that helps brands with big ideas to make a difference in people’s lives.

DevOps Engineer | 3 days ago
Full Time | Remote | Team 501-1,000 | Since 2004 | H1B No Sponsor

• Design, build, and maintain cloud environments within Microsoft Azure using best practices for scalability, reliability, and cost efficiency.
• Implement and manage IaC using Terraform to automate resource provisioning and environment configuration.
• Deploy and manage Azure resources including Azure Kubernetes Service (AKS), App Services, Function Apps, Azure API Management, Virtual Networks, Load Balancers, Storage Accounts, and Azure SQL Server / Azure SQL Databases.
• Build and maintain CI/CD pipelines using Azure DevOps for application deployments, infrastructure, and automated testing.
• Collaborate with development teams using C#, .NET, Visual Studio, and Angular to optimize the build, test, and release processes.
• Integrate pipelines with container registries, package management, and automated approval workflows.
• Develop container strategies using Docker and manage workloads running on Kubernetes (AKS).
• Implement best practices for scaling, monitoring, logging, and securing Kubernetes clusters.
• Monitor system performance, troubleshoot issues, and implement improvements to enhance reliability and uptime.
• Manage Linux-based infrastructure, ensuring proper configuration, patching, and hardening.
• Work with SQL Server databases to support deployments, migrations, and performance optimization.
• Implement secure configurations for Azure resources and Kubernetes clusters while enforcing governance policies, identity and access controls, and environment standards.
• Ensure compliance with organizational and industry security requirements.

Dominican Republic

Senior DevOps Engineer – Financial

Truelogic Software

Premium boutique software development company that helps brands with big ideas to make a difference in people’s lives.

DevOps Engineer | 3 days ago
Full Time | Remote | Team 501-1,000 | Since 2004 | H1B No Sponsor

• Design, build, and maintain cloud environments within Microsoft Azure using best practices for scalability, reliability, and cost efficiency.
• Implement and manage IaC using Terraform to automate resource provisioning and environment configuration.
• Deploy and manage Azure resources including Azure Kubernetes Service (AKS), App Services, Function Apps, Azure API Management, Virtual Networks, Load Balancers, Storage Accounts, and Azure SQL Server / Azure SQL Databases.
• Build and maintain CI/CD pipelines using Azure DevOps for application deployments, infrastructure, and automated testing.
• Collaborate with development teams using C#, .NET, Visual Studio, and Angular to optimize the build, test, and release processes.
• Integrate pipelines with container registries, package management, and automated approval workflows.
• Develop container strategies using Docker and manage workloads running on Kubernetes (AKS).
• Implement best practices for scaling, monitoring, logging, and securing Kubernetes clusters.
• Monitor system performance, troubleshoot issues, and implement improvements to enhance reliability and uptime.
• Manage Linux-based infrastructure, ensuring proper configuration, patching, and hardening.
• Work with SQL Server databases to support deployments, migrations, and performance optimization.
• Implement secure configurations for Azure resources and Kubernetes clusters while enforcing governance policies, identity and access controls, and environment standards.
• Ensure compliance with organizational and industry security requirements.

Colombia