Job Closed
This listing is no longer active.
The leading provider of enterprise open source solutions.
Senior Customer Reliability Engineer – OpenShift Managed Cloud Services
Location
Ireland
Posted
67 days ago
Salary
0
Seniority
Senior
Job Description
Senior Customer Reliability Engineer – OpenShift Managed Cloud Services
Red Hat
• Manage large-scale, distributed systems, focusing on minimizing downtime and improving system resilience. • Maintain customer trust and confidence by ensuring stability and functionality of services. • Drive continuous enhancement of processes, tools, and methodologies to support the evolving needs of the service. • Lead the development of code and automation scripts to optimize the scalability, reliability, and performance of services. • Lead and participate in high-priority customer escalations, adopting a customer-first mindset. • Coordinate and execute complex incident response procedures, ensuring timely resolution and thorough postmortems. • Collaborate with cross-functional teams to enhance system robustness. • Demonstrate a proactive mindset to help preempt escalations and ensure reliable operations. • Document resolutions, root causes, and best practices to enrich the knowledge base and promote self-service solutions. • Mentor and coach team members, fostering a culture of continuous learning, knowledge sharing and collaboration. • Participate in on-call rotation and provide leadership during critical incidents. • Collaborate on strategic AI and automation projects designed to increase the efficiency of fleet operations and troubleshooting, ultimately delivering a better product experience for customers.
Job Requirements
- Advanced Experience with Openshift/Kubernetes container platform support or administration.
- Proficient with container-based technologies on Linux.
- Proficient in managing Linux-based systems in a public cloud such as AWS, Azure, or GCP.
- Advanced experience with enterprise systems monitoring; knowledge of Prometheus is preferred.
- Advanced with enterprise configuration management such as Ansible, Terraform.
- Software engineering experience using object-oriented languages; golang is preferred.
- Superior communications skills and experience working directly with and presenting to customers.
- Ability to quickly learn new technologies and follow industry trends.
- Demonstrated ability to quickly and accurately troubleshoot systems issues.
- Solid understanding of standard TCP/IP networking and common protocols.
Benefits
- Health insurance
- Paid time off
- Flexible work arrangements
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Site Reliability Engineer
BMOAt BMO we are driven by a shared Purpose: Boldly Grow the Good in business and life. It calls on us to create lasting, positive change for our customers, our communities and our people. By working together, innovating and pushing boundaries, we transform lives and businesses, and power economic growth around the world. As a member of the BMO team you are valued, respected and heard, and you have more ways to grow and make an impact. We strive to help you make an impact from day one – for yourself and our customers. We’ll support you with the tools and resources you need to reach new milestones, as you help our customers reach theirs. From in-depth training and coaching, to manager support and network-building opportunities, we’ll help you gain valuable experience, and broaden your skillset. To find out more visit us at BMO Careers .
Application Deadline: 04/02/2026 Address: VIRTUAL(R)59 - REMOTE/TELETRAVAIL - ON - BMO Job Family Group: Technology Designs how code is deployed, configured, and monitored, as well as the availability, latency, change management, emergency response, and management capacity of services in production. Helps teams to determine what new features can be incorporated and when by using service-level agreements (SLAs) to define the required reliability of the system through service-level indicators (SLI) and service-level objectives (SLO). Applies software engineering to automate IT operations tasks - e.g. production system management, change management, incident response, and emergency response. Acts as a link between the development and operations teams. Applies expertise to conduct chaos tests and performance test for critical business requirements. - Deploys, configures, and monitors code as well as the availability, latency, change management, emergency response, and management capacity of services in production. - Helps the development and operations teams establish Service level indicators (SLIs), Service level objectives (SLOs) and Error budgets. - Performs automation to increase efficiency and decrease risk like log analysis, performance tuning, patch application, testing of production settings, incident response, and post-mortem analysis. - Supports in system design consulting, platform management, and capacity planning. - Debugs production issues across services and levels of the technology stack. - Improves service health visibility by recording metrics, logs, and traces across all services in order to pinpoint the reasons of an incident. - Computes the cost of SLA breaches and assists management in calculating the impact of system reliability. Helps development and operations teams understand the cost of downtime. - Focus is primarily on business/group within BMO; may have broader, enterprise-wide focus. - Exercises judgment to identify, diagnose, and solve problems within given rules. - Works independently on a range of complex tasks, which may include unique situations. - Broader work or accountabilities may be assigned as needed. - Take measured risks while protecting the bank by applying our Risk Management Framework in the execution of your role, in line with our Risk Culture and within our approved Risk Appetite, making sound and risk informed decisions that align to business strategy, protect assets, and adhere to applicable policy documents (Frameworks, Policies, Standards, Procedures and Supporting documents), laws and regulations. Qualifications: Foundational level of proficiency: - DevOps. - Cybersecurity and privacy concepts, principles and solutions. - Emotional agility. - IT infrastructure library. - Robot Process Automation. - Cloud Computing. - Configuration Management. - Container Orchestration. - System Design and Implementation. - Incident management. - Learning Agility. - Building and managing relationships. Intermediate level of proficiency: - API Management. - Automation and Automation Pipelines. - Automated Testing. - Quality Assurance and Control. - Verbal & written communication skills. - Collaboration & team skills. - Analytical and problem solving skills. - Data driven decision making. - Typically between 4 - 6 years of relevant experience and post-secondary degree in related field of study or an equivalent combination of education and experience. - Technical proficiency gained through education and/or business experience. Salary: $61,600.00 - $113,900.00 Pay Type: Salaried The above represents BMO Financial Group’s pay range and type. Salaries will vary based on factors such as location, skills, experience, education, and qualifications for the role, and may include a commission structure. Salaries for part-time roles will be pro-rated based on number of hours regularly worked. For commission roles, the salary listed above represents BMO Financial Group’s expected target for the first year in this position. BMO Financial Group’s total compensation package will vary based on the pay type of the position and may include performance-based incentives, discretionary bonuses, as well as other perks and rewards. BMO also offers health insurance, tuition reimbursement, accident and life insurance, and retirement savings plans. To view more details of our benefits, please visit: https://jobs.bmo.com/global/en/Total-Rewards About Us At BMO we are driven by a shared Purpose: Boldly Grow the Good in business and life. It calls on us to create lasting, positive change for our customers, our communities and our people. By working together, innovating and pushing boundaries, we transform lives and businesses, and power economic growth around the world. As a member of the BMO team you are valued, respected and heard, and you have more ways to grow and make an impact. We strive to help you make an impact from day one – for yourself and our customers. We’ll support you with the tools and resources you need to reach new milestones, as you help our customers reach theirs. From in-depth training and coaching, to manager support and network-building opportunities, we’ll help you gain valuable experience, and broaden your skillset. To find out more visit us at https://jobs.bmo.com/ca/en. BMO is committed to an inclusive, equitable and accessible workplace. By learning from each other’s differences, we gain strength through our people and our perspectives. Accommodations are available on request for candidates taking part in all aspects of the selection process. To request accommodation, please contact your recruiter. Note to Recruiters: BMO does not accept unsolicited resumes from any source other than directly from a candidate. Any unsolicited resumes sent to BMO, directly or indirectly, will be considered BMO property. BMO will not pay a fee for any placement resulting from the receipt of an unsolicited resume. A recruiting agency must first have a valid, written and fully executed agency agreement contract for service to submit resumes.
Senior DevOps Engineer
AllCloudAllCloud is a leader in amplifying organizations’ cloud potential through AI. With a track record of hundreds of successful migrations and implementations across AWS and Salesforce, AllCloud has developed strategies and solutions that enable businesses of all sizes to remain at the forefront of innovation. AllCloud serves clients across the globe with offices in EMEA and North America.
Senior DevOps Engineer Location: USA Job Type: Full-time, Permanent About AllCloud AllCloud is a leader in amplifying organizations’ cloud potential through AI. With a track record of hundreds of successful migrations and implementations across AWS and Salesforce, AllCloud has developed strategies and solutions that enable businesses of all sizes to remain at the forefront of innovation. AllCloud is a leader in AI-led professional and managed services. As an AWS Premier and audited managed services Partner, and Salesforce Consulting partner, AllCloud provides comprehensive AI-led cloud journey support, from initial migration to ongoing management through our Engage Managed Services. Our expertise ensures that clients remain aligned with ecosystem best practices while focusing on their core business growth. AllCloud serves clients across the globe with offices in EMEA and North America. www.allcloud.io Job Summary AllCloud is looking for a Senior DevOps Engineer, The primary role of the Senior DevOps Engineer is to design, architect, and lead the implementation of complex cloud environments. This role requires deep technical expertise, ownership of DevOps strategy, and the ability to mentor engineers while driving best practices across the organization. Summary of Key Responsibilities - Design and own cloud-native architectures and DevOps frameworks - Deployment of advanced services on the cloud - Lead design and implementation of scalable automation solutions - Maintain a feedback loop with the company’s customers to ensure deliveries are in accordance with the customer’s needs and expectations - Debug, troubleshoot and fix technical issues - Setup large-scale monitoring and logging solutions - Drive infrastructure scalability, performance optimization, and cost efficiency - Own CI/CD strategy and platform reliability at scale - Mentor and guide junior/mid-level engineers - Lead technical decision-making architecture reviews Requirements Summary of Experience - Expert-level understanding of networking, Linux and Windows administration - 5-8 years of experience in DevOps / Cloud Engineering roles - Experience with at least one of the following: Amazon Web Services, Microsoft Azure, Google Cloud Platform, VMware - Familiarity with the Agile methodology - Scripting abilities in at least one of the following: Bash, Python, Ruby, Pearl, PowerShell - Creative troubleshooting skills and out-of-the-box thinking - Ability to learn various technologies and topics Advantages - Experience with system monitoring platforms (Datadog, Prometheus) - Experience with centralized logging platforms (Graylog2, ELK) - Familiarity with version control tools (Git, SVN) - Configuration management knowledge (Puppet, Chef, Ansible) - Experience with CI/CD platforms (Terraform, Jenkins, TeamCity, TravisCI, Codeship) - Experience with Docker containers and orchestrators (Kubernetes, ECS, Swarm) - Information security orientation - Experience with production environments Why work for us? Our team inspires progress in each other and in our customers through our relentless pursuit of excellence; you will work with leaders who promote learning and personal development. We offer competitive salaries, bonus incentives, benefits, flexible hours, and mentoring. Apply now to become part of the team. AllCloud is an Equal Opportunity Employer and considers applicants for employment without regard to race, color, religion, sex, orientation, national origin, age, disability, genetics or any other basis forbidden under federal, provincial, or local law.
DevOps Engineer
Apptiva Core TechnologiesThe best partner in the Digital Transformation of your company.
• Collaborate with product development teams. • Focus on automating builds via CI/CD pipelines and getting code into production efficiently. • Support services before they go live via system design consulting. • Develop software platforms and frameworks, capacity planning and launch reviews. • Use automation to scale systems sustainably and improve reliability, speed and security. • Work with IT Product teams to deploy newer software versions in Cloud resources. • Collaborate with DevOps teams to ensure new environments meet requirements and best practices.
DevOps Engineer
AnteeloIdeate. Design. Evolve. Build and Transform your business through digital product strategy, design and development.
• Setup and maintain DevOps tools • Deploying, automating, maintaining AWS and GCP cloud systems • Assist in system troubleshooting and problem solving • Suggest architecture and process improvements



