Speed, Accuracy, and Cost savings... experience the TalentWerx difference.
Reliability Engineer IV
Location
United States
Posted
53 days ago
Salary
$120.2K - $127.0K / year
Seniority
Lead
Job Description
TalentWerx
- Ensure the availability, performance, monitoring, and incident response of cloud platforms and services
- Ensure compliance with requirements for production
- Manage failures and resource issues
- Use metrics like MTTR and MTTF
- Develop technical solutions to complex problems
- Document findings and conduct root cause analysis
- Collaborate with engineering and development teams
- Monitor production equipment diagnostics
- Recommend design and process modifications to improve reliability
Job Requirements
- Bachelor's degree with 8 years of experience, or a Master’s degree with 6 years of experience
- Active Secret Clearance
- Recognized as an emerging authority in reliability engineering
- Ability to develop and implement reliability solutions
- Strong analytical skills
- Proficiency in reliability modeling, failure mode analysis, and predictive maintenance methodologies
- DoD 8570 / 8140 IAT Level II certification
- At least one cloud certification
Benefits
- health and wellness programs
- income protection
- paid leave
- retirement and savings
More DevOps Engineer Jobs
Agentic DevOps Lead – Data & AI
Accenture
Accenture Federal Services, a division of Accenture, provides technology and consulting services to U.S. federal agencies, delivering solutions that enhance performance and efficie
Accenture Technology: Through unmatched industry experience, leading technologies from our ecosystem partners and startups, and the largest delivery network in the world, we provide a powerful range of capabilities that can be tailored to our clients’ most complex business needs. With over 100 innovation hubs deployed around the world, we help clients continuously innovate at speed and at scale so they can outpace their peers. You will bring innovation, intelligence, and industry experience together with the newest technologies to help clients innovate at scale and transform their businesses. Visit us at www.accenture.com.
We Are: The beginning of a new Data & AI decade, one that will reshape work and society, has arrived. Accenture is stepping boldly into this future with a clear strategy and purpose: to help clients optimize and reinvent their business with data and AI. Backed by a $3B investment and a deep commitment to our people, we enable industry-defining work at global scale. With over 45,000 Data & AI professionals, Accenture’s Data & AI organization combines experienced innovation, strategic investment, exceptional talent, and a powerful ecosystem to deliver transformational outcomes for our clients.
You Are: As an Artificial Intelligence and Machine Learning Computational Science professional, you will play a pivotal role in formulating real-world problems into practical, efficient, and scalable AI and Machine Learning solutions. You will be responsible for developing and implementing cutting-edge artificial intelligence solutions that drive innovation and enhance performance. You will collaborate with cross-functional teams and leverage your expertise in machine learning, deep learning, and data analysis to solve complex problems and deliver impactful AI-driven solutions.
The Work: We are seeking an experienced Agentic DevOps Lead to lead our Agentic DevOps initiatives.
This role is pivotal in scaling our Generative AI agentic solutions across diverse cloud environments. You will architect and operationalize a reusable, portable Agentic DevOps framework that ensures production readiness, observability, and deployment efficiency for agentic applications. You will lead a team of reinventors (engineers, architects, and DevOps specialists) focused on delivering industry-leading agentic systems that are robust, scalable, and client-ready.
Key Responsibilities:
- Lead Agentic DevOps Strategy: Define and implement scalable DevOps frameworks for agentic systems using LangGraph, Crew AI, Autogen, and other orchestration tools.
- Framework Development: Build reusable scaffolding for agent lifecycle management, orchestration, monitoring, and metering.
- Cloud-Native Deployment: Architect and manage CI/CD pipelines for public and private cloud environments (AWS, Azure, GCP).
- Production Readiness: Ensure agentic applications meet enterprise-grade standards for security, reliability, and compliance.
- Team Leadership: Mentor and manage cross-functional teams across DevOps, AI engineering, and client enablement.
- Client Enablement: Collaborate with solution architects to tailor deployments for client environments and ensure seamless onboarding.
Travel may be required for this role. The amount of travel will vary from 0 to 100% depending on business need and client requirements.
Required Qualifications & Experience:
- 15+ years of professional experience, supported by relevant educational qualifications
- Strong background in Artificial Intelligence, Machine Learning, Computational Science, or Applied Machine Learning
- Proven experience leading DevOps or platform engineering initiatives for AI-driven systems
- Hands-on experience with Generative AI agentic frameworks and orchestration tools
- Deep expertise in cloud-native architectures and CI/CD pipelines
- Demonstrated experience deploying and operating solutions across AWS, Microsoft Azure, and/or Google Cloud Platform (GCP)
- Strong leadership, communication, and stakeholder management skills
Preferred Qualifications:
- Experience designing and operating multi-agent systems in production environments
- Familiarity with AI observability, monitoring, and governance frameworks
- Experience working in large-scale enterprise or consulting environments
- Exposure to regulated or security-sensitive industries
Why Join Us?
- We offer a transparent, fast-paced approach to career progression, with a focus on your strengths and continuous coaching from senior colleagues
- You will benefit from working alongside Accenture experts who are solving some of the biggest industry challenges with innovative thinking and pioneering tools
- Flexible work arrangements and a range of benefits including competitive rewards
- You will have access to state-of-the-art technology that will give you the opportunity to deepen your existing skills even as you help create the latest business trends
- You will also have opportunities to make a difference to the communities in which we work and live
Next Steps: If this sounds like the ideal role, career and company for you, click to apply. To learn more about life @AccentureMiddleEast, follow us on social media and keep up with our latest news.
Accenture Middle East: LinkedIn, Instagram, Facebook, Twitter, YouTube
About Accenture
Accenture is a leading global professional services company that helps the world’s leading businesses, governments and other organizations build their digital core, optimize their operations, accelerate revenue growth and enhance citizen services, creating tangible value at speed and scale. We are a talent- and innovation-led company with approximately 791,000 people serving clients in more than 120 countries. Technology is at the core of change today, and we are one of the world’s leaders in helping drive that change, with strong ecosystem relationships. We combine our strength in technology and leadership in cloud, data and AI with unmatched industry experience, functional expertise and global delivery capability. Our broad range of services, solutions and assets across Strategy & Consulting, Technology, Operations, Industry X and Song, together with our culture of shared success and commitment to creating 360° value, enable us to help our clients reinvent and build trusted, lasting relationships. We measure our success by the 360° value we create for our clients, each other, our shareholders, partners and communities. Visit us at www.accenture.com
Equal Employment Opportunity Statement
We believe that no one should be discriminated against because of their differences. All employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, sexual orientation, gender identity or expression, marital status, citizenship status or any other basis as protected by applicable law. Our rich diversity makes us more innovative, more competitive, and more creative, which helps us better serve our clients and our communities.
Senior Java Developer
Accepted
Highly motivated [Dedicated S/W Engineering SCRUM Teams], focused on customer's success
Working model: Remote | Type: Full-time
Accepted is a software and digital transformation services firm helping clients accelerate innovation in Finance, Energy, Gaming, Telco, and beyond. With 20+ years of engineering excellence, we’re known for building outcome-driven solutions and high-performing teams that feel like part of your own. We’re looking for a Senior Java Developer to strengthen our hybrid delivery teams.
What You’ll Do
- Play a key role in evolving a high-impact, real-time platform, working hands-on with a complex Java (Spring Boot) microservices architecture;
- Dive deep into existing services to uncover, refine, and document critical business logic, laying the groundwork for a next-generation, AI-native system;
- Actively contribute to a major platform transformation, helping reshape the system, while ensuring seamless continuity and delivery;
- Design and build scalable, distributed solutions using modern technologies, tackling real-world complexity at scale;
- Leverage cutting-edge AI tools and agent-based approaches to rethink how software is built, driving innovation toward spec-driven, intelligent systems.
- Keep Hazelcast cloud-based production systems running smoothly 24/7/365
Design and Development:
- Design, develop, and maintain our cloud infrastructure to support both our end-user management center and microservice-based platform
- Implement new solutions using AWS and Terraform, improving scalability, throughput, and reliability
- Support and manage our Keycloak IDP, ensuring it provides appropriate security while meeting the needs of the development team
Security and Integration:
- Implement security measures to protect data integrity and confidentiality, including encryption, access control, and compliance with relevant regulations
- Work with our operations team to maintain our SOC2 & ISO27001 compliance and keep our environment secure
Monitoring and Maintenance:
- Monitor the system for performance issues, errors, and potential failures, and implement maintenance procedures such as backups, data recovery, and disaster recovery plans
- Troubleshoot issues related to data storage, including performance bottlenecks, data corruption, or compatibility issues with other software components
Collaboration:
- Collaborate with cross-functional teams, including software developers, architects, and product managers, to ensure the effective integration and operation of the components within the overall software infrastructure
- Document design decisions, implementation details, and operational procedures to facilitate collaboration among team members and ensure the maintainability of the system
Continuous Learning:
- Stay updated with the latest developments in storage technologies, the Java programming language, and software engineering best practices, and apply this knowledge to improve existing storage systems and develop new solutions
On-call participation:
- Be part of our on-call rotation to respond to availability incidents and work with support and engineers on customer incidents
- Oversee, lead, and manage projects of a cross-functional team of 5 DevSecOps engineers and cloud engineers in delivering secure, reliable, and scalable infrastructure and deployment pipelines across Azure and AWS.
- Apply knowledge of Infrastructure as Code with Terraform, automated CI/CD (Azure DevOps), and container-oriented architectures to drive 3E modernization initiatives, embed security throughout the software lifecycle, and accelerate AI adoption to improve delivery velocity and operational insight.
- Lead, guide, and develop engineering talent; set objectives and key results, plan and oversee projects, timelines, and objectives, conduct performance reviews, and foster an inclusive, high-performance culture.
- Define, plan, and own the DevSecOps roadmap aligned to business objectives, cloud modernization, and AI-enabled capabilities.
- Design, develop, and maintain resilient cloud solutions in collaboration with application, security, data, and product teams that meet evolving requirements while continuously improving existing cloud applications.
- Provide hands-on technical leadership to design, build, and operate secure CI/CD pipelines using Azure DevOps and Terraform across multi-cloud (Azure & AWS) environments.
- Architect, implement, and maintain containerized workloads (Docker, Kubernetes, ECS) with built-in security scanning, policy enforcement, and automated scaling.
- Establish, monitor, and improve SRE-driven metrics, dashboards, and incident response practices to meet SLOs, compliance mandates, and operational excellence targets.
- Drive adoption of AI/ML for predictive anomaly detection, generative code analysis, and AI-driven infrastructure optimization.
- Stay current on industry trends, emerging cloud services, and regulatory changes; disseminate knowledge through brown bags, documentation, and internal communities.
- Ensure documentation, runbooks, and knowledge transfer are maintained and continuously improved to support on-call engineers and audit requirements.



