Transforming the health of the communities we serve, one person at a time.
Senior Site Reliability Engineer
Location
United States + 1 moreAll locations: United States | Tunisia
Posted
55 days ago
Salary
$87K - $161K / year
Seniority
Senior
No structured requirement data.
Job Description
Senior Site Reliability Engineer
Centene Corporation
You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. As a diversified, national organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility. Position Purpose: Helps lead projects that are focused on Disaster Recovery, managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools, manual and automated procedures, documentation, people and processes and continuous delivery(CI/CD) tools, processes, and designs. Develops complex services to automate monitoring activities and provide critical information to facilitate response and resolution of performance and availability issues and incidents. Understands and advocates for standardized and scalable software tools to ensure that systems operate without interruption at optimum performance and leads project teams through out the deployment process. Troubleshoots and analyzes service disruptions to determine the root cause of issues and develop solutions for improved reliability. - Assists application development teams create a Disaster Recovery playbook - Troubleshoots and resolves more complex problems with systems and services and initiates regular deployment of new versions of the systems and their subcomponents - Leads more complex projects focused on building and maintaining observability/monitoring for the application, monitoring key performance indicators, maintaining alerting, and continuously improving visibility. - Helps make decisions around periodic system validation and testing, service monitoring, and standing up new services/tools - Uses knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization - Identifies and implements necessary manual and automated procedures for improved collaborative response in real-time - Leads lower level Engineers in stress, security, and performance testing - Resolves issues that come up through support escalation - Keeps documentation and runbooks up to date to effectively deal with new incidents that might arise - Leads post incident reviews and documents findings for future informed decision making - Reviews proposals to optimize Software Development Life Cycle (SDLC) to boost service reliability and makes decisions around which proposals should move forward. - Communicates complex topics with development teams to investigate and document issues and leads internal team to develop solutions to mitigate them - Performs other duties as assigned - Complies with all policies and standards Education/Experience: A Bachelor's degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science) and Requires 4 – 6 years of related experience. Or equivalent experience acquired through accomplishments of applicable knowledge, duties, scope and skill reflective of the level of this position. Technical Skills: One or more of the following skills are desired. - Disaster Recovery - AWS - SQL - MongoDB Pay Range: $87,000.00 - $161,300.00 per year Centene offers a comprehensive benefits package including: competitive pay, health insurance, 401K and stock purchase plans, tuition reimbursement, paid time off plus holidays, and a flexible approach to work with remote, hybrid, field or office work schedules. Actual pay will be adjusted based on an individual's skills, experience, education, and other job-related factors permitted by law, including full-time or part-time status. Total compensation may also include additional forms of incentives. Benefits may be subject to program eligibility. Centene is an equal opportunity employer that is committed to diversity, and values the ways in which we are different. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or other characteristic protected by applicable law. Qualified applicants with arrest or conviction records will be considered in accordance with the LA County Ordinance and the California Fair Chance Act
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Design, implement, test and maintain our innovative software products • Take ownership of features and components • Extend existing systems and components as the business grows • Utilize test-driven development approach to deliver commercial quality code • Integrate the software artifacts with CI/CD frameworks • Mentor less experienced engineers • Perform code and design reviews
At Evolphin, we’re redefining how creative teams search, manage, and collaborate on content. While most companies apply AI to text, we’ve gone further: our platform embeds and indexes actual media files — video, image, design, and audio — enabling true semantic search across time-coded content and millions of creative assets. Our flagship platform, Zoom MAM, is trusted by global broadcasters, agencies, and top brands including Inter Milan FC, Merck, Mercedes Benz to power their visual workflows. We are looking for a DevOps Engineer with 6+ years of hands-on experience to build, scale, and optimize the infrastructure that powers our high-performance, media-centric applications. This role is critical in ensuring reliability, security, scalability, and continuous delivery across our platforms. Key Responsibilities - Design and manage scalable, secure, and highly available AWS infrastructure - Set up and operate Bedrock, OpenSearch, and DocumentDB for AI-driven and high-volume data workflows - Manage and optimize Kubernetes clusters (pods, services, autoscaling, networking) - Build and maintain robust CI/CD pipelines for faster and reliable releases - Implement Infrastructure as Code (IaC) for automated provisioning and environment consistency - Monitor system health, performance, and reliability using observability tools - Optimize cloud costs and resource utilization - Ensure high availability, backup, and disaster recovery strategies - Collaborate with Dev, QA, ML, and Product teams to improve deployment efficiency - Enforce security best practices across infrastructure and pipelines - Manage deployments on AWS ECS / Fargate and container registries (ECR). - Maintain environment parity across dev, staging, and production. - Integrate security scanning tools: SonarQube, Trivy, AWS Security Hub, AWS GuardDuty, and AWS Config. Required Skills & Experience - 6+ years in DevOps / Cloud Engineering roles - Strong hands-on experience with the AWS ecosystem — EC2, S3, ECS, Fargate, ALB, VPC, SQS, RDS, Elasticache, and more. - Proven experience managing OpenSearch clusters and performance tuning - Experience with DocumentDB or similar NoSQL databases - Hands-on experience with AWS Bedrock or ML infrastructure is highly preferred - Strong expertise in Kubernetes (pods, scaling, deployments, networking) - Experience with CI/CD tools and automation pipelines - Proficiency in Terraform or CloudFormation - Strong understanding of Linux, networking, and system design - Experience with monitoring, logging, and alerting systems - Scripting knowledge (Bash/Python) Good to Have - Experience with AI/ML pipelines and model deployment - Exposure to media workflows, video processing, or large asset systems - Knowledge of search optimization and indexing strategies - Understanding of security compliance (SOC2, ISO, etc.) What We’re Looking For - Strong ownership and problem-solving mindset - Ability to work in a fast-paced, cross-functional environment - Focus on reliability, scalability, and continuous improvement - Balance between speed, cost, and system stability
• Design and maintain a comprehensive observability platform using Grafana, Prometheus, Loki, and Tempo. • Implement proactive monitoring and alerting for: • Microservices and APIs (latency, error rates, availability) • Batch jobs, scheduled workloads, and ETL/data pipelines (success/failure, duration, SLA adherence) • Server and container health (CPU, memory, disk, network, capacity trends) • Database health and performance (availability, replication, query latency, resource utilization) • Application and infrastructure logging, including centralized log ingestion, indexing, and search. • Build actionable alerts with clear runbooks, ownership, and escalation paths to minimize mean time to detect (MTTD) and mean time to resolve (MTTR). • Partner with application, platform, and DevOps teams to instrument services with metrics, traces, and structured logs. • Continuously improve signal quality by reducing alert noise, eliminating false positives, and optimizing thresholds based on historical trends. • Create and maintain dashboards for real-time operational visibility and executive-level health reporting. Support incident response and post-incident reviews by providing high-fidelity telemetry and contributing to root cause analysis.
Customer Engineer – Infrastructure – Azure Monitor - F/M/D
CNXWe're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future.
Job Title: Customer Engineer – Infrastructure – Azure Monitor - F/M/D Job Description We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future. We’re solution-focused, tech-powered, intelligence-fueled. With unique data and insights, deep industry expertise, and advanced technology solutions, we’re the intelligent transformation partner that powers a world that works, helping companies become refreshingly simple to work, interact, and transact with. We shape new game-changing careers in over 70 countries, attracting the best talent. The Concentrix Technical Products and Services team is the driving force behind Concentrix’s transformation, data, and technology services. We integrate world-class digital engineering, creativity, and a deep understanding of human behavior to find and unlock value through tech-powered and intelligence-fueled experiences. We combine human-centered design, powerful data, and strong tech to accelerate transformation at scale. You will be surrounded by the best in the world providing market leading technology and insights to modernize and simplify the customer experience. Within our professional services team, you will deliver strategic consulting, design, advisory services, market research, and contact center analytics that deliver insights to improve outcomes and value for our clients. Hence achieving our vision. Our game-changers around the world have devoted their careers to ensuring every relationship is exceptional. And we’re proud to be recognized with awards such as "World's Best Workplaces," “Best Companies for Career Growth,” and “Best Company Culture,” year after year. Join us and be part of this journey towards greater opportunities and brighter futures.Customer Engineer – Infrastructure – Azure Monitor Job Description: The Azure Monitor Customer Engineer will work directly with customers, as a consultant and technical advisor to: - Design, Deploy, Review and Assess the health of the infrastructure - Upgrade and maintain deployments - Troubleshoot issues with infrastructure and agents - Tune and optimize for performance - Assist with reporting and visualizations - Implement new management packs - Assist in the development of custom management packs - Provide training in all areas of Azure Monitor to ensure customer goals are met Ideal candidate experience: 15+ years working as a depth expert and technology owner or consultant for Azure monitor Ability to present to multiple levels of customer leadership. Ability to act as a consultant and architect for multiple customers. Broad knowledge across multiple monitoring scenarios: - Windows and Linux Operating Systems - Azure Monitor - KQL Kusto Query language advanced level - URL, Network monitoring - Connecting to ITSM systems - Dashboards, Reporting, and Visualizations - PowerShell scripting Deep level knowledge in at least 3 of the above categories Technical Skills Requirements: Azure Monitor: Broad knowledge of ALL the below areas, with deep understanding of (at least) 4 of the following: Deep understanding of Azure Monitor architecture (metrics vs logs, data flow, ingestion, retention) Strong knowledge of: - Log Analytics workspaces - Azure Monitor Metrics - Diagnostic settings - Resource‑level vs platform‑level telemetry Ability to explain when to use Azure Monitor vs Azure Data Explorer / Grafana / third‑party tools. Additionally, be able to ; - Write complex KQL queries across multiple tables - Use: - parse, extend, mv-expand - joins, time series, summarize patterns - performance‑optimized queries - Build: - reusable queries - functions - summary rules for cost & performance optimization - Debug slow or expensive queries Tools - Visual Studio, Silect, MPViewer, Alert Update Connector, PowerShell Linux OS and Linux Monitoring Report Development Network Monitoring URL Monitoring Related Skills: - System Center Orchestrator - System Center Data Protection Manager - System Center Virtual Machine Manager - System Center Service Manager This position requires a fluent German and English level. #WAH #LI-Remote Location: DEU Work-at-Home Language Requirements: Time Type: Full time



