Job Closed
This listing is no longer active.
Director, Site Reliability Engineer
Location
Kentucky + 1 moreAll locations: Kentucky | Utah
Posted
56 days ago
Salary
0
Seniority
Lead
Job Description
Director, Site Reliability Engineer
Cision
• Provide strategic leadership and oversight for four SRE teams, setting clear direction, priorities, and expectations aligned to business and engineering objectives • Lead, mentor, and develop SRE managers and senior engineers, fostering a culture of accountability, operational ownership, innovation, and psychological safety • Define and own the SRE and Platform Engineering strategy and roadmap, ensuring alignment with cloud transformation initiatives and long-term organizational goals • Serve as a key voice in architectural and platform decisions, influencing designs with a focus on scalability, reliability, automation, and operational efficiency • Partner with executive leadership to communicate reliability posture, risks, and investment needs in clear business terms • Establish and continuously evolve SRE principles and best practices, including SLIs, SLOs, error budgets, toil management, and reliability-driven prioritization • Provide technical direction and governance across GCP (preferred) and AWS environments, ensuring consistent reliability and operational patterns • Drive the evolution of Platform Engineering, enabling self-service infrastructure and guard-railed service delivery for application teams • Own strategy and standards for Infrastructure-as-Code (IaC) and automation, leveraging tools such as Terraform or equivalent frameworks across cloud environments • Ensure observability excellence through metrics, logging, tracing, alerting, and proactive capacity and performance management • Provide executive leadership during large-scale or high-impact incidents, ensuring effective coordination, escalation, and stakeholder communication • Define, refine, and scale incident management and on-call practices, emphasizing resilience, sustainability, and rapid recovery • Champion blameless postmortems, ensuring root causes are addressed and learnings are translated into systemic improvements • Partner with Security and Compliance teams to ensure systems meet security, privacy, and regulatory requirements without compromising reliability • Own and report on reliability metrics, operational KPIs, and service health for leadership and executive stakeholders • Drive continuous improvement through reliability reviews, retrospectives, and data-driven decision-making • Balance reliability, velocity, and cost across platforms, applying error budgets and capacity planning to guide trade-offs
Job Requirements
- 10+ years of experience in SRE, infrastructure, platform, or systems engineering roles, with 5+ years leading managers and senior technical teams
- Direct, hands-on experience in Site Reliability Engineering, including operating production systems at scale
- Strong experience with Google Cloud Platform (GCP) or equivalent public cloud (AWS or Azure), including distributed, cloud-native architectures
- Proven expertise in Infrastructure-as-Code (IaC) and automation frameworks (e.g., Terraform or similar)
- Deep understanding of observability ecosystems (metrics, logging, tracing), CI/CD pipelines, and DevOps/SRE tooling
- Ability to communicate complex technical concepts clearly to both technical and non-technical stakeholders, influencing at all levels of the organization.
Benefits
- Competitive total rewards (base salary + bonus, if applicable)
- Customizable benefits package (3 medical plans with Health Saving Account company match)
- Generous paid time off for non-exempt team members, starting with 3 weeks + 13 paid holidays, including 2 personal floating holidays
- Flexible time off for exempt team members + 13 paid holidays
- Paid parental leave (including maternity + paternity leave)
- Education assistance opportunities and free LinkedIn Learning access
- Free mental health and family planning programs, including adoption assistance and fertility support
- 401(K) program with company match
- Pet insurance
- Employee resource groups
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Software Engineering, DevOps AI Rater/Evaluator
LILT AIMake anything multilingual. Translation, AI data set creation, and human expert evals. For businesses and governments.
• Evaluate AI outputs related to software engineering, DevOps, and infrastructure topics • Perform structured scoring, comparison, classification, and judgment tasks • Assess technical correctness, completeness, security implications, and best-practice alignment • Identify hallucinations, incorrect code, unsafe recommendations, or misleading system guidance • Apply domain-specific engineering and DevOps guidelines consistently across tasks • Validate and refine evaluation rubrics and edge-case handling • Perform adjudication where raters disagree • Conduct error analysis and qualitative reviews of model behavior • Partner with LILT research, product, and customer teams on evaluation design • Support red-teaming, security review, and model readiness assessments
• Help drive reliability, automation and performance within our cloud-based infrastructure • Coordinate and support daily activities for SREs on the team and partner with their managers to determine approach for managing daily tasks • Track success on the team based on established goals and objectives • Work on issues of limited scope with the ability to find and execute solutions to routine problems • Become embedded within an Engineering team helping them navigate production excellence and advocate for best practices • Mentor team members and drive initiatives • Drive a design for a feature while understanding system-wide and architectural concerns • Understand the basic day-to-day tasks traits of a production environment and participate in on-call support • Engage and collaborate with other disciplines within the design, deployment, operation and optimization of services • Debug production issues across services and levels of the stack as well as practice incident response and blameless postmortems • Identifies opportunities both in processes and tools to improve the overall productivity of the team • Identify great talent and excite them to join our team • Provide estimations, track progress and manage risk as well as team members' time • Participate in an on-call shift along with other disciplines to respond to incidents • Become involved in tech communities and add contributions to enhance them • Lean into our business domain and needs as well as our company vision, mission and strategy to deliver on our short and long term goals
About the Role: Grade Level (for internal use): 08Job Title: DevOps Support – Associate (Flex Shift) Location: India (IST) We are seeking a talented DevOps Support Associate to join our S&P Global MI Cloud Engineering Shared Services - Platform Engineering team. This position requires the associate to work a flexible shift tailored to coincide with EU business hours, ensuring optimal collaboration and support for our EU based operations. We facilitate software development by owning and managing vendor hosted applications such as Gitlab, Azure DevOps, GitHub, Artifactory, Jira, Confluence, Mend, SonarQube and TeamCity. This position will play a crucial role in providing technical support to our clients, ensuring smooth operation and resolution of any issues that arise. The ideal candidate will possess excellent technical skills, strong problem-solving abilities, and a passion for helping both our internal developer teams as well as external clients by providing a great developer experience. Shift Timing: The selected candidate will be required to work from 11:30 AM to 8:30 PM IST. This schedule allows for substantial overlap with EU business hours and facilitates timely communication and project continuity with our U.S. teams. Responsibilities: - Provide real-time support to clients and internal teams during EU business hours. - Manage and troubleshoot issues reported by clients. - Collaborate with U.S.-based teams for resolution with the use of Splunk Observability and PagerDuty alerts acknowledgement. - Document all support interactions and resolutions accurately in ServiceNow - mySolutions ticketing system daily. - Proactively identify opportunities for process improvements to enhance the Developer Experience with our applications such as automating our manual in take process and current BAU work efficiently. - Assist in creating and updating our existing documentations and knowledge base articles in Confluence. Requirements: - Bachelor’s degree in computer science, or related field, or equivalent work experience with 2 years of experience in cloud computing and applied solutions. - Willingness to work assigned flex shift hours to support EU business operations. - Excellent communication skills with the ability to engage effectively with global teams. - Experience in AWS in deployment using Terraform or CloudFormation. - Experience in developing CI/CD pipelines with Gitlab, GitHub, TeamCity, and Azure DevOps. - Experience in troubleshooting using Linux operating systems. - Solid understanding of languages such as Java, Python, Bash, PowerShell and the importance of automation. - Strong problem-solving skills with the ability to think creatively to resolve issues. - Ability to multitask and prioritize workload effectively in a fast-paced environment. - Experience working with ticketing systems and knowledge base tools is a plus. About S&P Global Market Intelligence At S&P Global Market Intelligence, a division of S&P Global we understand the importance of accurate, deep and insightful information. Our team of experts delivers unrivaled insights and leading data and technology solutions, partnering with customers to expand their perspective, operate with confidence, and make decisions with conviction. For more information, visit www.spglobal.com/marketintelligence. What’s In It For You? Our Mission: Advancing Essential Intelligence. Our People: We're more than 35,000 strong worldwide—so we're able to understand nuances while having a broad perspective. Our team is driven by curiosity and a shared belief that Essential Intelligence can help build a more prosperous future for us all.From finding new ways to measure sustainability to analyzing energy transition across the supply chain to building workflow solutions that make it easy to tap into insight and apply it. We are changing the way people see things and empowering them to make an impact on the world we live in. We’re committed to a more equitable future and to helping our customers find new, sustainable ways of doing business. Join us and help create the critical insights that truly make a difference. Our Values: Integrity, Discovery, Partnership Throughout our history, the world's leading organizations have relied on us for the Essential Intelligence they need to make confident decisions about the road ahead. We start with a foundation of integrity in all we do, bring a spirit of discovery to our work, and collaborate in close partnership with each other and our customers to achieve shared goals. Benefits: We take care of you, so you can take care of business. We care about our people. That’s why we provide everything you—and your career—need to thrive at S&P Global. Our benefits include: - Health & Wellness: Health care coverage designed for the mind and body. - Flexible Downtime: Generous time off helps keep you energized for your time on. - Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills. - Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs. - Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families. - Beyond the Basics: From retail discounts to referral incentive awards—small perks can make a big difference. For more information on benefits by country visit: https://spgbenefits.com/benefit-summaries Global Hiring and Opportunity at S&P Global: At S&P Global, we are committed to fostering a connected and engaged workplace where all individuals have access to opportunities based on their skills, experience, and contributions. Our hiring practices emphasize fairness, transparency, and merit, ensuring that we attract and retain top talent. By valuing different perspectives and promoting a culture of respect and collaboration, we drive innovation and power global markets. Recruitment Fraud Alert: If you receive an email from a spglobalind.com domain or any other regionally based domains, it is a scam and should be reported to reportfraud@spglobal.com. S&P Global never requires any candidate to pay money for job applications, interviews, offer letters, “pre-employment training” or for equipment/delivery of equipment. Stay informed and protect yourself from recruitment fraud by reviewing our guidelines, fraudulent domains, and how to report suspicious activity here. ----------------------------------------------------------- Equal Opportunity Employer S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment. If you need an accommodation during the application process due to a disability, please send an email to: EEO.Compliance@spglobal.com and your request will be forwarded to the appropriate person. US Candidates Only: Know Your Rights: Workplace discrimination is illegal ----------------------------------------------------------- IFTECH203 - Entry Professional (EEO Job Group)
• Own platform security and reliability improvements across our GCP environment. • Harden identity and network controls in GCP, including IAM patterns, service accounts/workload identity, organization policies, and network segmentation. • Build security into CI/CD by implementing and enforcing SAST, SCA, secret detection, and container/image scanning. • Drive vulnerability management and reduce software supply chain risk across services, dependencies, container images, and build pipelines. • Lead threat modeling and security design reviews for new features and significant architecture changes. • Improve security observability by tuning telemetry, reducing alert noise, and building high-signal detections and dashboards. • Lead investigations and coordinate incident response for security alerts and incidents, and drive post-incident improvements. • Champion secure SDLC practices through standards, documentation, guardrails, and coaching for product engineering teams. • Define and maintain end-user device security standards, including requirements for EDR and remote access tooling, and partner with stakeholders for execution. • Support compliance and audit readiness by conducting internal security reviews and helping align practices with SOC 2, GDPR, and NIST frameworks.




