Job Closed

This listing is no longer active.

Element Solutions logo
Element Solutions

Element is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status, marital status, protected veteran status, or any other legally protected class. We believe in a world where solutions we build improve the lives of those who use them.

Sr. Site Reliability Engineer

Location

United States

Posted

88 days ago

Salary

$140K - $180K / year

Seniority

Senior

No structured requirement data.

Job Description

Sr. Site Reliability Engineer

Element Solutions

Who is Element? We serve as a partner at the intersection of innovation and our clients' needs, efficiently crafting meaningful user experiences for government and commercial customers. By breaking down complex problems to their fundamental elements, we create modern digital solutions that drive efficiencies, maximize taxpayer dollars, and deliver essential outcomes that serve the people. Why Work at Element? Make an impact that resonates-join our vibrant team and discover how you can improve lives through digital transformation. Our talented professionals bring unparalleled energy engagement, setting a higher standard for impactful work. Come be a part of our team and shape a better future. Position Summary The Senior Site Reliability Engineer (SRE) serves as the Technical Architecture & Stability Assessment Lead responsible for evaluating the reliability, scalability, and operational resilience of complex enterprise infrastructure environments. This role supports a structured 16-week technical assessment and optional implementation phase focused on identifying stability risks, mapping infrastructure dependencies, and strengthening existing architecture to support operational continuity during modernization initiatives. Element’s approach prioritizes practical stabilization over unnecessary redesign. Rather than introducing large-scale architectural transformations, this role emphasizes reinforcing the current infrastructure to withstand coexistence pressures and operational demands while sequencing improvements responsibly. Key Responsibilities - Conducting current-state infrastructure mapping across application, platform, and hosting layers, documenting and recommending improvements. - Performing dependency and integration analysis across interconnected enterprise systems. - Identifying single points of failure and systemic reliability risks. - Supporting datacenter transition modeling and infrastructure transition sequencing. - Conducting Citrix dependency analysis and migration sequencing recommendations. - Evaluating system performance, scalability, and operational resilience under high-volume workloads. - Providing modularization and decoupling recommendations to reduce operational fragility and support phased modernization and migration efforts. - Advising on supporting operations and development teams on continuity strategies that allow legacy and modern systems to operate reliably in tandem during modernization and migrations. Minimum Qualifications - Bachelor’s degree in Computer Science, Information Systems, Information Technology, Engineering, or a related technical discipline. - This individual brings 10+ years of professional experience in infrastructure engineering, site reliability engineering, or enterprise platform architecture, including experience supporting complex enterprise environments. - Demonstrated experience conducting enterprise architecture or infrastructure assessments. - Enterprise infrastructure architecture and systems engineering - Extensive experience in hybrid hosting environments and modernization and cloud migration planning. - Virtualization platform management. - Performance engineering and scalability assessments for high-volume systems. - Experience designing or advising on system resilience and operational continuity strategies. - Experience in Infrastructure stabilization during large-scale multi-application modernization or migration initiatives. - Strong analytical and systems-thinking abilities. - Excellent technical documentation and architecture diagramming skills. - Strong stakeholder communication and facilitation abilities. - US Citizenship or Permanent Residency required. - Must reside in the Continental US; located in the state of Pennsylvania a plus, but not required. - Depending on the government agency, specific requirements may include public trust background check or security clearance. Preferred Qualifications - Experience with Citrix App and Desktop Virtualization. - Certification in any of the following is a plus: - AWS Certified Solutions Architect - Google Professional Cloud Architect - Microsoft Certified: Azure Solutions Architect Expert - Certified Kubernetes Administrator - ITIL Foundation - Experience working with large public sector, healthcare system environments, or within the State/Commonwealth is a plus. - Familiarity with enterprise architecture frameworks (e.g., TOGAF or similar).Experience supporting legacy system environments undergoing modernization. $140,000 - $180,000 a year The likely salary range for this position is $140,000-$180,000. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range. Location Be in your Element. We are a remote-first company based in Washington, DC. Element is an equal opportunity employer All qualified applicants will receive consideration for employment without regard to age, ancestry, race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status, marital status, protected veteran status, or any other legally protected class. We believe in a world where solutions we build improve the lives of those who use them.

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Oddball logo

Junior DevOps Engineer

Oddball

Oddball is a software development company that focuses on designing and building tools for enterprises and institutions. The company delivers services, includin

DevOps Engineer88 days ago

• Support and maintain CI/CD pipelines using tools such as GitHub Actions and Jenkins. • Assist with provisioning and managing cloud infrastructure in AWS, including services like EC2, S3, and RDS. • Help automate infrastructure and environment configuration using Terraform or CloudFormation. • Support container-based application deployments using Docker. • Assist with monitoring and troubleshooting environments using tools such as Datadog and CloudWatch. • Write basic automation scripts using Python or Bash to improve deployment and operational workflows. • Collaborate with engineering teams operating in Agile development environments.

United States
$80K - $100K / year
Job Closed
ClickHouse logo

Senior Site Reliability Engineer

ClickHouse

ClickHouse, Inc. is a database management system that allows users to generate analytical reports using real-time SQL queries. The company’s technology works

DevOps Engineer88 days ago

• Collaborate with various engineering teams in ClickHouse to design and implement scalable, secure, and highly available systems for ClickHouse. • Establish and manage service level objectives (SLOs) and service level agreements (SLAs) for ClickHouse Cloud. • Ensure all the infrastructure components in ClickHouse Cloud (including Dataplane, Control Plane and ClickHouse Core) have monitoring and alerting in place to ensure timely detection and resolution of incidents. • Enhance and refine incident response processes and post-mortem analysis for any outages in ClickHouse Cloud including working with the support team to communicate to the impacted customers. • Continuously improve the reliability and performance of our ClickHouse services. • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities. • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize downtime.

United States
$141K - $208K / year
Aledade logo

IT Operations Engineer I

Aledade

Self-described as "a new company with an old-fashioned goal," Aledade aims to put healthcare control back into the hands of doctors. Headquartered in Bethesda, Maryland, the compan

DevOps Engineer88 days ago

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description As an IT Operations Engineer I, you are a vital contributor to the health, stability, and efficiency of our production environments. Sitting at the intersection of traditional systems administration and modern DevOps, you are responsible for deploying standard infrastructure components and ensuring that our systems remain reliable and secure. While this role focuses on the execution of foundational IT operations, you will work closely with Senior Engineers to automate manual processes and uphold rigorous compliance standards. You will be expected to understand how server and cloud uptime impacts the broader business, ensuring that every task—from server patching to incident resolution—is performed with accuracy, documentation, and a culture of continuous improvement in mind. Primary Duties - Hybrid Infrastructure & Identity Support: Deploy standard infrastructure components; assist in cloud computing architectures and identity migrations (e.g., AD to Microsoft Entra). - Automation & Modernization: Execute infrastructure tasks using scripting (PowerShell, Python); assist in managing VDI and computing infrastructure in Azure. - System Reliability & Incident Management: Resolve alerts/tickets in a timely fashion; participate in the On-Call rotation and support root-cause analysis (RCA) activities. - Security, Compliance & Audit: Maintain firewalls, automated patching, and security monitoring to ensure audit-readiness (ITGC, SOX, SOC II Type II). - Documentation & Standardization: Contribute to the team Wiki/SOP library; accurately estimate time for server configs and notify leads of potential risks. Qualifications - Education: Bachelor’s degree in Information Technology, Computer Science, or a related field. - Experience: 6+ years of experience in IT operations or similar roles, with demonstrated expertise in system administration and cloud network management. - Technical Skill: Strong analytical and problem-solving skills, with a focus on system efficiency and user satisfaction. Requirements - Proficiency in managing IT infrastructure, including security, networking, and systems administration. - Familiarity with IT compliance frameworks (ITGC, SOX, SOC II Type II, NIST) and security protocols. - Strong communication skills for effective collaboration across departments. - Experience identifying infrastructure gaps and contributing to complex project solutions. - Experience with Mobile Device Management tools. Physical Requirements - Environment: Prolonged periods of sitting; extensive use of computers and keyboards. - Physicality: Occasional walking and lifting may be required. - Availability: Must be available for on-call duties as necessary to maintain system uptime. Benefits - Flexible work schedules and the ability to work remotely are available for many roles. - Health, dental and vision insurance paid up to 80% for employees, dependents and domestic partners. - Robust time-off plan (21 days of PTO in your first year). - Two paid volunteer days and 11 paid holidays. - 12 weeks paid parental leave for all new parents. - Six weeks paid sabbatical after six years of service. - Educational Assistant Program and Clinical Employee Reimbursement Program. - 401(k) with up to 4% match. - Stock options. - And much more!

United States
Job Closed
Full TimeRemoteTeam 11-50Since 2015H1B No Sponsor

• Own and drive infrastructure projects end-to-end — from breaking down the problem into subtasks, through implementation, to communicating results to stakeholders. • We don't just "do tasks"; we solve problems and explain how and why. • Evolve Kubernetes (with Argo) and cloud infrastructure — mostly GCP. • Take part in cloud infrastructure unification. • Develop and maintain Terraform configurations for scalable, reliable systems. • Build and optimize CI/CD pipelines using GitHub Actions. • Strengthen observability with OpenTelemetry and Datadog. • Integrate and act on insights from AIkido and other security tools to detect and mitigate issues within workloads. • Support and tune PostgreSQL and other managed databases used by our applications. • Collaborate with engineering teams — proactively communicate progress, share context, and manage expectations. • Troubleshoot and resolve production issues as part of our on-call rotation. • Participate in internal and external security audits, ensuring our systems meet compliance and resilience standards. • Drive SRE and GitOps principles — from post-mortems to automation and clear documentation.

Poland
zł22K - zł30K / month
Job Closed