Job Closed

This listing is no longer active.

Coalfire

Cyber solutions that move you forward, faster.

Senior Site Reliability Engineer

DevOps EngineerDevOps EngineerFull Time Remote SeniorTeam 1,001-5,000Since 2001H1B SponsorCompany Site LinkedIn

Location

United States

Posted

55 days ago

Salary

$86K - $148K / year

Seniority

Senior

Bachelor Degree7 yrs expEnglishAnsible AWS Azure Cloud Google Cloud Platform Linux Terraform

Job Description

• Hands-on engineering work, including developing new deployments, automation scripts, and tooling to meet client deliverables focused on vulnerability management, infrastructure updates, and compliance requirements. • Develop and maintain Infrastructure-as-Code (IaC), utilizing Coalfire standard modules for Terraform, Ansible, and CI/CD pipelines across projects. • Partner with Technical Managers and engagement leads to evaluate risks, prioritize issues, and develop actionable mitigation plans across an SRE team’s portfolio of M&O clients. • Contribute to technical playbooks, standards, and frameworks for operational excellence in managed services delivery. • Own the patch management strategy for assigned environments, ensuring regulatory compliance and timely remediation of vulnerabilities. • Oversee Identity and Access Management (IAM), implementing and enforcing security best practices to protect sensitive data and ensure proper access controls. • Perform cloud administration and system administration tasks, such as provisioning resources, optimizing performance, and troubleshooting infrastructure issues. • Adhere to established quality standards for engineering deliverables, aligning with internal protocols, compliance regulations, and project deadlines. • Identify and communicate potential risks, working with relevant stakeholders to incorporate mitigation strategies that meet regulatory and client expectations. • Contribute to day-to-day agile project management tasks, including tracking progress, providing updates, and ensuring assigned activities are completed on schedule. • Mentor junior engineers, review their work, and help mature engineering practices.

Job Requirements

5–7 years in systems engineering/SRE with increasing responsibility, including architecture design, operations, and automation.
4+ years in cloud infrastructure management (AWS, Azure, or GCP) with multi-account and multi-environment experience.
4+ years developing and maintaining IaC with Terraform/Ansible at scale.
Direct experience leading at least 1 operational improvement (e.g., reducing toil, enhancing SLAs, improving incident response).
Possess AWS Solutions Architect Professional certification
Demonstrated experience driving at least 1 successful team initiative and serving as the technical SME for a complex initiative.
Advanced Cloud Expertise: Strong hands-on experience with a major public cloud (AWS, Azure, or GCP), including architecture, security, and performance optimization.
IaC Leadership: Deep understanding of Terraform, and modern CI/CD automation practices; capable of reviewing and improving team IaC workflows.
Operational Excellence: Ability to lead troubleshooting of high-impact incidents, implement monitoring solutions, and improve system reliability.
Security-First Mindset: Experienced in aligning engineering solutions with frameworks such as FedRAMP, CIS, and NIST.
Collaboration & Leadership: Proven ability to lead cross-functional projects, mentor team members, and influence stakeholders.
Documentation & Communication: Skilled at creating technical documentation, architecture diagrams, and presenting complex solutions clearly to both technical and non-technical audiences.
Manage and maintain Windows and Linux server environments, including system hardening, GPO configuration, user management, OS-level troubleshooting, and ensuring consistent patching across hybrid environments.
US citizenship (required due to client contractual requirements)

Benefits

paid parental leave
flexible time off
certification and training reimbursement
digital mental health and wellbeing support membership
comprehensive insurance options

Related Categories

DevOps Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

Senior Site Reliability Engineer

Circle

The all-in-one community platform for creators and brands. https://circle.so/

DevOps Engineer55 days ago

Full Time RemoteTeam 51-200Since 2019H1B Sponsor

Company Site LinkedIn

About UsCircle is building the world’s leading all-in-one platform for online communities. We make it possible for creators, coaches, educators, and businesses to bring together their audience with engaging discussions, live streams, events, chat, courses, and payments — all in one place, all under their own brand. We’re proud to be a fully remote company of around 200 (and growing!) team members from 30+ countries around the world. We seek exceptional individuals around the world, set them up to do the best work of their lives, and in turn, create a meaningful impact in their own lives. We don't track hours, but we do manage for high expectations very closely. We collaborate across time zones, are highly async, and like to document a lot. Twice a year, we bring the whole company together in beautiful places around the world for our company offsites. So far, we’ve hosted offsites in Turkey, Portugal, Mexico, Thailand, Colombia, Italy, Ireland, and more, with still more to come! Check out our Careers page for more about working at Circle. About the roleWe're growing fast — and so is the demand on our infrastructure. We're hiring a Senior Site Reliability Engineer to help us scale Circle’s platform with confidence. You'll play a critical role in keeping our systems fast, reliable, and secure as usage climbs. This is a highly impactful, hands-on role for someone who thrives in high-growth environments and wants to build for the long term. What you'll be doing - Act as a first responder for system incidents and outages, helping Circle stay highly available and performant - Own and evolve our monitoring, alerting, and log management systems - Manage and optimize our database infrastructure (including MySQL, Postgres, Clickhouse, and Redis) - Maintain and improve our server infrastructure and deployment pipelines - Collaborate closely with engineering teams to build scalable, resilient systems - Contribute to internal SRE tooling and automation efforts What you'll need to be successful - Strong alignment with our values (find our values on our career page if you haven’t read up on them yet) - You are proficient in English (spoken, written, and reading) at a CEFR Level C2 / ILR Level 5 - Deep expertise with AWS and Kubernetes - 3+ years of experience in a Site Reliability, DevOps, or Infrastructure Engineering role - Experience managing incident response and production system outages - Hands-on experience with database operations and optimization - Familiarity with observability tooling, monitoring, and logging best practices - Based in North or South America (AMER region) — this is a requirement for timezone alignment with our team Bonus points - Experience with SOC2 compliance or building secure infrastructure - You’ve helped scale an early-stage product to 1M+ monthly active users - Experience with Clickhouse or similar technologies $130,000 - $140,000 USD per yearThe cash compensation range shown is a starting point. In addition to equity, benefits and perks, your cash compensation is subject to an annual review and increase on a once per year basis. The fun stuff - Fully remote: work from anywhere in the world! - Autonomy and trust to do your job: we care about outcomes over everything else. - Paid time away: all employees are given 35 days of PTO annually. We also offer a paid sabbatical after 5 years. - Generous U.S. benchmarked compensation and startup equity no matter where you are in the world.* - Awesome medical coverage with 100% coverage for you and your family, or medical reimbursement options where applicable!* - Parental leave for parents expanding their family, or just starting one. - Home office stipend to help you get up and running. - Learning & development stipend to help you level up your professional skills. - Annual bonus potential for roles that don't already receive variable income or commission. - Company retreats: Twice a year, the Circle team gets together for a fully paid company retreat in incredible places around the world! We’ve had past retreats in Colombia, Portugal, and Mexico, with more planned on the horizon. - Check out our Careers page for more. *Your role, location and unique circumstance may affect this. Candidate Safety & Interview Process NoticeAt Circle, the safety and trust of our candidates is extremely important to us. Unfortunately, recruiting fraud is increasingly common in the job market, and bad actors sometimes impersonate companies or employees. To help protect you, here’s what you can expect from our hiring process: How Our Process Works - All applications are submitted through our official applicant tracking system (Greenhouse). - If selected to move forward, you may be invited to record a short introductory video, or directly into a live interview. - All live interviews are conducted face-to-face over video (Zoom or Google Meet) with a member of the Circle team. - We do not conduct text-only interviews. - We do not conduct interviews via chat apps or messaging platforms. - We do not use AI bots to interview candidates. Official Communication Channels - All official communication from Circle will come from no-reply@circle.co, or an email address ending in @circle.co or @circle.so. - We will never contact you from unofficial domains (such as “.team”, “.careers”, or similar variations). - We will never request sensitive personal information early in the process, or ask for payment of any kind. If you receive a suspicious message claiming to be from Circle, please do not respond. We appreciate your vigilance, and look forward to connecting with you through our official channels. Diversity, Equity & InclusionAs a fully-remote international company, diversity is baked into our DNA. Here’s how our CEO, Sid Yadav, frames our hiring mission: “let’s find talent in underserved and under-represented corners of the world, set them up to do the best work of their lives, and in turn, change their life.” To achieve this hiring mission, we offer competitive U.S. benchmarked compensation no matter where someone’s located in the world, and we proactively seek candidates who expand representation of backgrounds, cultures and lived experiences in our teams. Equal Employment OpportunityCircle is an equal opportunity employer and as such, we do not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, or any other characteristic protected by applicable laws. If you require any accommodations during the recruitment process, please let us know and we will work with you to meet your needs. How We Use Candidate DataAt Circle, we are committed to protecting your personal information. As a job applicant, the personal data you provide to us is collected and processed in accordance with the General Data Protection Regulation (GDPR) in the EU and the California Consumer Privacy Act (CCPA). This notice outlines the types of personal information we collect, the purpose for collecting it, and your rights. Information We Collect: We collect the following categories of personal information from job applicants: - Contact information (such as name, email address, phone number) - Employment history and qualifications - Education history - References and any other information you choose to share with us during the application process Purpose of Collection: We collect this information for the following purposes: - To assess your qualifications and suitability for the position - To communicate with you during the recruitment process - To comply with legal and regulatory obligations Your Rights Under GDPR and CCPA: You have the following rights regarding your personal information: - The right to request access to the personal information we hold about you. - The right to request the deletion of your personal information, subject to certain legal exceptions. - The right to opt out of the sale of your personal information (Note: We do not sell personal information). For more information about how we handle your personal data or to exercise your rights, please refer to our full Privacy Policy. By submitting your application, you acknowledge that you have read and understood this privacy notice.

AWS Clickhouse Kubernetes MySQL PostgreSQL Redis

View details: Senior Site Reliability Engineer

United States + 30 more

Apply

DevSecOps Engineer, Azure, Openshift

knowmad mood

growing together

DevOps Engineer55 days ago

Full Time RemoteTeam 1,001-5,000Since 1994H1B No Sponsor

Company Site LinkedIn

• Trabajar en prácticas DevSecOps en un entorno de Azure y Openshift • Gestionar entornos Azure a gran escala y administrando Kubernetes AKS y Openshift • Implementar CI/CD con Azure DevOps • Desarrollar IaC con Terraform

Azure Cloud Kubernetes OpenShift Terraform TypeScript

View details: DevSecOps Engineer, Azure, Openshift

Spain

Apply

Job Closed

Senior Java Engineer

Ciklum

At Ciklum, we are always exploring innovations, empowering each other to achieve more, and engineering solutions that matter. With us, you’ll work with cutting-edge technologies, contribute to impactful projects, and be part of a One Team culture that values collaboration and progress. As one of Ukraine’s largest IT companies and a top employer recognized by Forbes, we’ve spent over 20 years delivering meaningful tech solutions. We proudly support diverse talent and military veterans, recognizing their unique skills and perspectives they bring to shaping the future.

DevOps Engineer55 days ago

Full Time RemoteTeam 1,001-5,000

Ciklum is looking for a Senior Java Engineer to join our team full-time in Ukraine. We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants, analysts and product owners, we engineer technology that redefines industries and shapes the way people live. About the role: As a Senior Java Engineer, become a part of a cross-functional development team engineering experiences of tomorrow. You will be developing and extending a Product Testing Portal for a leading European sportswear manufacturer, augmenting existing team. We are using technology to create solutions that are transforming how products are being digitally created. We are putting collaboration tools in place that ease and speed up product creation and sourcing for thousands of employees. We are harnessing data which enables our employees to make better decisions. We are working in a fast paced, startup alike environment where we truly believe that technology can automate most of the mundane tasks and free up time for creators to do what they really need to do – create. As part of the engineering team your mission is to deliver high quality software at speed. You will be developing and operating our software products, showcasing your creativity and using the latest technologies to bring ideas to life. Responsibilities: - You will bring your ideas to life in a buzzing environment of highly engaged, multinational agile teams, who at their core build game-changing software products. Right there with you - You will focus on elegant Java backend solutions - You will understand the full E2E process along frontend and backend components - Get ready to work with QA & DevOps cracks and collaborate with solution architects, product owners and project managers - You will be a sponge, continuously learning the latest tech from experienced colleagues, conferences, and trainings – always growing and improving - You will follow existing release process to enable developed features in live systems - You will ensure team code is compliant with code quality and standards - You highlight tech debt and ensuring it’s addressed as part of product roadmap Requirements: - College or university degree with focus on IT or equivalent - At least 5 years overall IT experience with more than 3 years in relevant area (ideally with Java - Hands-on experience with microservices (design and development) - Experience with designing and implementing REST APIs would be plus - Eager to look for perfection in your coding through software development best practices - Continuous Integration/Delivery, Test Automation and Everything as Code would be a plus - Motivation to never stop learning in the digital IT space - Strong interpersonal and communication skills. Fluent in English - Languages: Java 8 /11/17/21 - Frameworks: Spring Boot, Spring Cloud - Container knowledge: Kubernetes, Docker - Database: DynamoDB, Elastic Search, Oracle, Mysql, solr - CI/CD: Jenkins - Agile methodologies: Scrum/Kanban - Platform: Kafka What`s in it for you? - Strong community: Work alongside top professionals in a friendly, open-door environment - Growth focus: Take on large-scale projects with a global impact and expand your expertise - Tailored learning: Boost your skills with internal events (meetups, conferences, workshops), Udemy access, language courses, and company-paid certifications - Endless opportunities: Explore diverse domains through internal mobility, finding the best fit to gain hands-on experience with cutting-edge technologies - Flexibility: Enjoy radical flexibility – work remotely or from an office, your choice - Care: We’ve got you covered with company-paid medical insurance, mental health support, and financial & legal consultations About us: At Ciklum, we are always exploring innovations, empowering each other to achieve more, and engineering solutions that matter. With us, you’ll work with cutting-edge technologies, contribute to impactful projects, and be part of a One Team culture that values collaboration and progress. As one of Ukraine’s largest IT companies and a top employer recognized by Forbes, we’ve spent over 20 years delivering meaningful tech solutions. We proudly support diverse talent and military veterans, recognizing their unique skills and perspectives they bring to shaping the future. Want to learn more about us? Follow us on Instagram, Facebook, LinkedIn. Explore, empower, engineer with Ciklum! Interested already? We would love to get to know you! Submit your application. We can’t wait to see you at Ciklum.

View details: Senior Java Engineer

Ukraine

Apply

Job Closed

Senior Site Reliability Engineer - Ireland

Arista Networks

Data-Driven Networking

DevOps Engineer55 days ago

Full Time RemoteTeam 1,001-5,000Since 2004H1B Sponsor

Company Site LinkedIn

Company Description Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. Arista is a well-established and profitable company with over $8 billion in revenue. Arista’s award-winning platforms, ranging in Ethernet speeds up to 800G bits per second, redefine scalability, agility, and resilience. Arista is a founding member of the Ultra Ethernet consortium. We have shipped over 20 million cloud networking ports worldwide with CloudVision and EOS, an advanced network operating system. Arista is committed to open standards, and its products are available worldwide directly and through partners. At Arista, we value the diversity of thought and perspectives each employee brings. We believe fostering an inclusive environment where individuals from various backgrounds and experiences feel welcome is essential for driving creativity and innovation. Our commitment to excellence has earned us several prestigious awards, such as the Great Place to Work Survey for Best Engineering Team and Best Company for Diversity, Compensation, and Work-Life Balance. At Arista, we take pride in our track record of success and strive to maintain the highest quality and performance standards in everything we do. Job Description Who You'll Work For We are seeking an experienced and analytically-minded Site Reliability Engineer to join our organisation on a permanent, remote basis from Ireland. In this role, you will be instrumental in building, deploying, and operating critical production systems with a steadfast commitment to scalability, reliability, observability, and security. You will work collaboratively with cross-functional teams to ensure our infrastructure remains resilient, efficient, and future-ready. This is an excellent opportunity for a detail-oriented professional who thrives in a dynamic environment and is passionate about solving complex infrastructure challenges. What You'll Do - Design, build, and deploy production systems with a focus on scalability, reliability, observability, and performance, ensuring systems meet stringent security standards - Develop and maintain comprehensive automation solutions to eliminate toil and streamline operational efficiency across production environments - Proactively monitor production systems, establish intelligent alerting strategies, and implement automated incident response mechanisms to minimise downtime - Create and maintain detailed incident response runbooks; conduct thorough postmortem analyses following incidents to identify root causes and prevent recurrence - Collaborate with software engineering teams to identify and resolve infrastructural bottlenecks, designing innovative solutions that enhance product deployment workflows - Manage and optimise monitoring infrastructure using industry-standard tools, ensuring comprehensive visibility across all systems - Plan, communicate, and execute maintenance windows on production systems with minimal disruption to service availability - Triage platform and infrastructural issues with decisiveness and analytical rigour; engage with third-party vendors and support teams as required - Deploy new systems and updates in a staged, risk-managed manner, ensuring safe and incremental rollouts - Survey and adopt best practices in infrastructure and platform management to maintain secure, scalable, and fault-tolerant systems - Study the design and implementation details of open-source systems to enhance troubleshooting capabilities and accelerate issue resolution - Work transparently with stakeholders to communicate system status, planned maintenance, and infrastructure improvements #LI-EO1 #automation #Ansible #Terraform #observability #Prometheus #Grafana #cloud platforms #AWS #GCP #Azure #container #orchestration #Kubernetes #Docker #CI/CD #Jenkins #GitLab Qualifications **Essential Requirements:** - Bachelor's degree in Computer Science, Engineering, or equivalent professional experience (5+ years in a related infrastructure or systems role) - Proficiency in one or more programming languages: Go, Python, or bash shell scripting, with the ability to implement medium-complexity automation workflows - Strong knowledge of Linux or UNIX from both administration and debugging perspectives - Hands-on experience operating software systems, infrastructure, and complex applications at scale in production environments - Demonstrated expertise in infrastructure-as-code principles and practices - Strong problem-solving and software troubleshooting skills with a methodical, analytical approach - Experience with server provisioning, particularly from storage and networking perspectives - Proven ability to work collaboratively within cross-functional teams and communicate technical concepts clearly - Experience with incident response, postmortem analysis, and continuous improvement methodologies **Desirable Skills and Experience:** - Experience with container orchestration platforms, particularly Kubernetes - Hands-on experience with Docker and virtualisation technologies - Proficiency in managing monitoring stacks, including Prometheus and Grafana - Experience with CI/CD systems such as GitLab tools or Spinnaker - Knowledge of infrastructure-as-code frameworks, particularly Terraform - Experience managing databases such as PostgreSQL or equivalent relational database management systems - Experience with artifact repositories and Docker registries - Familiarity with cloud platforms (Google Cloud Platform, Amazon Web Services, or Microsoft Azure) - Understanding of distributed systems architecture and principles - Experience with performance tuning and system optimisation - Knowledge of security best practices in infrastructure and systems design - On-call support experience and comfort with incident response responsibilities Additional Information Arista stands out as an engineering-centric company. Our leadership, including founders and engineering managers, are all engineers who understand sound software engineering principles and the importance of doing things right. We hire globally into our diverse team. At Arista, engineers have complete ownership of their projects. Our management structure is flat and streamlined, and software engineering is led by those who understand it best. We prioritize the development and utilization of test automation tools. Our engineers have access to every part of the company, providing opportunities to work across various domains. Arista is headquartered in Santa Clara, California, with development offices in Australia, Canada, India, Ireland, and the US. We consider all our R&D centers equal in stature. Join us to shape the future of networking and be part of a culture that values invention, quality, respect, and fun.

View details: Senior Site Reliability Engineer - Ireland

Ireland

Apply

Senior Site Reliability Engineer

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior Site Reliability Engineer

DevSecOps Engineer, Azure, Openshift

Senior Java Engineer

Senior Site Reliability Engineer - Ireland