Site Reliability Engineer – III
Location
Greece
Posted
13 days ago
Salary
0
Seniority
Senior
Job Description
Site Reliability Engineer – III
Proofpoint
• Deploy, manage, and scale distributed platforms across multiple geographic regions • Design and maintain Kubernetes-based infrastructure for large-scale applications • Build and manage Helm charts for efficient and repeatable deployments • Monitor system health using Grafana dashboards and metrics; proactively identify and resolve issues • Improve system reliability, performance, and scalability through automation and best practices • Handle large-scale deployments and improve infrastructure for growth • Collaborate with development teams to ensure smooth CI/CD and production readiness • Implement observability, alerting, and incident response processes • Troubleshoot production issues and perform root cause analysis • Write and maintain run books for incident response
Job Requirements
- 4–5 years of experience in Site Reliability Engineering, DevOps, or similar roles
- Hands-on experience with Kubernetes in production environments
- Proven experience with infrastructure as code (Terraform, git...)
- Experience with AWS (eks, vpc, s3, ecr, iam role etc)
- Solid experience with Helm charts for application deployment
- Bash scripting and tools automation experience
- Experience with large-scale distributed systems and high-availability architectures
- Strong understanding of containerization, micro-services, and cloud-native ecosystems
- Experience with CI/CD pipelines and automation tools
- Good debugging and problem-solving skills in production environments
Benefits
- Competitive compensation
- Comprehensive benefits
- Career success on your terms
- Flexible work environment
- Annual wellness and community outreach days
- Always on recognition for your contributions
- Global collaboration and networking opportunities
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Staff Engineer – DevOps
DispelMoving Target Defense-based remote access systems for people and machines.
• Security for critical industries connecting and operating securely. • Delivering secure, scalable connectivity and remote access. • Unified applications and systems to streamline operations.
Staff SRE Engineer
Stellar CyberEmpowering lean security operations teams of any skill to successfully secure their environments. WE ARE HIRING!
• Administer and maintain container orchestration platforms and containerized workloads. • Monitor and troubleshoot production systems, participating in on-call rotations to ensure reliability. • Drive observability improvements by enhancing monitoring, logging, and alerting capabilities across systems and data platforms. • Administer and optimize cloud-based environments across multiple providers. • Manage and support distributed data platforms and real-time processing systems. • Develop and maintain continuous integration and delivery pipelines for efficient and reliable deployments. • Own and implement Infrastructure as Code (IaC) practices to ensure consistency and scalability. • Automate and orchestrate infrastructure using programming and scripting languages. • Perform system administration and networking tasks to support internal and external environments. • Collaborate effectively with engineers and stakeholders across different time zones.
• Design, implement, and maintain robust and scalable automation solutions for physical and virtual server infrastructure provisioning using tools like Ansible, Python, or similar • Develop and maintain automation scripts and workflows leveraging various APIs (e.g., cloud provider APIs, virtualization platform APIs, hardware management APIs, internally developed APIs) • Architect, implement, and manage our Kubernetes infrastructure for deploying, scaling, and maintaining containerized applications • Develop and maintain CI/CD pipelines to automate the build, test, and deployment of applications to Kubernetes • Implement and maintain infrastructure monitoring, alerting, and logging solutions to ensure system health and performance • Collaborate closely with development teams to integrate automation and DevOps practices into their workflows • Drive the adoption of DevOps best practices, including infrastructure as code, continuous integration, continuous delivery, and automated testing • Troubleshoot and resolve infrastructure and deployment-related issues in a timely and efficient manner • Contribute to the development and maintenance of internal tooling and documentation • Mentor and guide junior members of the team • Stay up-to-date with the latest trends and technologies in DevOps, automation, and cloud computing • Participate in on-call rotations as needed to ensure the availability of critical systems.
Senior Engineering Manager – DevOps, Infrastructure, Release Engineering
Abacus InsightsImproving people’s lives by harnessing the healthcare data explosion through an intelligent data integration platform.
• Lead, coach, and grow DevOps, Infrastructure, and Release Engineering teams across geographies • Establish clear ownership and accountability across CI/CD, environments, and production readiness • Conduct regular 1:1s, performance reviews, goal setting, and career development planning • Hire, onboard, and retain senior DevOps and release engineering talent • Foster a culture of operational ownership, blameless postmortems, and high availability • Own the release engineering strategy for a SaaS platform supporting frequent, reliable deployments • Design and continuously improve CI/CD pipelines that support: • Trunk-based or hybrid branching strategies • Automated testing gates (unit, integration, security, performance) • Progressive delivery (feature flags, canaries, blue-green deployments) • Establish deployment standards, change classification, and risk levels across teams • Reduce release toil, manual steps, and human error through automation • Drive improvements in deployment frequency, change failure rate, and mean time to recovery (DORA metrics) • Ensure consistent release processes across services, platforms, and cloud environment • Provide technical leadership across AWS, Azure, and GCP environments • Ensure infrastructure and delivery pipelines follow security, compliance, and cloud best practices • Partner with Architecture and Security to enforce: • Infrastructure as Code (Terraform, CloudFormation, ARM/Bicep) • Secure-by-default deployments • Policy-as-code and guardrails • Influence architecture decisions to improve operability, scalability, and release safety • Own production readiness standards and release go/no-go criteria • Ensure clear incident response processes, escalation paths, and on-call readiness • Partner deeply with Product, QE, Security, and Engineering leaders to align on release expectations




