Fortyx logo
Fortyx

AI powered cybersecurity assistant, combining threat prevention and cyber awareness training in real time.

Site Reliability Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 1-10H1B No SponsorCompany SiteLinkedIn

Location

United Kingdom

Posted

71 days ago

Salary

0

Seniority

Senior

Bachelor DegreeEnglishAWSCloudEC2PythonTerraform

Job Description

Site Reliability Engineer

Fortyx

• Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform • Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and scalability of infrastructure components • Develop and maintain monitoring solutions to proactively identify performance bottlenecks, system outages, and other potential issues • Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents • Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance tuning, and capacity planning • Identify opportunities to automate manual processes and improve system resilience • Utilise Python or Bash scripting to create and maintain automation tools for various operational tasks and deployments • Implement and improve continuous integration and continuous deployment (CI/CD) pipelines • Collaborate with security teams to implement best practices for securing cloud infrastructure and services • Ensure compliance with relevant industry standards and regulations • Support CI/CD pipelines for application deployments and updates • Contribute to the design and implementation of deployment strategies that promote zero-downtime releases • Maintain clear and up-to-date documentation for infrastructure configurations, processes, and incident resolution procedures • Participate in knowledge sharing with team members to enhance overall expertise and skill sets

Job Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
  • Proven experience as a Site Reliability Engineer or similar role
  • Extensive experience with Amazon Web Services (AWS) and its core services (EC2, S3, RDS, IAM, etc.)
  • Strong proficiency in infrastructure-as-code (IaC) tools, with a focus on Terraform
  • Proficient in scripting with Python or Bash for automation and operational tasks
  • Solid understanding of networking principles and protocols
  • Knowledge of CI/CD pipelines and related tools

Benefits

  • equity-only position
  • opportunity to gain a stake in a rapidly growing company
  • contribute directly to its success

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Tempo Software logo

Manager, Site Reliability Engineer

Tempo Software

Adaptive SPM for AI-Accelerated Innovation | Modular Solutions, Compounding Value | 30,000+ Customers

DevOps Engineer71 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

• Lead, mentor, and grow a team of Site Reliability Engineers, focusing on career development, performance management, and hiring. • Define the team's roadmap and strategy for platform reliability, scaling, and operational efficiency. • Provide technical oversight and direction for the design and implementation of key infrastructure projects, including CI/CD pipelines and automation for build, release, and deployment processes. • Partner closely with engineering teams and product managers to ensure the reliability and performance requirements of new products and features are met. • Oversee the maintenance and continuous improvement of the AWS-based platform to ensure it scales effectively. • Drive the adoption of AI tooling to enhance SRE productivity and introduce intelligent automation of SRE processes. • Champion SRE best practices, including error budget management, effective on-call rotations, incident response, and post-mortem processes.

Canada
Canonical logo

Site Reliability Engineer

Canonical

Ubuntu is a community-developed, Linux-based operating system that is published and commercially supported by software development firm Canonical. Like Canonica

DevOps Engineer71 days ago

• We deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices. • To become a member of our team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from bare metal to containers, and you need the ability to work in operations with mission-critical services for global brand-name customers. • As a member of the team, you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure.

Worldwide
Canonical logo

Senior Site Reliability Engineer

Canonical

Ubuntu is a community-developed, Linux-based operating system that is published and commercially supported by software development firm Canonical. Like Canonica

DevOps Engineer71 days ago

• Bring Python software-engineering skills and rigour to the operations domain • Practise devsecops from bare metal to application • Architect and run OpenStack, Kubernetes and software-defined storage • Enable devsecops for applications running on that infrastructure • Gain experience in a broad range of cloud technologies

Worldwide
Canonical logo

Site Reliability Engineer, GitOps

Canonical

Ubuntu is a community-developed, Linux-based operating system that is published and commercially supported by software development firm Canonical. Like Canonica

DevOps Engineer71 days ago

• Apply your experience of IaC to develop infrastructure as code practice within IS by constantly increasing automation and improving IaC processes • Automate software operations for re-usability and consistency across private and public clouds, taking into consideration the complexities of distributed systems • Develop new features and improve the resilience and scalability of the existing cloud and container portfolio at Canonical • Maintain operational responsibility for all of Canonical’s core services, networks, and infrastructure • Develop skills in troubleshooting, capacity planning, and performance investigation, Setting up, maintaining and using observability tools such as Prometheus, Grafana, and Elasticsearch; design, implement and maintain monitoring and alerting for various systems and services • Collaborate with development teams to design service architecture, documentation, playbooks, policies and operational procedures • Provide assistance and work with globally distributed engineering, operations, and support peers • Be given uninterrupted development time to focus on larger projects and automation of manual tasks • Share your experience, know-how and best practices with other team members in design sessions, mentorship and ‘doing work together’ • Carry final responsibility for time-critical escalations.

Worldwide