SEON logo
SEON

The command center for fraud prevention and AML compliance that enriches data, provides context and directs action.

Senior Site Reliability Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 201-500H1B No SponsorCompany SiteLinkedIn

Location

Hungary

Posted

71 days ago

Salary

0

Seniority

Senior

Job Description

Senior Site Reliability Engineer

SEON

• Ensure the reliability, availability, and performance of our systems by implementing SRE best practices • Develop and maintain comprehensive monitoring and alerting systems using tools such as Prometheus, Grafana, ELK stack, etc. • Manage incident response and root cause analysis for production issues • Conduct postmortems to learn from failures and drive continuous improvement in the system’s reliability • Continuously monitor and optimize the performance of cloud infrastructure to ensure efficient resource utilization and cost-effectiveness • Automate routine tasks and processes to reduce manual intervention and increase efficiency • Analyze current system capacity and plan for future growth to ensure the infrastructure can scale with increasing demands • Define, measure, and monitor SLOs and SLIs to ensure that services meet their reliability targets • Work closely with engineering, and product teams to provide feedback and suggestions on new architectures, ensuring they meet reliability and performance standards • Develop and maintain comprehensive documentation for architecture, infrastructure, and troubleshooting processes. • Provide on-call support to ensure the continuous availability of our applications and infrastructure • Ensure that systems meet security and compliance requirements, performing regular audits and assessments based on the internal security team’s guidelines • Stay current with new technologies and industry trends, evaluating their potential impact on our infrastructure and reliability practices

Job Requirements

  • 6+ years of experience as a SRE, DevOps or in a similar engineering role, with a focus on reliability principles and practices
  • Strong hands-on experience working with Kubernetes (AWS EKS preferred)
  • Strong hands-on expertise in Terraform
  • Extensive experience working in multi-region and multi-account AWS setup
  • Strong experience with monitoring and logging tools such as Prometheus, Grafana, Elasticsearch, and Kibana.
  • Strong experience deploying, maintaining and troubleshooting scalable distributed components in microservice-based architecture
  • Experience researching, troubleshooting and improving customer critical requests related to latency, availability and performance issues
  • Ability to quickly troubleshoot complex issues related to infrastructure
  • Proficiency with incident management tools such as PagerDuty, Opsgenie, etc.
  • Familiarity with CI pipelines and tools (Github Actions preferred)
  • Experience working with GitOps practices and CD tools (ArgoCD preferred)
  • A proactive approach to identifying and resolving issues independently with a strong problem-solving attitude
  • Excellent communication and collaboration skills to work effectively with cross-functional teams

Benefits

  • Flexible work arrangements

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Skydio logo

Deployment Engineer

Skydio

Skydio is a robotics company that is known for its advanced and innovative, autonomous drone technology. As an employer, the company strives to foster a mission

DevOps Engineer71 days ago

• Work closely with internal teams to become an expert on Skydio’s Deployment products, processes, specifications, and product roadmap • Deploy and ensure our cloud connected devices software and hardware are functioning and providing value as agreed in our customer’s ecosystem after it has been installed, configured, tested, and modified as per requirements. • Communicate with customers to ensure that all of their needs are understood and addressed • Collaborate with various internal departments to ensure that they fulfill all customer requests • Resolve complaints and keeping track of all processes that pertain to the client’s needs • Act as the customer’s representative to ensure that their demands are met with a focus on improving the customer experience • Track and manage all implementation projects with our large enterprise customers for successful delivery of technology and services. • Develop and maintain deployment and installation documentation, and documenting Standard Operating Procedures for the customer to ensure proper usage and value out of deployment. Quantify product feedback and briefing executives to drive software and hardware engineering to better fit our customers needs • Build customer loyalty through proactive support and account management • Build scalable processes for installation of cloud connected devices on Enterprise grade secure networks

Arizona + 6 moreAll locations: Arizona | California | Colorado | Nevada | New Mexico | Oregon | Washington
$115K - $135K / year
Job Closed
TechTorch logo

DevOps Multicloud

TechTorch

Accelerate your Services delivery with AI

DevOps Engineer71 days ago
Full TimeRemoteTeam 51-200Since 2021H1B No Sponsor

• Design and implement cloud infrastructure primarily on AWS, including networking, security, runtime, data, and logging layers • Support and integrate with Azure and GCP environments where part of broader enterprise architectures • Build and maintain CI/CD pipelines for: • Application deployments (frontend and backend) • Serverless and container-based services • Infrastructure-as-Code promotion across environments using Terraform • Define environment strategy (dev / stage / prod), release workflows, and deployment safety mechanisms (rollback, blue/green) • Implement security and compliance guardrails (encryption, secrets management, IAM, audit logging, WAF/edge security) • Own observability and SLO readiness (monitoring, alerting, tracing, log aggregation, synthetic checks) • Support connectivity and operational patterns across enterprise systems, APIs, and third-party integrations • Partner with architects and engineers on capacity planning, performance optimization, cost control, and incident response

Poland
IT.HR | Recruitment Agency logo

Senior DevOps Engineer

IT.HR | Recruitment Agency

IT Recruitment | Permanent Recruitment | SAP Recruitment | IT Team Recruitment | Express IT Recruitment | IT Sourcing

DevOps Engineer71 days ago
Full TimeRemoteTeam 11-50Since 2009H1B No Sponsor

• Design, build, and maintain scalable and secure cloud infrastructure for AI and Big Data solutions, ensuring high availability and performance in a production environment. • Implement end-to-end automation for the complete Machine Learning (ML) lifecycle, including infrastructure for model training, real-time inference, and data processing. • Develop, manage, and optimise Infrastructure as Code (IaC) using tools like Terraform to provision and configure cloud resources consistently and efficiently. • Create, manage, and improve robust Continuous Integration and Continuous Delivery (CI/CD) pipelines to enable rapid and safe deployment of code, models, and infrastructure changes. • Manage and optimise Kubernetes clusters (e.g., AKS, EKS, or GKE) specifically tailored for running containerised data processing frameworks and GPU-enabled ML workloads at scale. • Establish advanced monitoring, logging, and alerting systems to ensure the health, performance, and cost efficiency of the data and AI platforms. • Implement security measures, access controls, and compliance frameworks to ensure data privacy, model governance, and adherence to security best practices across all environments. • Work closely with Data Scientists, AI Engineers, and Software Development teams to translate complex AI requirements into robust, deployable infrastructure solutions.

Poland
IT.HR | Recruitment Agency logo

Senior DevOps Engineer – Azure, Kubernetes

IT.HR | Recruitment Agency

IT Recruitment | Permanent Recruitment | SAP Recruitment | IT Team Recruitment | Express IT Recruitment | IT Sourcing

DevOps Engineer71 days ago
Full TimeRemoteTeam 11-50Since 2009H1B No Sponsor

• Design and implement highly available, scalable, and secure cloud infrastructure architectures utilising Microsoft Azure services, including AKS, VNETs, Private Endpoints, Azure Firewall, Key Vault, and Azure Application Gateway. • Develop and maintain Infrastructure as Code using tools like Terraform or ARM Templates to provision and manage cloud resources in a repeatable and consistent manner. • Design, implement, and optimise robust CI/CD pipelines (e.g., using Azure DevOps, GitHub Actions, or GitLab CI) for both application deployment and infrastructure changes. • Administer, secure, and optimise Kubernetes clusters (preferably AKS), focusing on networking, scaling, security hardening, and performance tuning. • Implement comprehensive monitoring, logging, and alerting solutions using tools like Prometheus, Grafana, and Azure Monitor/Log Analytics to maintain platform health and identify issues proactively. • Enforce security best practices, implement policies (e.g., Azure Policies), and ensure all deployments comply with industry standards and internal IT security requirements. • Work closely with development, security, and product teams to define requirements, troubleshoot complex production issues, and mentor junior team members on DevOps methodologies and tools.

Poland