Soum

Reimagining recommerce in the MENA region and beyond

DevOps Engineer, Mid Level

Location

Egypt

Posted

6 days ago

Salary

0

Seniority

Mid Level

Professional Certificate · 2 yrs exp · Experience accepted · English · AWS · Cloud · Docker · Firewalls · Grafana · Kubernetes · Linux · Prometheus · Terraform

Job Description

  • Take responsibility for the scalability, stability, and availability of our low-latency, mission-critical systems
  • Enhance the CI/CD pipeline using GitHub Actions
  • Generate and maintain Helm manifests for Kubernetes
  • Maintain our infrastructure as code (IaC)
  • Set up and maintain monitoring with New Relic, Prometheus & Grafana
  • Debug infrastructure and detect bottlenecks
  • Take responsibility for the company-wide tech infrastructure across all environments
  • Manage VPN, load balancers, and firewalls
  • Ensure high availability and disaster recovery
  • Implement secure and stable infrastructure
  • Drive continuous improvement and cost optimization
  • Provide 24x7 monitoring, resolve infrastructure-tier incidents, and escalate to upper tiers where necessary
  • Investigate incidents and write clear incident reports

Job Requirements

  • Excellent understanding of **AWS cloud**
  • Knowledge of containerized environments with **Kubernetes** & **Docker**
  • Ability to develop well-managed infrastructure setups
  • Deep knowledge of **Linux** internals, networking, routing & protocols
  • Strong hands-on experience building high-availability infrastructure using EKS on AWS
  • Strong understanding of IaC, specifically Terraform
  • Strong understanding of **GitOps**, specifically **ArgoCD**
  • Strong understanding of securing production infrastructure
  • Minimum **2 years** of **DevOps experience** in production applications
  • At least **1 year** working with **Terraform**
  • At least **1 year** with monitoring tools (**New Relic, Prometheus, Grafana, etc.**)
  • Experience designing and architecting high-performing applications
  • Background in Agile development practices
  • Experience in a fast-growing startup (preferred)
  • AWS certifications (preferred)

More DevOps Engineer Jobs

Full Time · Remote · Team 201-500 · H1B No Sponsor

  • Architect, implement, and optimize cloud-based solutions using industry best practices
  • Provide mentorship and technical guidance to junior engineers
  • Drive continuous improvement and innovation across DevOps practices
  • Collaborate with cross-functional teams to design and deploy robust, automated systems
  • Support mission-critical applications and services through scalable infrastructure solutions
  • Monitor and maintain the health, performance, and reliability of cloud environments
  • Proactively identify and resolve issues to minimize downtime and service disruptions
  • Participate in the on-call rotation and incident response activities

United States
$150.3K - $225.4K / year

Reliability Engineer

General Dynamics

A business unit of General Dynamics, General Dynamics Information Technology (GDIT) supports some of the United States' most complex government, defense, and in

DevOps Engineer · 6 days ago

  • Design, build, and maintain scalable, reliable infrastructure and services that support Hosting Services, Site Reliability Engineering, virtualization, and data center operations for a federal customer.
  • Collaborate closely with software developers, infrastructure engineers, and IT operations teams to plan and execute deployments, improve system architectures, and enhance service reliability.
  • Use automation and scripting (e.g., Python, Bash) to reduce manual work, streamline deployments, and improve consistency across environments.
  • Monitor system performance, availability, and capacity using modern tooling; proactively identify issues and participate in on-call support to restore services quickly when incidents occur.
  • Implement and support Continuous Integration/Continuous Delivery (CI/CD) pipelines using tools such as Jenkins, Git, and Terraform to enable reliable and repeatable releases.
  • Leverage containerization and orchestration technologies such as Docker and Kubernetes to build resilient, scalable platforms.
  • Work with databases (e.g., SQL, MySQL) and application stacks (e.g., Java-based services) to ensure data integrity, performance, and fault tolerance.
  • Partner with cross-functional teams, using Jira and other collaboration tools, to track work, communicate status, and drive continuous improvement in reliability and operational excellence.
  • Contribute to a culture of teamwork and collaboration by sharing knowledge, participating in post-incident reviews, and helping define best practices for reliability engineering.

United States
$111.2K - $150.4K / year

Senior Site Reliability Engineer

Syniti

There's nothing status quo about your data. It's time to put Data First.

DevOps Engineer · 6 days ago
Full Time · Remote · Team 1,001-5,000 · H1B Sponsor

Role Description

The Senior SRE – Applications will design and implement automation, CI/CD tooling, and resilient infrastructure for Syniti's cloud-hosted SaaS platform (SKP Northstar). This role sits within the Application SRE team, supporting 79+ microservices across data migration, quality, matching, orchestration, and catalog product lines. The engineer will lead Azure environment buildout while maintaining and improving existing AWS infrastructure, supporting Northstar's global compliance and scalability objectives including FedRAMP High, Cyber Essentials+, and SecNumCloud adherence. The role requires close collaboration with development, infosec, platform SRE, and compliance teams to enforce Zero Trust patterns and observability across all layers of the stack.

What You Will Do

  • Design and build automated cloud infrastructure for Syniti-hosted SaaS workloads across Azure and AWS.
  • Implement and manage CI/CD pipelines using GitHub Actions and ArgoCD GitOps workflows for multi-environment deployments (dev, preprod, prod, GovCloud).
  • Develop and maintain Terraform modules for infrastructure provisioning, including EKS/AKS clusters, Aurora PostgreSQL, Redis, OpenSearch, and S3/Azure Storage.
  • Integrate and maintain observability frameworks (Prometheus, Grafana, Loki, Mimir, Jaeger) and enforce structured logging of application, audit, and security events.
  • Support and extend Istio service mesh configurations including mTLS policies, AuthorizationPolicies, and SPIFFE-based workload identity.
  • Implement and maintain supply chain security controls: container image signing (Cosign), SBOM generation (Syft), provenance attestation (SLSA), and vulnerability scanning (Trivy, Inspector, Snyk).
  • Collaborate with security teams to meet FedRAMP High, Cyber Essentials+, NIST 800-53, and SecNumCloud control objectives across Azure and AWS environments.
  • Provide L3 incident response and lead root cause analysis efforts for application-tier outages across the Northstar platform.
  • Support the automated compliance gate (11 controls: NAC, FIPS, STIG, IRSA, SAST, DAST, SBOM, Sign, Vuln, Audit, Auth) and ensure application services pass all gates.
  • Manage Kubernetes workload autoscaling (Karpenter, KEDA, HPA) and pod security policies (Kyverno) for production services.

Qualifications

  • Strong communication skills and ability to operate in a globally distributed team.
  • Ability to work across development, compliance, platform SRE, and infrastructure teams.
  • Self-starter with strong service ownership and ability to handle on-call escalations autonomously.
  • Commitment to continuous improvement and infrastructure-as-code principles.
  • Solid documentation skills for runbooks, architecture decisions, and operational procedures.

Requirements

  • 10+ years of experience in SRE, DevOps, or Cloud Engineering.
  • 5+ years of hands-on experience with Microsoft Azure including AKS, Storage, Monitor, and IAM/Entra ID.
  • 3+ years of hands-on experience with AWS (EKS, RDS/Aurora, IAM, S3, SQS, Cognito).
  • 3+ years of Terraform module development for cloud infrastructure provisioning.
  • 3+ years in CI/CD tooling (GitHub Actions, ArgoCD, or equivalent GitOps platforms).
  • Strong Kubernetes operations experience: cluster management, pod security, autoscaling, troubleshooting.
  • Experience with service mesh technologies (Istio, Envoy, or Linkerd) and workload identity (SPIFFE/SPIRE, IRSA, or workload identity federation).
  • Proficiency in scripting languages: Python, Bash, or PowerShell. Go or .NET experience a plus.
  • Experience implementing regional compliance controls (FedRAMP, SOC 2, Cyber Essentials+, or equivalent).
  • Understanding of Zero Trust principles, mTLS, service identity, and network segmentation.
  • Experience with observability stacks (Prometheus, Grafana, Loki, or equivalent) and distributed tracing.
  • Familiarity with container supply chain security (image signing, SBOM, vulnerability scanning) is a plus.

Benefits

  • Trust in your talent. At Syniti you will find a supportive environment and access to learning tools, but micromanagement is not our style.
  • Growth. We are growing rapidly and steadily solving the biggest challenges enterprise companies are faced with today. There was never a better time to join and grow with us. Most importantly you will have the chance to shape our journey and share in our success story.
  • Support. We all rely on each other and enable each other to be successful. You won’t stand alone.
  • Curiosity and genuine interest in you. We all have our different stories, all equally fascinating with each depicting a different journey and we want to hear them all.
  • Recognition. We are the sum of individual achievements, and we always take the time to celebrate them.
  • An open organisation. Titles don’t define access at Syniti. We stay humble regardless of where we sit in the organisation. We want to hear every voice, listen to all the ideas and make sure everyone’s work is seen and valued.

Company Description

At Syniti, we’re committed to creating a respectful, inclusive, and fair workplace where everyone belongs and thrives. We believe that diverse perspectives make us stronger, and we value the unique backgrounds, experiences, and voices each person brings to our team. We welcome applicants based on their skills and potential, and we’re dedicated to ensuring equal opportunities for all, regardless of personal background. If you need accommodations during the hiring process, please let us know; we’re here to support you.

United States

SRE Analyst - Junior/Semi-Senior

Accenture

Founded in 1989, Accenture is an outsourcing, management consulting, and technology services company with an international presence and annual revenues exceeding $20 billion. Accen

DevOps Engineer · 6 days ago

Role Description

We are looking for a Junior/Mid-Level SRE Analyst to work on strategic projects, contributing to the development of robust applications that are integrated with cloud environments and focused on quality and performance.

Qualifications

  • Google Site Reliability Engineering (SRE) practices and principles, with emphasis on SLOs, SLIs, error budgets, blameless investigation, and toil reduction
  • Experience with customer journey mapping
  • Coding, including an understanding of Python and Java
  • Operating systems, mainly Linux
  • Working knowledge of at least one monitoring tool: Zabbix, Nagios, HP Operations, Prometheus/Thanos, Grafana Labs, ELK Stack, or the open-source telemetry standard (OpenTelemetry)
  • Distributed computing and microservices concepts
  • Observability metrics: the four golden signals (latency, traffic, error rate, and saturation)
  • Investigation of significant incidents

Requirements

  • Desirable: container orchestration with Kubernetes, Red Hat OpenShift, or Rancher
  • Agile methods
  • DevSecOps practices

Benefits

  • Medical insurance, 100% subsidized by the company (employee and dependents)
  • Dental insurance
  • Meal or grocery voucher with no payroll deduction
  • Life insurance
  • Private pension plan
  • Gympass
  • Option to purchase company stock at a discount
  • Pharmacy discount
  • Childcare allowance
  • Partnership with a language school
  • Extended maternity and paternity leave
  • Profit sharing (PPR)

*In accordance with current policy.

Brazil