Job Closed
This listing is no longer active.
Where enterprise AI runs and outcomes scale
Cloud & DevOps Engineer
Location
Egypt
Posted
91 days ago
Salary
0
Seniority
Senior
Job Description
Cloud & DevOps Engineer
Rackspace Technology
• Deploy and maintain highly available AWS environments (EC2, ECS, EKS) using Terraform to ensure infrastructure is versioned and reproducible • Build and manage automated pipelines in GitLab CI or Jenkins, and use Ansible to eliminate manual configuration and drift • Manage the full lifecycle of containerized applications using Docker and Kubernetes • Perform administration and automated deployments across Linux and Windows environments using Python or Bash • Design, secure, and troubleshoot cloud networking components including VPCs, Subnets, Route53, and Load Balancers • Implement and support infrastructure for AI/ML workloads and utilize AI-powered tools to enhance system monitoring and automated incident response
Job Requirements
- 4–6 years in Cloud and DevOps Engineering roles
- Hands-on expertise with AWS (specifically EKS, EC2, RDS, S3, and Secrets Manager)
- Proven ability to manage production stacks using Terraform
- Deep understanding of VPC peering, DNS, Load Balancing, and network security protocols
- Proficiency in Docker and Kubernetes (running and troubleshooting clusters)
- Solid experience with GitLab CI / Jenkins and Ansible
- Practical experience or strong knowledge in supporting AI infrastructure (e.g., GPU instances, model deployment pipelines)
Benefits
- Health insurance
- Retirement plans
- Paid time off
- Flexible work arrangements
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Design infrastructure, networking, and software platform architecture. • Define platform guidelines, requirements and processes while considering DevOps methodology. • Build and maintain: infrastructure automation using Infrastructure as Code tools; auditable delivery of infrastructure definition and changes; automation of Continuous Integration and Continuous Deployment pipelines; Developer Experience and Productivity initiatives service catalogs and service maturity; the application platform used by all engineering teams; multiple Kubernetes clusters. • Design, develop and maintain core systems using common programming languages. • Build and maintain internal tooling used by all engineering teams. • Troubleshoot infrastructure, internal applications, networking, and security issues. • Build and maintain an observability platform, guidelines, and standards. • Define the internal platform SLI/SLO/SLAs. • Manage backup policies and operation. • Maintain the fleet of databases, including upgrades, security patches, performance analysis, optimizations and troubleshooting. • Conduct security risk assessments, vulnerability scans, VPNs, tests. • Utilize tools including Linux; Python, Go, JavaScript, Shell script.
Responsibilities *This position is 100 percent Remote* The primary responsibilities of a DevSecOps Specialist include: - CI/CD Pipeline Management: Selecting, deploying, and maintaining Continuous Integration/Continuous Deployment (CI/CD) tools and processes. - Software Maintenance: Ensuring the deployed software product is maintained throughout its lifecycle. - Security Integration: Embedding security practices into the development and deployment processes. - Observability: Implementing monitoring and logging to ensure the software’s performance and security can be observed and analyzed. - Collaboration: Working closely with development, operations, and security teams to streamline workflows and improve efficiency. Qualifications - 3-5 years of hands-on experience - Bachelor's degree in Computer Science, Engineering, Physics, Mathematics or a related field -preferred - Must have an active Secret security clearance - Certifications - CKA, AWS Solutions Architect or AWS DevOps – Associate - Sec+ (within six months of onboarding) - Possesses demonstrated knowledge (mastery preferred) in the following: - Terraform - Kubernetes - AWS EKS & ECS - Docker - Istio - Jenkins - GitHub - GitLab - Artifactory - Cloud native tools - CI/CD Pipelines developing automation - Help onboarding application on the PaaS and Runtime environment
Senior Site Reliability Engineer
ClickHouseClickHouse, Inc. is a database management system that allows users to generate analytical reports using real-time SQL queries. The company’s technology works
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description As one of the first joiners to our Reliability Engineering Team at ClickHouse, you will be responsible for building and leading processes to ensure the reliability, availability, scalability, and performance of our cloud infrastructure that runs ClickHouse databases. You will collaborate with different teams like Control Plane, Dataplane, Core, Security, Support, and Operations and guide them to design and implement scalable, secure, highly available, and fault-tolerant distributed systems. You will also own the areas of incident management and response, post-mortem analysis including running blameless postmortems, and continuous improvement of our ClickHouse services. This role is a unique opportunity to make a significant impact on our elastic, limitless scale, high-performance, serverless ClickHouse Cloud. - Collaborate with various engineering teams in ClickHouse to design and implement scalable, secure, and highly available systems for ClickHouse. - Establish and manage service level objectives (SLOs) and service level agreements (SLAs) for ClickHouse Cloud. - Ensure all the infrastructure components in ClickHouse Cloud (including Dataplane, Control Plane, and ClickHouse Core) have monitoring and alerting in place to ensure timely detection and resolution of incidents. - Enhance and refine incident response processes and post-mortem analysis for any outages in ClickHouse Cloud including working with the support team to communicate to the impacted customers. - Continuously improve the reliability and performance of our ClickHouse services. - Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities. - Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize downtime. Qualifications - Bachelor’s or Master’s degree in Computer Science or a related field. - At least 8 years of experience in Site Reliability Engineering or a related field. - Previous experience using ClickHouse in production. - Hands-on experience with Go and/or Python. - Strong knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform. - Excellent understanding of distributed databases and SQL, particularly ClickHouse is a major plus. - Hands-on experience with container orchestration tools such as Kubernetes or Docker Swarm. - Strong experience with automation and configuration management tools such as Ansible, Terraform, or Puppet. - You are a strong problem solver and have solid production debugging skills. - You are passionate about efficiency, availability, scalability, and data governance. - You thrive in a fast-paced environment, and see yourself as a partner with the business with the shared goal of moving the business forward. - You have a high level of responsibility, ownership, and accountability. - Excellent communication and interpersonal skills. Requirements - The typical starting salary for this role in the US is $141,000 — $208,000 USD. - The typical starting salary for this role in US Premium Markets is $157,000 — $230,000 USD. - Compensation may vary based on various factors including education, qualifications, certifications, experience, skills, location, performance, and the needs of the business or organization. - If you have any questions or comments about compensation as a candidate, please get in touch with us at paytransparency@clickhouse.com. Benefits - Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries. - Healthcare - Employer contributions towards your healthcare. - Equity in the company - Every new team member who joins our company receives stock options. - Time off - Flexible time off in the US, generous entitlement in other countries. - A $500 Home office setup if you’re a remote employee. - Global Gatherings - We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites. - Culture - As part of our first 500 employees, you will be instrumental in shaping our culture.
Senior DevOps Engineer
Stillfront GroupA global games company founded in 2010. Our digital games are enjoyed by ~70 million people every month.
• Design and maintain AWS cloud infrastructure • Implement Infrastructure as Code using AWS CDK • Build and manage CI/CD pipelines with GitHub/GitHub Actions • Develop and maintain Docker-based container environments • Implement DevSecOps practices across the deployment lifecycle • Manage IAM, secrets, and security controls in AWS • Monitor systems for vulnerabilities and security risks



