Job Closed
This listing is no longer active.
Enable enhanced customer engagement, efficient workforce collaboration and AI-driven insights.
Cloud Operations Engineer
Location
South Africa
Posted
114 days ago
Salary
Not disclosed
Seniority
Senior
Job Description
AnywhereNow
- Operate and maintain Azure production environments, ensuring high availability, performance, and stability.
- Collaborate with the wider CloudOps team to improve platform stability by applying practical SRE principles.
- Act as an L4 escalation point for complex cloud incidents across compute, networking, storage, identity, and AKS layers.
- Contribute to and maintain Terraform-managed infrastructure, including reviewing, modifying, and troubleshooting infrastructure-as-code.
- Operate and troubleshoot AKS clusters and workloads.
- Participate in the Cloud Operations on-call rotation for critical production incidents.
- Maintain clear technical operational procedures and runbooks for common scenarios.
Job Requirements
- 5+ years of experience in Cloud Operations, Platform Engineering, or SRE-type roles.
- Strong hands-on experience with Microsoft Azure, including VMs, networking, identity, storage, and monitoring.
- Proven experience working with Terraform in production environments.
- General Systems Troubleshooting: A methodical approach to isolation—distinguishing between application, network, and provider-level issues in a complex, distributed environment.
- Strong operational experience with Azure Kubernetes Service (AKS).
- Experience building automation using scripting languages such as PowerShell, Python, or Bash.
- Experience optimizing Azure environments for reliability and cost efficiency.
- Monitoring Tools: Experience using Azure Monitor to track performance trends.
Benefits
- Health insurance
- Retirement plans
- Paid time off
- Flexible work arrangements
- Professional development programs
More DevOps Engineer Jobs
Senior Director, SRE – Cloud Infrastructure
Cyberhaven
We protect important data other tools can’t see, from threats they can’t detect, across technologies they can’t control.
- Lead, grow, and mentor high-performing globally distributed SRE and Infrastructure teams, including managers and senior ICs
- Own the reliability, availability, scalability, and performance of our production and developer platforms
- Define and execute the SRE and infrastructure strategy, including cloud architecture, Kubernetes platforms, CI/CD, and automation
- Drive horizontal scaling and enable teams to operate independently, through decoupling and modularization of both architecture and processes
- Drive infrastructure cost (COGS) optimization, capacity planning, and cloud financial management in close partnership with Finance and Engineering leadership
- Establish and evolve SLOs, SLIs, error budgets, and operational best practices across the organization
- Oversee incident management, postmortems, and continuous improvement, ensuring a strong culture of learning and ownership
- Collaborate closely with security to ensure our infrastructure is secure, compliant, and resilient by design
- Contribute to and uphold strong documentation, operational standards, and knowledge sharing across teams
- Designing, implementing, and maintaining the infrastructure, CI/CD pipelines, and automation frameworks
- Managing cloud environments (AWS, Azure, or GCP)
- Orchestrating containerized workloads (Kubernetes, Docker)
- Building robust deployment pipelines (Jenkins, GitLab CI, GitHub Actions, ArgoCD)
- Collaborating with software development, security, and operations teams
- Architecting and maintaining infrastructure as code (Terraform, CloudFormation, Pulumi)
- Implementing monitoring, logging, and alerting solutions (Prometheus, Grafana, Datadog, ELK stack)
- Establishing site reliability engineering practices (SLOs, SLIs, incident response)
- Driving security best practices through DevSecOps integration
- Managing secrets and access controls
- Conducting capacity planning
- Mentoring junior engineers
Want to help everyday Americans invest and build wealth? Financial inequality is increasing, and too many people are getting left behind. At Stash, we are passionate about democratizing wealth creation through education, advice, and products that help customers achieve greater financial freedom.

Join our Infrastructure team as a Staff Site Reliability Engineer and play a key role in building and scaling Stash’s platforms. You’ll drive initiatives that strengthen reliability, design secure and resilient systems, and lead automation efforts that make our infrastructure faster and more efficient in a high-growth environment.

What you'll do:
- Design, build, and operate AWS networking and infrastructure, including VPCs, Transit Gateway, PrivateLink, routing, and security boundaries.
- Lead Kubernetes (EKS) platform operations: scaling clusters, optimizing workloads, and ensuring reliability of critical services.
- Automate infrastructure workflows with Terraform and CI/CD pipelines (GitHub Actions) to increase speed and consistency.
- Configure and maintain Nginx for high availability, load balancing, and secure traffic management.
- Troubleshoot and resolve complex issues across systems, networks, and applications (DNS, routing, TCP, container orchestration).
- Collaborate with engineering teams to design scalable cloud solutions and embed best practices for reliability.
- Continuously improve observability using Datadog and related tooling to monitor performance and proactively prevent outages.
- Drive architectural decisions that strengthen system reliability, security, and scalability in AWS.

What we're looking for:
- 8+ years of experience in site reliability engineering or similar roles.
- Deep expertise in AWS networking (VPC design, Transit Gateway, PrivateLink, routing, security groups, NACLs).
- Strong experience with Nginx (configuration, tuning, scaling, troubleshooting).
- Strong expertise in Kubernetes (K8s) and Amazon EKS.
- Advanced skills in AWS infrastructure setup, management, and optimization.
- Proficiency in infrastructure as code (Terraform, Terraform Cloud).
- Strong programming skills in Python and/or Go.
- Experience with system monitoring (Datadog) and logging/archiving practices.
- Extensive experience with GitHub Actions for CI/CD pipelines.
- Proven track record with containerized microservice architectures (Docker).
- Experience with Kafka.
- Experience working in PCI or other regulated environments.

Gold Stars:
- Advanced network security design: experience with segmentation strategies, zero-trust architectures, and firewall policy management.
- Performance optimization expertise: analyzing latency and throughput, tuning DNS resolution, load balancing, and packet-level troubleshooting.
- Observability leadership: hands-on with Datadog dashboards, metrics strategy, log pipelines, and tracing at scale.
- Resiliency and chaos engineering: designing fault-tolerant architectures and running game days to validate recovery plans.
- Compliance and governance experience: prior work in regulated industries (e.g., PCI, SOC 2, HIPAA) beyond just technical enforcement.
- Cross-team leadership: ability to influence architecture decisions across product and platform teams, and mentor engineers on reliability and networking best practices.
- Startup and scale-up experience: familiarity with rapid growth environments where infrastructure must evolve quickly while staying reliable.

Our Commitment to Diversity, Equity, and Inclusion
We proudly celebrate the unique qualities that make you you, 365 days a year, and not just because it’s the right thing to do or good for business. We embed the principles and practices of diversity, equity, and inclusion (DEI) into all that we do to prioritize people, a Stash core value, and to ensure Stashers of all backgrounds and experiences can be their authentic selves. We are also proud to be the first and only venture-backed fintech to join the CEO Action for Diversity & Inclusion™, and as an Equal Opportunity Employer, Stash is committed to building an inclusive environment for people of all backgrounds. If you require any reasonable accommodations to make your application process more accessible, please reach out to recruiting@Stash.com.

Helping You Invest in Yourself
- Comprehensive total rewards package, comprising compensation (salary and equity) and health care benefits
- Complimentary subscription to Stash+ account
- Remote-first work policy: live and work where you feel the most productive, whether that is at home or in an office
- Flexible PTO
- Work-from-home equipment stipends; home internet subsidy
- Paid Parental Leave, primary and secondary (offerings for birth-giving and non-birth-giving parents)
- Enhanced health and wellness benefits through One Medical, Gympass, and Maven Health

External Recognition for Stash
- Benzinga’s 2023 Best Brokerage for Beginners and Best Robo-Advisor Awards
- Qorus-Accenture’s 2023 Banking Innovation Awards
- USA Today and Statista’s 2023 Top 500 Best Financial Advisory Firms
- Comparably's Best Company Awards: Best Places to Work, Best Company Outlook, and Best Engineering Team for Diversity, Women, Culture, and more! (2023)
- Fintech Breakthrough Award: Best Personal Finance App (2023)
- BuiltIn’s Best Places to Work (2022, 2021, 2020, 2019)
- Forbes Fintech 50 (2021, 2020, 2019)
- Best Digital Bank, Finovate Awards (2020)
- Tearsheet Challenge Awards, Best Banking Card Product - Stock-Back® Card (2020)
- LendIt Fintech Innovator of the Year (2020, 2019)

Salary Range: $149,180 - $222,040
The base salary range represents the reasonably anticipated low and high end of the salary range for this position. Actual salaries will vary and will be based on various factors, such as the candidate’s qualifications, skills, experience and competencies, as well as internal equity and alignment with market data for companies of our size and industry.
Software Engineer – DevSecOps, Experienced/Senior
Boeing
A leading global aerospace company and top U.S. exporter, Boeing develops, manufactures and services commercial airplanes, defense products and space systems for customers in more than 150 countries. Our U.S. and global workforce and supplier base drive innovation, economic opportunity, sustainability and community impact. Boeing is committed to fostering a culture based on our core values of safety, quality and integrity.
- Develops, documents, and maintains standardized, efficient, and innovative processes, tools, methodologies, and performance metrics to streamline the software engineering lifecycle
- Automates, develops, monitors, improves, and troubleshoots across software engineering development, tooling, testing, integration, deployment, configuration processes, and security controls
- Analyzes, plans, and executes mitigation strategies to prevent potential security risks, threats, and vulnerabilities
- Implements an environment that enables high levels of quality, safety, compliance, and continuous improvement across the organization
- Supports collaboration with cross-functional teams to build and maintain robust, scalable, and secure software engineering systems




