Job Closed
This listing is no longer active.
Specializes in building world-class development teams and extending runways for groundbreaking startups.
Senior Platform Engineer – SRE
Location
Brazil
Posted
41 days ago
Salary
0
Seniority
Senior
Job Description
Senior Platform Engineer – SRE
Wizdaa
• Lead IaC architecture • Drive GitOps at scale • Architect and operate multi-tenant Kubernetes infrastructure on AWS EKS • Build self-service infrastructure automation • Lead the use of agentic coding tools for infrastructure work • Own reliability • Set observability standards • Partner with security on zero-trust architecture • Contribute to technical roadmap • Mentor mid-level engineers
Job Requirements
- 6+ years in platform engineering, SRE, or infrastructure
- Deep IaC expertise
- Strong GitOps background
- Deep Kubernetes knowledge
- Strong AWS background
- Experience with multi-tenant infrastructure
- Automation-first thinking at a senior level
- Active user of agentic coding tools
- Reliability engineering track record
- Strong communicator
Benefits
- Health insurance
- 401(k) matching
- Flexible work hours
- Paid time off
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Site Reliability Engineer (DevOps) - Poland This role has been designated as ‘Remote/Teleworker’, which means you will primarily work from home. Who We Are: Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE. Job Description: Site Reliability Engineer (DevOps) - Poland Mist AI is the AI-native networking solution from HPE Juniper Networking and our Software Engineering team is seeking a Site Reliability Engineer to join our talented team and build high quality technology solutions that revolutionize networking, powered by Artificial Intelligence in the cloud. Mist AI provides services through SaaS applications to many Fortune 100 and Fortune 500 customers. You will take ops projects from concept through to launch. You will be responsible for maintaining and improving the company's production environment for rapid scaling and outstanding performance. You will be responsible to help us keep stellar uptime and reliability. The improvements you implement will be felt by the entire organization. For you to be successful, you need to have a hunger to learn and adapt to new technology quickly. We demand people who are naturally curious, can self-start and share learnings and outcomes effectively with a distributed team. You need to be a builder at heart. Responsibilities: - Express your passion about infrastructure as code and continuous deployment to build scalable and highly reliable systems. - Define and own KPIs around system availability, quality and scale. - Partner with our developers and quality engineering teams to automate the monitoring, alerting, availability and scalability of our applications and systems. - Ensure system availability and business continuity by implementing redundant servers/services. - Manage after-hours infrastructure updates and maintenance. - Proactively research and propose the use of new concepts, processes, technologies, and tools. - Partner with software developers to create Mist standards for Microservices (APIs, schemas, serialization, data stores and best practices) - Run secure and scalable applications for highly available, multi-region, AWS and GCP deployments - Ship code several times per week. - Be a part of our On-Call rotation. - Own disaster recovery and business continuity plans. Experience required for you to be successful: - An extensive background in developing and operating large-scale cloud-based distributed applications. - Direct experience developing/running applications on AWS or Google Cloud. - Laser focus and be able to design infrastructure solutions for scalability, reliability, high availability, performance, security, software maintainability, and operational excellence. - The ability to "fix the plane while in flight" (not just support greenfield solutions). - The ability to prioritize existing technical and infrastructure debt, and experience to build and execute a plan to pay it off. Required skills: - Delivering web-scale infrastructure for a global market at high release velocity. - A deep understanding of distributed system design and dependency management. - Must have solid experience with at least 2 of the languages: Go, Java, Python. - 10+ years industry experience in managing infrastructure. - 5 years Kubernetes administration in a large-scale SaaS environment. - 5 years maintaining production systems on AWS or GCP. - 3 years in implementing, managing, and monitoring metrics specific to SaaS applications. - 3 years using infrastructure as code software (eg. Terraform, AWS and Google Cloud Deployment, CloudFormation). - 5 years’ experience in continuous integration practices & tools (Jenkins, Travis CI, CircleCI, etc…). Desired skills - Experience with Kafka, Spark, Storm, Cassandra, ElasticSearch, PostgreSQL, Redis, Zookeeper, Nginx, Airflow. - Experience of working with or contributing directly to Open Source projects. - Understanding and experience of leading/managing technology products. - Understand machine learning techniques and tools. Translate business requirements into data models and implement them for scale and production ready systems. - Experience of working with failure-based testing. - Experience working in a test-driven development environment. Personal skills - Previous experience of contributing to war rooms and blameless postmortems. - Superb communication skills, written and verbal. - Experience of working in a true DevOps environment with daily collaborations. - Thrives in a fast-paced startup environment where there may be multiple competing priorities. - Customer-service mindset. - Passion for improvement. Additional Skills: Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing & Automation, User Experience (UX) What We Can Offer You: Health & Wellbeing We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing. Personal & Professional Development We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division. Unconditional Inclusion We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. Let's Stay Connected: Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE. #poland #technologyandsoftware Job: EngineeringJob Level: TCP_04 "The expected salary/wage range for this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level. – Poland: Annual Salary PLN 206,500 - 409,500 The listed salary range reflects base salary. Variable incentives may also be offered." HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity. Hewlett Packard Enterprise is EEO Protected Veteran/ Individual with Disabilities. HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories. No Fees Notice & Recruitment Fraud Disclaimer It has come to HPE’s attention that there has been an increase in recruitment fraud whereby scammer impersonate HPE or HPE-authorized recruiting agencies and offer fake employment opportunities to candidates. These scammers often seek to obtain personal information or money from candidates. Please note that Hewlett Packard Enterprise (HPE), its direct and indirect subsidiaries and affiliated companies, and its authorized recruitment agencies/vendors will never charge any candidate a registration fee, hiring fee, or any other fee in connection with its recruitment and hiring process. The credentials of any hiring agency that claims to be working with HPE for recruitment of talent should be verified by candidates and candidates shall be solely responsible to conduct such verification. Any candidate/individual who relies on the erroneous representations made by fraudulent employment agencies does so at their own risk, and HPE disclaims liability for any damages or claims that may result from any such communication.
Site Reliability Engineer
LeidosLeidos is an innovation company rapidly addressing the world’s most vexing challenges in national security and health.
Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for talented SREs to join our team and develop reusable solutions that support our customers in any environment. You will have the opportunity to contribute to the design and implementation of Continuous Integration and Continuous Delivery (CI/CD) pipelines that accelerate the secure delivery of software to production. You will automate the buildout of infrastructure in cloud and on-premises environments to operate Kubernetes clusters and microservices deployments. In this role, you will join dynamic Agile software teams that are singularly focused on providing world-class solutions to our customers in an exciting, collaborative, and inclusive atmosphere. You will be intellectually challenged and provided with a tremendous opportunity for growth in a fast-paced, and fun environment. You’ll learn, master, and improve the Continuous Integration Continuous Delivery (CI/CD) processes and tools we use to develop, test, integrate, and deploy our Cloud-based and on-premises solutions into multiple hosting environments, such as AWS, Azure, VMWare, and others. You’ll learn new technologies and tools and apply what you’ve learned to overcome technological challenges with innovative solutions. You’ll collaborate with other software engineers and SREs to share your knowledge with the team and the organization to make us all better at what we do. You’ll perform technical spikes and develop prototypes to help test product concepts and achieve customer validation. Primary Responsibilities - Design, develop, troubleshoot, and maintain mission-critical infrastructure across cloud and on-premises environments using infrastructure-as-code (IaC) - Build and support scalable, highly available, and secure cloud-native architectures, including Kubernetes clusters and microservices deployments - Enable and optimize CI/CD pipelines by applying best practices for automated provisioning, configuration, testing, and deployment - Gather and analyze system and application metrics to support performance tuning, capacity planning, and proactive issue resolution - Partner with development teams to improve system reliability through rigorous testing, release processes, and continuous improvement initiatives - Participate in system design, platform engineering, and technical decision-making to ensure solutions meet functional, performance, and SLA requirements - Collaborate across engineering teams and stakeholders to deliver solutions, resolve technical challenges, and coordinate key deliverables - Develop prototypes, perform technical spikes, and evaluate new tools or approaches to solve complex technical problems - Continuously assess deployed systems and implement improvements to enhance reliability, scalability, and operational efficiency - Mentor team members and contribute to knowledge sharing across the organization Basic Qualifications - Bachelor’s degree in Computer Science, Computer Engineering, or a related field, with 4+ years of relevant experience - Demonstrated ability to deliver projects or processes spanning multiple technical domains, including experience in a technical lead capacity - Solid understanding of Agile development practices, along with CI/CD methodologies and supporting tools - Strong proficiency with Linux and Windows operating systems, as well as networking fundamentals (e.g., HTTP, HTTPS, SSL/TLS, SMTP, DNS) - Hands-on experience provisioning and managing resources within cloud and IaaS environments (AWS, Azure, Google Cloud Platform, etc.) - Practical experience with infrastructure-as-code and automation tools such as Terraform, Ansible, CloudFormation, Chef, or Puppet - Experience working with container technologies (Docker) and orchestration platforms like Kubernetes, including use of kubectl - Proficiency with version control systems, such as Git - Demonstrated curiosity and initiative in learning new tools, frameworks, and technologies - Ability to work independently with minimal supervision while also collaborating effectively within cross-functional engineering teams Travel: - Travel will be 50% within the US as well as overseas Preferred Qualifications - Experience with enterprise event streaming technologies such as Kafka or NATS - Familiarity with monitoring and observability tools like Grafana and Prometheus - Exposure to service mesh and API gateway technologies (e.g., Istio) - Experience with GitOps tools such as Argo CD, Flux CD, or similar platforms - Professional cybersecurity certification (e.g., Security+ or equivalent) - Understanding of Agile development methodologies and practices - Working knowledge of relational database systems such as Oracle, MySQL, PostgreSQL, or SQL Server If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo — because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30 — and moving faster than anyone else dares. Original Posting: April 16, 2026 For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above. Pay Range: Pay Range $73,450.00 - $132,775.00 The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
• Manage and evolve cloud infrastructure (GCP), ensuring applications are scalable, secure and resilient; • Build, operate and maintain infrastructure components such as Kubernetes, GitHub Actions, Cloud Run, and others; • Prepare technical documentation and architectural diagrams for infrastructure flows (CI/CD pipelines, networking, security, databases); • Identify, propose and implement DevOps best practices; • Lead technical meetings with clients; • Analyze and evaluate technical and non-technical requirements, ensuring solutions are scalable, secure, resilient, and aligned with business needs; • Work on resolving critical incidents and optimizing performance in development and production environments; • Implement architectural patterns and best practices, such as microservices and event-driven architecture; • Develop services in Kotlin and/or Python that support the platform (internal APIs, integrations, tools and pipelines); • Collaborate with R&D teams (Frontend, Design and Backend) to ensure solutions are scalable, observable and easy to operate in production.
Role Description We are looking for a DevOps Engineer. - Development and support of infrastructure as a code (Terraform, Terragrunt) - Management and support of container orchestrations (Kubernetes, Docker) - Monitoring and logging of infrastructure (Prometheus, Grafana, ELK) - Implementation of GitOps methods using ArgoCD - Working with cloud platforms (AWS) - Optimization and support of networks and servers (Linux, Nginx, Cloudflare) - Support of microservices and monolithic applications Qualifications - Experience as a DevOps engineer for at least 6 years - Good knowledge and experience of working with Linux systems (Ubuntu) - Ability to write scripts in Bash, Python - Experience with containerization systems (Docker, Kubernetes) - Monitoring and alerting skills using Prometheus, Grafana - Experience with AWS cloud solutions - Ukrainian C1+ - English B1+ Benefits - Rewards & Celebrations - Quarterly Bonus System - Team Buildings Compensations - Memorable Days Financial Benefit - Learning & Development - Annual fixed budget for personal learning - English Language Courses Compensation - Time Off & Leave - Paid Annual Leave (Vacation) - 24 working days - Sick leave - unlimited number of days, fully covered - Wellbeing Support - Mental Health Support (Therapy Compensation) - Holiday Helper Service - Workplace Tools & Assistance - Laptop provided by Company (after probation) Work conditions: - Remote work from EU - Flexible 8-hour workday, typically between 9:00 - 18:00 CET - Five working days, Monday to Friday - Public holidays observed according to Ukrainian legislation - Business trips to Bratislava every 3-6 months (company provides compensation of expenses)



