Reshaping the future of energy
SRE Specialist – Platform Engineering
Location
Brazil
Posted
4 days ago
Salary
0
Seniority
Senior
Job Description
SRE Specialist – Platform Engineering
Raízen
• Evolve and maintain the enterprise Kubernetes platform (AKS/EKS), ensuring scalability, security and high availability of the environments; • Build and enhance infrastructure and operations automation using Infrastructure as Code and GitOps practices; • Develop and maintain CI/CD pipelines, supporting teams in the continuous delivery journey; • Implement observability, monitoring and distributed tracing solutions to ensure visibility and reliability of services; • Respond to critical incidents, perform root cause analysis and implement continuous improvements to the platform; • Support development teams in adopting cloud, Kubernetes, observability and automation best practices; • Evolve the internal engineering platform to improve developer experience and accelerate delivery of business value; • Implement and optimize autoscaling strategies, capacity management and operational efficiency for cloud environments; • Collaborate with cross-functional teams to define architecture, security and governance standards for Azure and AWS environments; • Evaluate, test and implement new solutions and technologies focused on Platform Engineering, SRE and enterprise automation.
Job Requirements
- Bachelor's degree;
- Solid experience administering and evolving Kubernetes environments, preferably on managed platforms such as AKS (Azure Kubernetes Service) and/or EKS (Amazon Elastic Kubernetes Service);
- Experience in Public Cloud environments, working with Azure and/or AWS, including infrastructure, networking and security services;
- Experience implementing and maintaining CI/CD pipelines and GitOps practices, using tools such as GitHub Actions or similar;
- Advanced knowledge of Infrastructure as Code (IaC), using Terraform, Crossplane or equivalent tools for provisioning and infrastructure governance;
- Experience with modern observability, monitoring, logging and distributed tracing solutions, using tools such as Grafana, Prometheus, OpenTelemetry, Loki, Tempo or similar;
- Strong knowledge of Linux, containers and Docker, including troubleshooting and optimization of containerized environments;
- Experience with automation and scripting using Bash, PowerShell, Python or equivalent languages;
- Knowledge of networking, DNS, load balancers, connectivity and security in cloud environments;
- Ability to analyze and resolve issues in distributed, mission-critical environments;
- Experience building, operating and evolving corporate platforms with a focus on reliability, scalability, automation and developer experience.
- Preferred Qualifications:**
- Experience with Argo CD and the GitOps ecosystem;
- Knowledge of Argo Workflows and Argo Events for orchestration and process automation;
- Experience with Karpenter, Cluster Autoscaler or other advanced Kubernetes autoscaling solutions;
- Experience with Service Mesh technologies such as Istio, Linkerd or similar;
- Knowledge of FinOps, capacity management and cost optimization in cloud environments;
- Experience with AIOps initiatives, intelligent automation and applying AI to platform operation;
- Familiarity with Terragrunt, Crossplane and advanced infrastructure management tools;
- Experience with distributed observability, tracing and performance analysis for large-scale applications;
- Experience with hybrid architectures and multi-cloud environments.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Engineer / Linux Administrator
ASM ResearchIt is the policy of ASM that an individual's race, color, religion, sex, disability, age, sexual orientation or national origin are not and will not be considered in any personnel or management decisions. We affirm our commitment to these fundamental policies. All recruiting, hiring, training, and promoting for all job classifications is done without regard to race, color, religion, sex, disability, or age. All decisions on employment are made to abide by the principle of equal employment.
Role Description The DevOps Engineer / Linux Administrator supports and enhances enterprise Linux environments through automation, infrastructure management, CI/CD pipeline development, and system administration. This role is responsible for maintaining secure, reliable, and scalable Linux-based platforms while partnering with development, security, and operations teams to improve deployment efficiency, system performance, and operational stability. - Administer, maintain, troubleshoot, and optimize enterprise Linux environments. - Perform Linux system logging, auditing, patching, and performance tuning across production and non-production systems. - Develop and maintain automation solutions – including providing scripting – for Linux administration and other applications related processes utilizing Jenkins and Ansible Core. - Troubleshoot and manually find and resolve Linux issues. - Build and set up new development tools and infrastructure utilizing knowledge in continuous integration, operational delivery, deployment management (CI/CD), cloud technologies, container orchestration, and security. - Modify existing software and scripts to correct errors, adapt to new infrastructure requirements, and improve performance. - Analyze user needs and technical requirements to determine the feasibility of design and implementation within time and cost constraints. - Collaborate with developers, engineers, security teams, and other stakeholders to design systems and define interfaces, capabilities, and performance requirements. - Build and test end-to-end CI/CD pipelines to ensure the systems are safe against security threats. - Provide accurate and realistic work effort estimates, commit, and deliver results accordingly. - Create and maintain technical documentation, operational procedures, and knowledge transfer materials. Qualifications - 3+ years of experience implementing, administering, and troubleshooting Linux in an enterprise environment including Linux patching with DNF and YUM. - Strong experience building and supporting CI/CD pipelines using tools. Must have strong working knowledge of Jenkins (groovy), Ansible Core (yaml), GitLab CI/CD, FlexDeploy, or similar technologies. - Strong experience with Ansible and Jenkins. - Strong knowledge of DNS/Networking and networking debugging with packet capture. - Strong scripting knowledge in Python, Bash, Zsh, Ksh, Csh. - Strong configuration management knowledge and experience. - Experience working with REST APIs. - Experience working in secure environments. - Experience in an OCI environment on virtual images. - Strong verbal, written, organizational, and process documentation skills. Requirements - Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field, or equivalent relevant experience. - Strong hands-on experience with Linux administration, including patching with DNF and YUM, logging, auditing, performance tuning, and issue resolution. - Experience with scripting and automation using several of the following: Python, Bash, Zsh, Ksh, or Csh. - Experience working with REST APIs and integrating automation with external systems. - Strong knowledge of DNS, networking fundamentals, and network troubleshooting, including packet capture analysis. - Experience working in secure environments with a strong understanding of operational discipline and system hardening. - Experience with configuration management and infrastructure automation. - Experience supporting Linux systems in OCI environments using virtual images. - Ability to provide accurate effort estimates, manage assigned priorities, and deliver work as committed. - Strong verbal, written, organizational, and technical documentation skills. - Experience supporting Linux platforms in highly regulated or government-secured environments. - Familiarity with container orchestration, cloud-native deployment practices, and secure CI/CD implementations. - Experience building hardened Linux images and supporting secure software delivery pipelines. - Experience partnering across development, operations, and cyber security teams to improve deployment efficiency and platform reliability. - Proven ability to identify process improvement opportunities and implement automation that reduces manual administration. - Secret clearance required. - U.S. citizenship required. - Ability to work remotely. - No travel required. Benefits - Compensation ranges for ASM Research positions vary depending on multiple factors; including but not limited to, location, skill set, level of education, certifications, client requirements, contract-specific affordability, government clearance and investigation level, and years of experience. - The compensation displayed for this role is a general guideline based on these factors and is unique to each role. - Monetary compensation is one component of ASM's overall compensation and benefits package for employees.
• Effectual Senior DevOps Architects are responsible for technical leadership of Professional Services projects. • Partner with Engagement Managers (EMs) to deliver an exceptional customer and delivery team experience. • Lead DevOps transformation and platform engineering initiatives for enterprise clients. • Design and implement enterprise-scale CI/CD platforms, container orchestration systems, and cloud automation solutions while mentoring client teams through their DevOps adoption journey. • Architect CI/CD platforms and Kubernetes infrastructure using best practices. • Design automation frameworks, observability strategies, and security integration patterns. • Develop Infrastructure-as-Code using Terraform, CloudFormation, and AWS CDK. • Provide technical guidance and DevOps mentorship to client engineering teams. • Lead DevOps transformation workshops, knowledge transfer sessions, and incident response. • Troubleshoot complex infrastructure and pipeline challenges; create comprehensive documentation.
Dev-Ops Lead Engineer
Spectrum.LifeHeadquartered in Dublin, Leinster, Ireland, Spectrum.Life is the nation's largest provider of employer health and wellness services. Founded in 2018, Spectrum.Life has since expand
Role Description We are seeking an experienced and forward-thinking DevOps Lead to own the infrastructure, security, and continuous delivery pipelines that form the backbone of our AI-driven projects. This is a critical, hands-on role where you will be responsible for building and maintaining a resilient, secure, and highly automated environment across our cloud platforms. You will be the designated expert for all things related to infrastructure and DevOps. Your deep understanding of cloud architecture, security principles, and CI/CD will ensure our engineering teams can build and release software quickly, safely, and efficiently. If you are passionate about automation and leveraging AI to create intelligent, self-healing systems, we want to hear from you. Responsibilities - Cloud Infrastructure & Automation - Architect, build, and manage scalable and secure infrastructure on AWS using Infrastructure-as-Code principles, primarily with Terraform. - Develop and maintain bespoke automation scripts to accelerate project setup, on-demand environment creation, and other operational tasks. - Champion and implement solutions like LocalStack to streamline local development and testing workflows for engineers. - Provide expert guidance on systems architecture, ensuring our infrastructure is designed for performance, scalability, and resilience. - Collaborate with engineering teams to manage and automate the infrastructure for our services, including APIs and databases, ensuring their performance and reliability. - CI/CD & Release Management - Develop and improve CI/CD pipelines using GitHub Actions, from code commit to production deployment. - Integrate and manage automated testing, dependency updates, and security scans within the pipelines to ensure code quality and security. - Empower engineers with the tools and automation needed to reduce friction, manage technical debt, and focus on building great products. - Define and continuously improve our release processes, ensuring smooth and predictable deployments. - Security & Compliance - Act as the subject matter expert for security, compliance, and data flows within our cloud infrastructure. - Implement and manage security best practices and automated tooling (SAST/DAST, dependency scanning) to protect our applications and data. - Oversee the security and compliance of AI-related data flows, ensuring that any data sent to third-party services is minimized, anonymized, and explicitly not used for external training purposes. - Ensure all infrastructure and processes adhere to legal and regulatory requirements, maintaining customer trust and data privacy. - Observability & Incident Management - Implement and manage a robust observability strategy using tools like Sentry, Datadog, and native cloud services. - Configure critical alerting and monitoring (e.g., AWS CloudWatch Alarms) and integrate them with notification services to ensure rapid response. - Lead the incident management process for infrastructure-related issues, with a focus on root cause analysis and proactive prevention to minimize hotfixes. - Champion the use of AI in operations, exploring and implementing tools for anomaly detection, predictive analysis, and automated remediation. - Collaborate with our existing Core infrastructure engineer on business-as-usual projects to ensure strategic alignment across the company, while maintaining a primary focus on the AI project initiatives. Qualifications - Proven experience in a DevOps/Infrastructure Engineering role with a focus on automation. - Proficiency in managing cloud infrastructure on AWS. - Experience supporting infrastructure for ML/AI projects (MLOps) is highly desirable. - Deep, hands-on experience with Infrastructure-as-Code using Terraform. - Hands-on experience with containerization technologies (Docker, Kubernetes) and networking (VPCs, Load Balancers). - Expert-level knowledge of building and managing complex CI/CD pipelines, with a strong preference for GitHub Actions. - Strong understanding of system architecture, security best practices, and compliance standards. - Comfortable with scripting languages (e.g., Python, Bash) to build automation and tooling. - Experience with the operational lifecycle of APIs and databases from an infrastructure perspective. - Hands-on experience with modern observability and error tracking tools such as Sentry, Datadog, Prometheus, or Grafana. - You have a deep technical curiosity and a passion for automation. - You act as a force multiplier, empowering the engineering team with the tools and processes they need to succeed. - You take complete ownership of your domain and are a reliable partner to the engineering teams you support. - You are a strategic thinker who can balance speed and safety, enabling developers to move fast without compromising on security or stability. - You are a strong communicator who can explain complex technical concepts to a variety of audiences. - You are proactive in identifying potential issues, reducing technical debt, and improving the overall development lifecycle. Benefits - Full-time permanent contract - Work from home - Competitive salary (Dependent on experience) + employee benefits - Continuous professional development and training opportunities. - 25 days of annual leave - 24/7 EAP and a wide range of health and wellbeing supports - Extensive list of employee perks and benefits: Employee Perks
Manager, DevOps Engineering
ImpinjImpinj is a leading RAIN RFID provider and Internet of Things pioneer. We’re inventing ways to connect every thing to the Internet—including retail apparel, retail general merchandise, healthcare items, automobile parts, airline baggage, food, and much more. With more than 100 billion items connected to date, and multiple Fortune 500 enterprises around the world using our platform, we solve for a better understanding of our world.
Role Description Impinj is seeking a DevOps Engineering Manager to lead our Developer Infrastructure team within the Software organization. You will be responsible for developing a high-performing team and establishing the standards, architecture, and practices that enable multiple engineering organizations – spanning software, innovation, and silicon – to deliver high-quality products at scale. You will serve as the primary partner to engineering leadership across the organization, translating business and product priorities into a coherent infrastructure and platform strategy. - Lead and develop a team of DevOps engineers, including hiring, performance management, and career development. - Define and own the team's technical roadmap for infrastructure, CI/CD, and platform tooling, aligned to broader organizational goals. - Partner with engineering leadership across software, firmware, and adjacent teams to streamline delivery and improve release processes. - Establish and govern CI/CD standards using GitHub Enterprise, ensuring consistent and scalable build, test, and deployment practices across teams. - Lead cloud infrastructure strategy and architecture (AWS, Azure, GCP), with accountability for architectural decisions and cloud spend optimization across the organization. - Drive governance of artifact management and dependency policies through JFrog Artifactory across multiple engineering teams. - Build and own monitoring and alerting solutions using Datadog. - Set standards for configuration management, automation, and infrastructure as code (Terraform, Ansible), ensuring the team delivers with consistency and quality. - Manage vendor relationships and contribute to budget planning for infrastructure and tooling. - Monitor industry trends and emerging technologies to shape platform strategy and inform long-term investments. Qualifications - Bachelor's degree in Computer Science, Engineering, or a related field. - 8+ years of experience in DevOps, software development, or systems engineering, with 3+ years in a people management or team lead role. - Proven track record of building and retaining high-impact engineering teams across a range of experience levels. - Technical fluency in cloud platforms (AWS, Azure, GCP) and containerization technologies (Docker, Kubernetes), sufficient to make sound architectural trade-off decisions. - Experience with scripting languages (e.g., Python, Bash) and infrastructure as code tools (e.g., Terraform, CloudFormation). - Broad knowledge of CI/CD platforms, artifact management (Artifactory), monitoring and logging tools (Datadog, Prometheus, Grafana), and configuration management (Ansible, Chef, Puppet). - Strong communication skills with the ability to translate technical concepts for non-technical stakeholders and influence decisions at the leadership level. - Demonstrated ability to manage competing priorities, drive clarity in ambiguous situations, and deliver results in a dynamic environment. Benefits - The typical base pay range for this role across the US is $158,700 - $238,100. - Individual base pay depends on various factors such as complexity and responsibility of role, job duties, requirements, and relevant experience and skills. - At Impinj, certain roles are eligible for additional rewards, including merit increases, annual bonus, and stock. - US-based employees have access to healthcare benefits; a 401(k) plan and company match among others. Company Description Impinj is a leading RAIN RFID provider and Internet of Things pioneer. We’re inventing ways to connect every thing to the Internet — including retail apparel, retail general merchandise, healthcare items, automobile parts, airline baggage, food, and much more. Impinj is committed to creating a diverse and inclusive work environment and welcomes applicants from all backgrounds.

