Job Closed

This listing is no longer active.

RecargaPay

Nossa missão é democratizar os meios de pagamentos pelo celular por meio de um serviço inovador, econômico e seguro.

Head of Engineering – Cloud & Platform

Engineering ManagerEngineering ManagerFull Time Remote LeadTeam 201-500Since 2010H1B No SponsorCompany Site LinkedIn

Location

Brazil

Posted

143 days ago

Salary

Seniority

Lead

Bachelor DegreeEnglishSpanishAWS Distributed Systems Grafana Java Apache Kafka Kubernetes Microservices Node.js Prometheus Python Spring Spring Boot Terraform HashiCorp Vault

Job Description

- Define and execute the Cloud and Platform strategy, ensuring alignment with corporate objectives, regulatory frameworks, and cost-efficiency goals. - Lead a multi-disciplinary organization covering Cloud Infrastructure, SRE, Platform Engineering, and DevSecOps, fostering collaboration and shared accountability for uptime, security, and performance. - Drive modernization of infrastructure and delivery pipelines, enabling a unified, automated, and compliant cloud environment. - Partner with executive leadership to define scalable operating models, balancing autonomy for product squads with standardized guardrails and golden paths. - Establish a long-term architectural vision for cloud services, platform frameworks, and developer enablement tools. - Sponsor AI-assisted engineering adoption to enhance developer productivity, reduce toil, and accelerate delivery (e.g., Copilot, Cursor, LLM-based agents). - Serve as the ultimate technical and strategic authority for AWS, Kubernetes, IaC, Observability, and Reliability practices across the organization. - Oversee the design, scalability, and governance of the AWS multi-account organization, enforcing security, compliance, and cost policies (Control Tower, SCPs, Service Catalog). - Lead the definition and implementation of multi-region, multi-environment architectures ensuring reliability, latency optimization, and disaster recovery readiness (RPO/RTO). - Institutionalize well-architected principles (Security, Reliability, Performance, Cost, Sustainability) and drive continuous improvement programs based on regular audits. - Evolve network and connectivity architectures (VPC, Transit Gateway, PrivateLink, Global Accelerator) to meet scaling, compliance, and availability goals. - Own identity, access, and secrets management lifecycle (IAM least privilege, mTLS, KMS/HSM key rotation, Vault integration). - Oversee monitoring and observability frameworks, implementing standards, and unified dashboards across all services. - Ensure SLO-driven operations, with well-defined SLIs, error budgets, and automated incident management loops. - Lead resilience and reliability engineering practices, including chaos engineering, failover drills, dependency fallback design, and proactive degradation handling. - Build and scale the company’s Internal Developer Platform (IDP), empowering teams with self-service capabilities for environment provisioning, deployments, and observability. - Define golden paths, opinionated tooling, and reusable infrastructure modules, enabling consistent, secure, and fast software delivery across squads. - Ensure trunk-based development, progressive delivery (canary, blue/green), automated rollback, and health/SLO-gated deployments are embedded into CI/CD flows. - Drive GitOps adoption to achieve deterministic deployments, auditability, and drift detection. - Expand event-driven and streaming platforms (e.g., Kafka), defining keying, partitioning, and schema evolution strategies to support scalability and data integrity. - Partner with Security and Compliance to embed DevSecOps and Policy-as-Code practices into CI/CD and Kubernetes admission controllers. - Establish and lead a FinOps program, optimizing compute, storage, and data transfer costs while ensuring transparency through chargeback/showback models. - Define cost-to-serve models per service and implement automated guardrails for budgeting and right-sizing. - Integrate cost and performance telemetry into platform dashboards to drive data-informed decision-making. - Partner with Finance to align cloud spend forecasts and track savings initiatives tied to architecture decisions. - Lead and mentor senior engineering managers and principal engineers, building high-performance, high-accountability teams. - Promote a culture of reliability, automation, and continuous improvement through transparent metrics and post-incident learning loops. - Establish governance rhythms such as architecture councils, platform guilds, and reliability reviews to align technical direction and eliminate systemic friction. - Collaborate closely with Risk, Compliance, and Security to uphold standards like PCI-DSS, SOC2, ISO27001, LGPD, and GDPR within cloud and platform operations.

Job Requirements

Academic background oriented toward Computer Science, Engineering, or Software Development disciplines.
Deep expertise in AWS cloud architecture, including multi-account management, VPC design, EKS, ECS, Lambda, and networking topologies.
Proven experience with Infrastructure as Code (Terraform, Pulumi) and GitOps automation at scale.
Strong understanding of Kubernetes internals, workload orchestration, and cost/performance optimization.
Experience implementing SRE and reliability frameworks: SLOs, error budgets, chaos testing, and automated incident remediation.
Mastery of observability and monitoring (CloudWatch, Grafana, Datadog, NewRelic) with trace/metric/log correlation.
Proficiency in security and compliance engineering: IAM, KMS, encryption, secrets lifecycle, policy enforcement (OPA/Rego), and regulatory controls (PCI, LGPD, GDPR).
Experience defining and governing API and event-driven architectures (OpenAPI/AsyncAPI, Kafka schema registries).
Deep knowledge of progressive delivery, service mesh (e.g., Istio), and DevSecOps pipelines.
Strong FinOps acumen: right-sizing, egress optimization, reserved instance and savings plan strategy, and service-level cost attribution.
Experience integrating AI-assisted workflows (GitHub Copilot Enterprise, LLM-based linters and others) into development and CI pipelines, with measurable productivity impact.
Extensive hands-on experience in software engineering roles, with solid proficiency in Java (Spring Boot) and working knowledge of Python and asynchronous programming.
Strong foundation in Object-Oriented Programming and relational database systems.
Solid understanding of web and mobile application architectures, including security, session management, and development best practices.
Expertise in Domain-Driven Design and microservices architecture, with proven ability to design high-performance, scalable, and reliable distributed systems.
Demonstrated experience defining and executing architectural roadmaps aligned with business and developer-experience goals.
Deep knowledge of networking in AWS.
Advanced experience architecting VPC topologies, including Transit Gateway, private/public subnet design, NAT/GW cost optimization, and egress control for regulated environments.
Hands-on experience implementing observability pipelines at scale, integrating NewRelic, CloudWatch, Prometheus, Grafana, Datadog.
Familiarity with EKS internals: node group management, autoscaling, and Kubernetes cost/latency optimization.
Proven experience managing multi-region and multi-environment deployments.
Expertise in AWS security hardening and compliance controls, including IAM least-privilege modeling, KMS envelope encryption, CloudTrail auditing, GuardDuty detections, and automatic remediation with Lambda/Step Functions.
Deep understanding of container security, image signing, ECR scanning, and OPA/Rego policy design for admission controllers.
Advanced experience with Infrastructure as Code using Terraform (modules, workspaces, policy enforcement) and Pulumi (multi-language stacks, secrets providers, CI integration).
Proven ability to implement GitOps workflows, ensuring deterministic deployments and drift detection.
Strong policy-as-code practice to codify security/SRE guardrails across CI/CD and Kubernetes admission controllers.
Expertise automating application stack provisioning (app resources, service accounts, IAM bindings, egress controls) through reusable IaC modules and pipelines.
Deep understanding of progressive delivery (canary, blue/green, shadow traffic, automated rollback) and service mesh (Istio/Linkerd/App Mesh) for safe deployment strategies.
Mastery of resilience and reliability patterns: timeouts, bounded retries with jitter, circuit breakers, bulkheads, back-pressure, outbox/saga orchestration, and graceful degradation.
Deep knowledge of event-driven and streaming architectures (Kafka and others), including partitioning strategies, compaction/retention policies, rebalancing, ordering guarantees, exactly-once semantics, and schema evolution via registries.
Strong background in data performance engineering: caching (read-through/write-behind), connection pool tuning, pagination/cursoring, latency budgeting, and throughput modeling.
Experience with SLO-driven reliability: defining SLIs, error budgets, and reducing alert fatigue via multi-signal correlation.
Proficiency with production monitoring tools (NewRelic, Grafana, Datadog, CloudWatch) and advanced observability instrumentation.
Proven experience building self-service developer platforms (Backstage, Internal Developer Portals) that expose golden paths for application scaffolding, environment provisioning, and secure deployments.
Experience implementing event-driven DevEx tooling (e.g., ephemeral environments, automated CI insights, preview deployments).
Strong knowledge of API lifecycle management and governance (OpenAPI/AsyncAPI, contract testing, versioning, idempotency, error modeling).
Expertise in CI/CD automation and DevSecOps (GitHub Actions, CodeBuild/CodePipeline, artifact provenance, environment promotion, changelog automation).
Practical compliance-by-design experience translating PCI-DSS, KYC/AML, GDPR, and LGPD controls into technical patterns (tokenization, segmentation, audit trails, retention/erasure).
Experience leading AWS Well-Architected Framework reviews across all pillars (Security, Reliability, Performance, Cost, Operational Excellence, Sustainability).
Experience designing cost-aware architectures, balancing performance, resilience, and financial efficiency.
Exposure to edge computing and CDN optimization (Lambda@Edge, CloudFront Functions, custom caching policies).
Fluent in English and Spanish (Portuguese a plus).

Benefits

Competitive and market-aligned salary.
Annual Award Program
Remote work — wherever you are, you’re part of the team!
Home office allowance through a monthly deposit in the RecargaPay app.
Health and dental plans with no co-pay.
Life insurance.
Flexible meal allowance (via Flash).
TotalPass membership to take care of your health.
Language classes.

Related Categories

Engineering Manager

Related Job Pages

Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More Engineering Manager Jobs

Engineering Manager – Aggregates Systems

Abnormal Security

Abnormally-Precise, Cloud-Native Email Security

Engineering Manager143 days ago

Full Time RemoteTeam 501-1,000H1B Sponsor

Company Site LinkedIn

• Own the technical direction, reliability, and long-term evolution of aggregation systems spanning both batch and real-time processing. • Guide architectural decisions for distributed data processing, storage, and retrieval systems with strict correctness, latency, and availability requirements. • Ensure aggregation systems consistently meet SLAs for data freshness, accuracy, and uptime across detection and messaging use cases. • Act as a senior technical steward for systems whose failure or inaccuracy would have direct customer and security impact. • Manage, mentor, and develop a team of software engineers working across batch and streaming aggregation systems. • Foster a culture of technical excellence, operational ownership, and continuous improvement.

Kafka Spark

View details: Engineering Manager – Aggregates Systems

United Kingdom

$187.9K - $221K / year

Apply

Job Closed

Manager, Data Engineering

PracticeTek

PracticeTek is a San Diego-based healthcare technology company whose primary goal is to revolutionize healthcare practices by enabling growth and scalability fo

Engineering Manager143 days ago

Other Remote

Company Site

Role Description At PracticeTek, you’ll get to: - Shape the future of healthcare with technology solutions that are always evolving to meet real-world needs. - Team up with passionate, talented people who care deeply about patients, providers, and making a difference. - See your impact firsthand by helping practices deliver care that’s simpler, smarter, and better for everyone. - Grow your career and your skills in an environment that celebrates curiosity, collaboration, and continuous development. Qualifications - 8+ years’ experience in data engineering, with 2–3 years managing engineering teams. - Proven leadership experience in Snowflake data warehousing, data modeling, ETL frameworks (dbt, Airflow), and cloud environments (AWS, Azure). - Strong understanding of business analytics and reporting, with a demonstrated ability to translate business requirements into technical solutions. - Track record of successfully delivering large-scale data projects on time with technical excellence. - Expertise in the areas of data governance, data privacy, information security, and compliance. - Outstanding written and verbal communication, stakeholder management, and strategic thinking capabilities. Requirements - Lead, mentor, and grow the data engineering team, fostering professional growth and a culture of excellence. - Define and execute the data engineering roadmap, emphasizing Snowflake infrastructure and integration solutions. - Oversee architectural decisions, project timelines, and high-quality technical implementations. - Ensure reliability, scalability, and security of Snowflake data solutions and integrations. - Promote cross-functional collaboration to prioritize data projects aligned with strategic goals and initiatives. - Establish data governance, quality frameworks, and compliance standards (HIPAA, SOC2). Benefits - Comprehensive health, dental, and vision coverage options. - Wellness benefits that support lifestyle, behavioral health, and overall wellbeing. - Flexible paid time off, sick time, and 10 company-paid holidays. - 401(k) plan with company match to help you build your future. - Culture Committee driving initiatives that spark connection, fun, and belonging.

Snowflake dbt Airflow AWS Azure ETL

View details: Manager, Data Engineering

United States

$150K - $190K / year

Apply

Job Closed

VP of Software Engineering

CoLab Software

CoLab Software is a Canada-based company that specializes in providing engineering teams with solutions and services to help them with their complex projects. As an employer, the c

Engineering Manager143 days ago

Full Time Remote

• Lead Software Engineering within CoLab’s Product Development org • Guide and grow our teams to improve quality, development speed and confidence • Partner with Product to drive roadmap planning, resource allocation, and feasibility tradeoffs • Ensure compliance with key standards (e.g. SOC 2, ISO) through a mature SDLC • Set and manage operating metrics (e.g. uptime, velocity, quality, cost) that drive business value • Help scale our team with world-class talent and build functions that support career growth across the team • Collaborate across departments — from GTM to CS — to support technical enablement and customer success • Join CoLab’s Senior Leadership Team and help shape company strategy across all areas of the business

AWS Cloud GraphQL PostgreSQL Python React SDLC

View details: VP of Software Engineering

Canada

Apply

Software Engineering Manager, Mobile

Ford Motor Company

At Ford Motor Company, we believe freedom of movement drives human progress. We also believe in providing you with the freedom to define and realize your dreams. With our incredible plans for the future of mobility, we have a wide variety of opportunities for you to accelerate your career potential as you help us define tomorrow’s transportation.

Engineering Manager143 days ago

Other RemoteTeam 10,001+Since 1903H1B Sponsor

Company Site LinkedIn

• Lead software engineering teams that deliver end-to-end software products from conception to production for millions of drivers around the world. • Manage and grow software engineers by providing regular performance feedback, career coaching, and development opportunities. • Lead with customer focus alongside partners in the software development triad, including Digital Product + Digital Design, to deliver high-quality features to our customers. • Actively participate in reviewing, evaluating, and providing feedback on product designs and architectures with a software engineering focus. • Actively seek to improve the engineering delivery pipeline reducing cycle time and increasing quality and security posture. • Apply advanced concepts, theories, and principles to create multi-disciplinary innovations and solutions for the most complex or risky business situations. • Develop and socialize new engineering principles and practices to improve the organization. • Provide thought leadership and perspective across multiple organizations to eliminate knowledge silos. • Drive continuous improvement and build a learning organization. • Evaluate and recommend new and emerging products and technologies.

View details: Software Engineering Manager, Mobile

Texas

Apply

Head of Engineering – Cloud & Platform

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Engineering Manager Jobs

Engineering Manager – Aggregates Systems

Manager, Data Engineering

VP of Software Engineering

Software Engineering Manager, Mobile