EITACIES Inc.

EITACIES, The Edge where we bring the difference. Accelerating performance. Achieving #business goals.

DevOps Architect / SME, MultiCloud

DevOps Engineer | Full Time | Remote | Lead | Team 51-200 | H1B No Sponsor

Location

California

Posted

57 days ago

Salary

$60 / hour

Seniority

Lead

Job Description

  • Architect and standardize DevOps practices across AWS, GCP, and hybrid cloud environments
  • Build and maintain Infrastructure as Code frameworks using Terraform, CDK, or Pulumi
  • Design and implement scalable Kubernetes-based platforms (Helm, deployments, orchestration)
  • Develop reusable CI/CD modules and automation frameworks
  • Lead cloud architecture for large-scale systems including compute, storage, networking, and databases
  • Debug complex multi-cloud and third-party integration issues
  • Implement observability, incident management, and reliability systems
  • Define and enforce security best practices across infrastructure and pipelines
  • Drive performance tuning, scalability, and system optimization
  • Establish disaster recovery and backup strategies
  • Mentor engineers and drive adoption of DevOps best practices across teams
  • Collaborate with cross-functional teams and offshore engineering teams

Job Requirements

  • 15+ years of experience in DevOps / Platform Engineering / Cloud Architecture
  • Strong hands-on experience in multi-cloud environments (AWS + GCP)
  • Deep expertise in Infrastructure as Code (Terraform, AWS CDK, or Pulumi), Kubernetes (Helm, deployments, scaling, production systems), and Docker and containerized platforms
  • Strong programming/scripting skills (Python preferred)
  • Experience building reusable infrastructure modules and automation frameworks
  • Strong understanding of networking (DNS, VPCs, load balancers, VPNs, firewalls)
  • Experience with databases (SQL, tuning, DynamoDB, Aurora, etc.)
  • Experience implementing CI/CD systems and reusable pipelines
  • Experience with observability and incident management systems
  • Strong documentation practices (GitHub, CLI tooling, etc.)
  • Proven experience leading DevOps initiatives across teams (Guild / COE model preferred)
  • Experience mentoring engineers and driving adoption of best practices
  • Ability to define standards, not just execute tasks
  • Strong understanding of security best practices and vulnerability management
  • Experience in performance tuning and optimization
  • Experience in disaster recovery and backup strategies
  • Experience in hybrid cloud environments

Benefits

  • 401(k)

More DevOps Engineer Jobs

Engineer Sr Lead, Site Reliability

Zensar

At Zensar, we’re “experience-led everything”. We are committed to conceptualizing, designing, engineering, marketing, and managing digital solutions and experiences for over 130 leading enterprises. We are a company driven by a bold purpose: Together, we shape experiences for better futures. Whether for our clients, our people, or the world around us, this belief powers everything we do. At the heart of our culture is ONE with Client - a set of four core values that reflect who we are and how we work: One Zensar, Nurturing, Empowering, and Client Focus. Part of the $4.8 billion RPG Group, we’re a community of 10,000+ innovators across 30+ global locations, including Milpitas, Seattle, Princeton, Cape Town, London, Zurich, Singapore, and Mexico City. We believe the best work happens when individuality is celebrated, growth is encouraged, and well-being is prioritized. We are an equal employment opportunity (EEO) and affirmative action employer, committed to creating an inclusive workplace. All qualified applicants will be considered without regard to race, creed, color, ancestry, religion, sex, national origin, citizenship, age, sexual orientation, gender identity, disability, marital status, family medical leave status, or protected veteran status.

DevOps Engineer | 57 days ago
Full Time | Remote | Team 10,001

What you will be doing:

The Software Engineer/Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions, Payments and Capital Markets business. In this role, the candidate will have the opportunity to make a lasting impact on the company's transformation journey, drive customer-centric innovation and automation, and position the organization as a leader in the competitive banking, payments and investment landscape. Specifically, the Site Reliability Engineer will be responsible for the following:

  • Design and maintain monitoring solutions for infrastructure, application performance, and user experience.
  • Implement automation tools to streamline tasks, scale infrastructure, and ensure seamless deployments.
  • Ensure application reliability, availability, and performance, minimizing downtime and optimizing response times.
  • Lead incident response, including identification, triage, resolution, and post-incident analysis.
  • Conduct capacity planning, performance tuning, and resource optimization.
  • Collaborate with security teams to implement best practices and ensure compliance.
  • Manage deployment pipelines and configuration management for consistent and reliable app deployments.
  • Develop and test disaster recovery plans and backup strategies.
  • Collaborate with development, QA, DevOps, and product teams to align on reliability goals and incident response processes.
  • Participate in on-call rotations and provide 24/7 support for critical incidents.

What you bring:

  • Proficiency in development technologies, architectures, and platforms (web, API).
  • Experience with cloud platforms (AWS, Azure, Google Cloud) and IaC tools.
  • Hands-on experience with Docker and Kubernetes.
  • Knowledge of monitoring tools (Prometheus, Grafana, DataDog) and logging frameworks (Splunk, ELK Stack).
  • Experience in incident management and post-mortem reviews.
  • Strong troubleshooting skills for complex technical issues.
  • Proficiency in scripting languages (Python, Bash) and automation tools (Terraform, Ansible).
  • Experience with CI/CD pipelines (Jenkins, GitLab CI/CD, Azure DevOps).
  • Ownership approach to engineering and product outcomes.
  • Excellent interpersonal communication, negotiation, and influencing skills.

Explore Life at Zensar and join us to Grow. Own. Achieve. Learn. to be the best version of yourself.

Serbia

Senior Site Reliability Engineer, API Platform Engineer

ELSA, Corp

World's leading A.I. app in English speaking and communication

DevOps Engineer | 57 days ago
Full Time | Remote | Team 201-500 | Since 2015 | H1B No Sponsor

  • Join the AI Infrastructure & Platform team to build, operate, and scale the production systems that power ELSA’s APIs, platform services, and AI-enabled applications.
  • This Senior Site Reliability Engineer / API Platform Engineer role bridges software engineering, cloud infrastructure, and operational excellence, requiring a pragmatic, highly productive individual who can use modern AI tools and automation to accelerate delivery and improve reliability.
  • Collaborate closely with engineering, AI, and product teams to ensure our services are secure, scalable, observable, and resilient in real-world production environments.
  • Design, build, and operate reliable, scalable infrastructure for APIs, platform services, and AI-enabled applications on AWS and Kubernetes.
  • Own and enhance CI/CD pipelines, deployment workflows, and operational tooling to enable safe and fast software delivery.
  • Build and maintain robust observability systems across metrics, logging, tracing, alerting, and service health.
  • Lead incident response, root cause analysis, postmortems, and remediation efforts to continuously improve production reliability.
  • Automate repetitive operational work through software, infrastructure-as-code, and AI-assisted workflows.
  • Use AI-native engineering tools including copilots, intelligent automation, and agentic operational tooling to improve debugging, response time, analysis, and team productivity.
  • Partner with backend, platform, and AI engineering teams to productionize new services and ensure they meet reliability, security, and scalability standards.
  • Optimize infrastructure and runtime performance across latency, throughput, availability, and cost.
  • Define and enforce engineering standards for reliability, security, observability, and operational excellence across services.
  • Contribute production-grade software and internal tools that reduce toil and improve platform leverage across the organization.

Indonesia
Job Closed

Senior DevOps Engineer – Kubernetes Platform

Unit4

The Next-Generation in Smart Enterprise Resource Planning.

DevOps Engineer | 57 days ago
Full Time | Remote | Team 1,001-5,000 | Since 1980 | H1B No Sponsor

  • Collaborate with the Cloud Operations Engineers to design, operate, and improve scalable services running on Azure Kubernetes Service (AKS) as well as standalone Kubernetes clusters on dedicated infrastructure.
  • Ensure the reliability, performance, and observability of both cloud-native and on-premise container platforms.
  • Evolve and maintain our Infrastructure-as-Code ecosystem using Bicep, ARM templates, and Terraform.
  • Support and improve CI/CD capabilities through Azure DevOps (pipelines, boards, Git repos).
  • Collaborate with cross-functional technical teams to expand automation and modernize operational processes.
  • Act as the technical escalation point for the global Operational Support teams.
  • Introduce improvements to deployments, infrastructure, and monitoring with an automation-first mindset.
  • Maintain clear and accurate documentation, runbooks, and knowledge base articles.
  • Support and maintain PostgreSQL instances running in cloud and containerized environments.

Poland
Job Closed

Senior Lead – SAP Site Reliability Engineer, FI

Kyndryl

We design, build, manage and modernize the mission-critical technology systems that the world depends on every day.

DevOps Engineer | 57 days ago
Full Time | Remote | Team 10,001+ | Since 2021 | H1B Sponsor

  • Ensure stability, availability, resilience, and reliability of SAP FI processes
  • Lead management of critical SAP incidents and major problems
  • Act as SAP functional lead for AMS service, guiding functional consultants and support teams
  • Oversee monitoring strategies for critical SAP processes
  • Maintain up-to-date functional and operational documentation

Mexico