Founded in 1967, Capgemini is revered as one of the world's leading consulting, technology, and outsourcing agencies. In 2016 alone, the company reported global
Kubernetes Platform Engineer
Location
United States
Posted
76 days ago
Salary
$76.2K - $187K / year
Seniority
Mid Level
Job Description
Kubernetes Platform Engineer
Capgemini
At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world’s most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and engineering services across all industries. Join us for a career full of opportunities. Where you can make a difference. Where no two days are the same. Location This is a fully remote role, anywhere in the USA. About the job you're considering We are seeking a Kubernetes Engineer to design, deploy, and operate Kubernetes-based hosting platforms across edge and embedded systems. You’ll play a critical role in ensuring our platform is secure, scalable, and resilient—supporting multi-tenant services and orchestrating workloads across aircraft and ground-based environments. This is a high-impact role for engineers who thrive in cloud-native ecosystems and are passionate about automation, security, and performance optimization. Experience in the telecom domain and networking is a strong plus. Your role - Develop and maintain custom Kubernetes controllers, CRDs, and operators to extend platform capabilities. - Integrate Kubernetes with Linux-based infrastructure bootstrapped via PXE, OTA, and other provisioning methods. - Deploy and configure K3s clusters across heterogeneous hardware (bare metal, ARM/x86 nodes, and accelerators). - Automate cluster provisioning, node registration, and upgrade workflows. - Monitor platform health using Prometheus, Grafana, and Kubernetes Events. - Enforce network policies, RBAC, secrets management, and container security best practices. - Optimize Kubernetes performance on constrained hardware and air-gapped systems. - Troubleshoot container runtimes, DNS resolution, and overlay networking (e.g., Calico, Flannel). Your Skills and Experience - 5+ years of experience managing Kubernetes platform components and lifecycle operations. - Understanding of multi-node hybrid clusters across x86, ARM, and accelerators. - Proficiency with Helm charts and Argo CD for application and platform deployment. - Strong skills in Go, Bash scripting, YAML/JSON configuration, and REST API development. - Deep understanding of Kubernetes CNI plugins and network policy management. The base compensation range for this role in the posted location is: $76,200-$187,740. Capgemini provides compensation range information in accordance with applicable national, state, provincial, and local pay transparency laws. The base compensation range listed for this position reflects the minimum and maximum target compensation Capgemini, in good faith, believes it may pay for the role at the time of this posting. This range may be subject to change as permitted by law. The actual compensation offered to any candidate may fall outside of the posted range and will be determined based on multiple factors legally permitted in the applicable jurisdiction. These may include, but are not limited to: Geographic location, Education and qualifications, Certifications and licenses, Relevant experience and skills, Seniority and performance, Market and business consideration, Internal pay equity. It is not typical for candidates to be hired at or near the top of the posted compensation range. In addition to base salary, this role may be eligible for additional compensation such as variable incentives, bonuses, or commissions, depending on the position and applicable laws. Capgemini offers a comprehensive, non-negotiable benefits package to all regular, full-time employees. In the U.S. and Canada, available benefits are determined by local policy and eligibility and may include: - Paid time off based on employee grade (A-F), defined by policy: Vacation: 12-25 days, depending on grade, Company paid holidays, Personal Days, Sick Leave - Medical, dental, and vision coverage (or provincial healthcare coordination in Canada) - Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada) - Life and disability insurance - Employee assistance programs - Other benefits as provided by local policy and eligibility Important Notice: Compensation (including bonuses, commissions, or other forms of incentive pay) is not considered earned, vested, or payable until it becomes due under the terms of applicable plans or agreements and is subject to Capgemini’s discretion, consistent with applicable laws. The Company reserves the right to amend or withdraw compensation programs at any time, within the limits of applicable legislation. Disclaimers Capgemini is an Equal Opportunity Employer encouraging inclusion in the workplace. Capgemini also participates in the Partnership Accreditation in Indigenous Relations (PAIR) program which supports meaningful engagement with Indigenous communities across Canada by promoting fairness, accessibility, inclusion and respect. We value the rich cultural heritage and contributions of Indigenous Peoples and actively work to create a welcoming and respectful environment. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law. This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodation does not pose an undue hardship. Capgemini is committed to providing reasonable accommodation during our recruitment process. If you need assistance or accommodation, please reach out to your recruiting contact. Please be aware that Capgemini may capture your image (video or screenshot) during the interview process and that image may be used for verification, including during the hiring and onboarding process. Click the following link for more information on your rights as an Applicant in the United States. http://www.capgemini.com/resources/equal-employment-opportunity-is-the-law Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Staff DevOps Engineer
Generac Power SystemsWe are DR Power, a Generac Company, professional power equipment done right. Established in 1985, we are a leader in the design and manufacture of professional-grade gas and battery-powered outdoor power equipment. We are dedicated to the enduring quality and uncompromising performance of everything we build. We stand behind every DR® product and are here to help every customer regardless of when or where they made their initial purchase.
We are Generac, a leading energy technology company committed to powering a smarter world. Over the 60 plus years of Generac’s history, we’ve been dedicated to energy innovation. From creating the home standby generator market category, to our current evolution into an energy technology solutions company, we continue to push new boundaries. Primary Purpose As a Staff DevOps Engineer on the Industrial Cloud team, you will be a technical leader and key contributor setting technical direction, mentoring engineers, and driving the evolution of our DevOps discipline to enable the delivery of highly scalable, observable software and infrastructure. At Generac, we are committed to providing sustainable cleaner energy products and technology. This is challenging work, and we are looking for individuals who are driven by being part of a team that will have a positive impact on the climate at scale. Your Daily Impact - Develop and own best practices for infrastructure creation, application scaling, monitoring and CI/CD. - Automate everything from infrastructure to everyday toil; identify inefficiencies and propose solutions. - Enable software engineers to work efficiently throughout the development lifecycle to deliver reliable, observable software. - Mentor development and other DevOps engineers on emerging industry trends, technical standards, and DevOps best practices. - Drive technical strategy and roadmap for platform infrastructure, influencing engineering direction at the organizational level. - Lead and collaborate with the development team and other DevOps engineers across the organization to define, standardize, and evangelize best practices for software delivery excellence at scale. - Lead cross-functional initiatives, serving as a technical authority and primary decision-maker for complex infrastructure and platform challenges. Minimum Job Requirements Education B.S. in Computer Science or Engineering or equivalent years of work experience Certification / License Work Experience 10+ years of non-internship experience in software engineering with at least 5 focused in DevOps Knowledge / Skills / Abilities - Deep, hands-on expertise designing, implementing and operating cloud-based systems in Azure, AWS, or Google Cloud including advanced architecture and cost optimization - Proven track record defining and scaling software delivery best practices, SDLC processes, and DevOps culture across engineering teams - Passion for reliable, scalable, observable software with a strong sense of ownership - Experience with infrastructure as code, preferably Terraform, and strong proficiency in containerization and container orchestration at production scale - Proficiency in Python, Go, or a similar language for building automation tooling, internal platforms, and operational scripts - Solid understanding of fundamental networking concepts (DNS, TCP/IP, load balancing, VPCs, firewalls, service mesh) and their application in cloud environments - Proven ability to navigate ambiguity in complex infrastructure environments, applying incremental improvement, frequent deployment practices, and well-defined rollback strategies to reduce risk - Hands-on experience designing and implementing disaster recovery strategies, including RTO/RPO planning, failover automation, backup validation, and regular DR testing - Naturally curious, growth oriented, and able to influence technical direction without direct authority - Demonstrated ability to mentor and develop engineers at multiple levels of seniority - Experience leading platform or infrastructure strategy conversations with senior engineering and product leadership - Strong written and verbal communication skills; ability to distill complex technical topics for non-technical stakeholders - Remains deeply hands-on and willing to contribute directly to complex technical work alongside the team, not just through direction - Sound judgment in the practical application of AI tooling—able to identify where AI accelerates delivery and quality, and where human expertise and oversight remain essential Physical Demands: While performing the duties of this job, the employee is regularly required to talk and hear; and use hands to manipulate objects or controls. The employee is regularly required to stand and walk. On occasion the incumbent may be required to stoop, bend or reach above the shoulders. The employee must occasionally lift up to 25 - 50 pounds. Specific conditions of this job are typical of frequent and continuous computer-based work requiring periods of sitting, close vision and ability to adjust focus. Occasional travel. “We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, national origin, disability status, protected veteran status, or any other characteristic protected by law.”
Rightworks offers the only intelligent cloud purpose-built for accounting firms and professionals. Backed by award-winning support, our fully managed IT and applications ensure customers have secure, reliable, on-demand access to their technology. We provide a curated software ecosystem that simplifies the complexity of running an accounting firm or small business, supported by a community of thought leaders, peer networks, and educational resources. Our success is made possible by leveraging decades of specialized experience in leading accounting firms, SMBs and technology companies. Thousands of Firms and SMBs count on us to run their business every day. We have a great team, we’re growing fast and have a winning culture based on innovation, teamwork, and mutual respect. Job Overview: We are looking for an experienced and proactive Tier 1 Automation Engineer to join our IT operations team. In this role, you will be responsible for maintaining, optimizing, and securing our organization’s customer facing server infrastructure, both on-premises and in the cloud. You will work closely with various teams to ensure the availability, performance, and security of critical systems and services. Your expertise will help guide IT strategy, troubleshoot issues, and ensure operational excellence in managing enterprise systems. This is a remote based position. Responsibilities: - Administer and maintain servers, networks, and systems across on-premises and cloud environments - Perform system upgrades, patches, and troubleshooting to ensure system reliability and security. - Finishes work as assigned by Lead and AE Manager, adhering to established code standards and delivery processes. - Creates new automation for applications that need update or install automation, using existing Tier 2 tooling and libraries. - Maintains annual application update automation, ensuring timely and successful rollout of new versions. - Creates and maintains custom automation for RMM (e.g., custom scripts, monitors, and administrative tasks). - Documents all new and updated automation scripts and processes in the central repository (Gitlab/Atlassian). - Troubleshoots and resolves failed application deployments and automation tasks in the client environment. - Ensure compliance with industry regulations and best practices related to system security, data privacy, and IT governance. - Stay up to date with the latest trends and technologies in system administration, recommending improvements when necessary. Requirements: - Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent work experience). - 5+ years of experience in system administration, devops, or other automation, with a solid track record of managing enterprise-level IT infrastructures. - Strong experience with operating systems (Linux, Windows Server, etc.) and server management. - In-depth knowledge of virtualization technologies (VMware, Hyper-V, KVM, Containers). - Expertise in network configuration, routing, and security (DNS, DHCP, VPN, firewalls). - Experience with automation and scripting (e.g., PowerShell, Bash, Python, Ansible). - Strong troubleshooting and problem-solving skills, particularly with complex infrastructure issues. - Familiarity with containerization technologies (Docker, Kubernetes) and continuous integration/continuous deployment (CI/CD) practices is a plus. - Experience with monitoring tools (e.g., Nagios, Zabbix, SolarWinds) and performance tuning. - Understanding of IT security best practices and experience implementing security measures. - Certifications such as CompTIA Server+, Microsoft Certified: Azure Administrator Associate, AWS Certified SysOps Administrator, or similar are a plus. - Excellent communication and teamwork skills, with the ability to interact effectively with various technical and non-technical teams. - Ability to manage multiple tasks and prioritize effectively in a fast-paced environment. Eligibility Requirements - This role is open to US Citizens or permanent residents authorized to work in the United States. Rightworks LLC is unable to offer visa sponsorship. - Due to specific state regulations, we are unable to accept applications from residents of California, Hawaii, or Alaska. - Relocation will not be offered for this position. Compensation Our Compensation range for this role ranges from $55,000 to $65,000 annually, and is determined based on factors such as relevant experience, skills, and internal equity. Benefits To provide best-in-class solutions, we need a best-in-class team. We offer competitive salaries to recruit the best talent. We provide company-paid short and long-term disability insurance, life insurance and a generous 401K match. We offer highly affordable medical, dental, vision coverage, and many other valuable benefits. We offer flexible PTO, and numerous paid holidays, affording you the time to be there for what is important in your life. We encourage giving back to our communities by providing paid volunteer time off. We are proud to be an Equal Opportunity Employer! This job description may not be inclusive of all assigned duties, responsibilities, or aspects of the job described, and may be amended at any time at the sole discretion of the employer.
- Location: Remote (India preferred) - Department: Product, Engineering & Data Science - Report to: Senior Director of Engineering About Us ELSA is a global leader in AI-powered English communication training, dedicated to transforming how people learn and speak English with confidence. Founded in 2016 and headquartered in San Francisco, we operate across the U.S., Vietnam, Portugal, Indonesia, Brazil and Japan. Powered by proprietary speech-recognition technology and generative AI, ELSA delivers real-time, hyper-personalized feedback to help learners improve pronunciation, fluency, and overall communication effectiveness. With over 50 million learners and 1 billion hours of anonymized speech data, ELSAs depth of language training intelligence is unmatched in the industry. Our B2B flagship platforms ELSA Enterprise and ELSA Schools empower organizations and educational institutions to elevate communication capabilities and unlock personal and professional opportunities for their people. We design engaging, bite-sized learning experiences that adapt to each learner's goals and context, ensuring measurable improvement and lasting confidence. Our vision is to become the global standard for real-time English communication training, enabling 1.5 billion language learners worldwide to speak clearly, be understood, and share their stories with the world. Backed by world-class investors including Googles Gradient Ventures, Monks Hill Ventures, and SOSV, ELSA has been recognized among the top global AI innovators: - Forbes Top 4 Companies Using AI to Transform the World - Research Sniper Top 5 Best AI Apps - ASU+GSV EdTech 150 - CB Insights Top 100 AI Companies Join us in shaping the future of language learning and empowering millions to unlock opportunity through confident communication. Role Summary We are looking for a Principal DevOps / SRE engineer to build and own our reliability practice end-to-end. This is not a firefighting role — our team already responds well to incidents. This person will formalize what works, automate what repeats, and build the foundation for enterprise-grade SRE as ELSA scales its B2B footprint. Key Responsibilities - Own the SRE practice: define severity tiers (P1–P4), formalize on-call rotation, build SLA tracking dashboards, and establish incident management workflows across a team of 4 DevOps engineers. - Build runbooks for the top recurring operational issues — pod scaling, deploy rollbacks, access management, EKS upgrades, CI/CD pipeline failures — and automate L1/L2 responses using tools like Shoreline.io, Rundeck, or PagerDuty automation. - Introduce and operationalize AI-assisted DevOps tooling: AIOps for alert correlation, CastAI/Kubecost for cost optimization, GitHub Copilot for IaC acceleration. Train the existing team on these tools. - Drive infrastructure modernization: EKS upgrades, Karpenter migration, observability (SigNoz/Prometheus), secrets management (ArgoCD/SOPS), and Terraform-based IaC maturity. - Collaborate with AI Engineering, Mobile, and B2B teams to ensure infrastructure supports real-time speech processing, GPU workloads, and multi-region enterprise deployments. - Design and plan round-the-clock SRE coverage model as B2B enterprise SLA commitments grow — evaluate vendor partnerships or strategic hires for Americas timezone coverage. What You Will Have - 2+ years in DevOps/SRE, with at least 2 years in a principal or staff-level role owning reliability practices for a production SaaS product. - Deep hands-on experience with AWS (EKS, EC2, DynamoDB, S3, IAM, Secrets Manager), Kubernetes (HPA, KEDA, Karpenter, pod scheduling, GPU workloads), and IaC (Terraform, Helm, ArgoCD). - Track record of building runbooks, on-call rotations, and incident management frameworks — not just participating in them. - Experience with observability stacks (Prometheus, Grafana, SigNoz or Datadog), CI/CD (GitLab CI, GitHub Actions), and alerting (PagerDuty, Opsgenie). - Comfort working across timezones with distributed teams (India, Vietnam, Portugal). Strong written communication — you'll be writing runbooks, RCAs, and proposals as much as Terraform. Nice to Have - Experience with AI/ML infrastructure (GPU scheduling, model serving, real-time audio/speech workloads). - Familiarity with compliance frameworks (ISO 27001, SOC 2, Vanta) in a DevOps context. - Hands-on experience with AIOps tooling, automated remediation platforms (Shoreline, Rundeck), or FinOps tools (CastAI, Kubecost). What We Offer - Flexible work setup: Remote-first for Singapore, India, Indonesia, Malaysia; hybrid model for Vietnam. - Comprehensive employee well-being benefits. - Free ELSA Premium courses to polish your language skills - Collaborative, international team culture. - Opportunity to contribute to a fast-growing, well-funded Silicon Valley startup with global impact.
L2 Cloud Operations Engineer
ScalableOSScalableOS is a premium offshoring solutions provider based in the Philippines.
• Provide second-line technical support to hedge fund and financial services clients across the US and UK • Take ownership of complex issues and drive them through to resolution • Monitor client cloud infrastructure and trading systems using enterprise monitoring tools • Manage support tickets end-to-end using Jira Service Management • Deliver expert-level desktop troubleshooting across Windows 10/11 environments • Operate and support client Microsoft Azure environments • Configure and troubleshoot SSL VPN and IPsec VPN connections for remote client access • Collaborate with L3 engineers and senior operations staff on escalated issues



