Cloud Infrastructure Engineer

Location

Worldwide

Posted

6 days ago

Salary

0

Seniority

Mid Level

Job Description

Cloud Infrastructure Engineer

Qompyl

Role Description

Qompyl is looking for a Cloud Infrastructure & DevOps Engineer to help manage, optimize, and scale the cloud infrastructure behind our mobile-first fintech platform. Working closely with the engineering team, you will support deployments, improve CI/CD processes, maintain platform reliability, and ensure our Azure environment remains secure, efficient, and cost-effective. This is a part-time, equity-only role suited for someone seeking a meaningful side engagement alongside full-time work or consulting projects.

Key Responsibilities

- Design, manage, and optimize Microsoft Azure cloud infrastructure to support the Qompyl platform and future growth.
- Deploy and maintain containerized services using Docker and Azure-native technologies.
- Develop, manage, and improve CI/CD pipelines using Azure DevOps, GitHub Actions, and related automation tools.
- Monitor infrastructure performance, availability, and reliability through logging, monitoring, and alerting systems.
- Collaborate with software engineers to support application deployments, integrations, and feature releases.
- Implement and maintain cloud security best practices, including access controls, network security, encryption, and role-based permissions.
- Monitor cloud resource utilization and proactively identify opportunities to optimize performance and reduce costs.
- Develop automation scripts and operational tooling to improve deployment efficiency and reduce manual effort.
- Design and maintain backup, disaster recovery, and business continuity processes.
- Troubleshoot infrastructure, deployment, and performance issues across cloud environments.
- Document cloud architecture, deployment procedures, operational processes, and infrastructure standards.
- Contribute to a culture of ownership, transparency, teamwork, and continuous improvement.

Qualifications

- Experience managing cloud infrastructure, preferably in Microsoft Azure.
- Experience deploying and managing containerized applications using Docker.
- Experience building and maintaining CI/CD pipelines using Azure DevOps, GitHub Actions, or similar tools.
- Strong understanding of cloud networking, security, access controls, and infrastructure best practices.
- Experience with monitoring, logging, and alerting in production environments.
- Ability to troubleshoot infrastructure and deployment issues in a cloud environment.
- Strong problem-solving skills and ability to work independently in a remote team.
- Excellent communication and collaboration skills.

Requirements

- Part-time, equity-only role suitable as a side engagement alongside a primary job or consulting work.
- The position will transition to compensated status after the MVP launch and once funding or sustainable revenue is secured.

Benefits

- Opportunity to join an early-stage mission-driven startup.
- Help build a next-generation trading platform.

More Infrastructure Engineer Jobs

Full Time | Remote | Team 1,001-5,000 | Since 2007 | H1B Sponsor

Role Description

We are seeking an experienced and driven Infrastructure Engineer to join our platform engineering team. In this role, you will be at the heart of our engineering organisation, designing and owning the infrastructure, CI/CD pipelines, observability stack, and cloud platforms that power our products. You will partner closely with development, QA, and security teams to build reliable, scalable, and secure systems, while championing a culture of automation, continuous improvement, and operational excellence.

What You Will Bring to ChargePoint

CI/CD & Automation

- Design, build, and maintain robust CI/CD pipelines that support rapid, high-quality software delivery across multiple teams and environments.
- Automate repetitive operational tasks (build, test, release, and deployment workflows) to improve speed and consistency.
- Own release management processes including versioning, tagging, rollback strategies, and progressive delivery (canary, blue-green deployments).
- Integrate automated testing, code quality gates, and security scanning into delivery pipelines.

Cloud Infrastructure & Architecture

- Architect, provision, and manage scalable cloud infrastructure across AWS, GCP, or Azure using Infrastructure-as-Code (IaC) tools such as Terraform, Pulumi, or CloudFormation.
- Design for high availability, fault tolerance, and cost efficiency; regularly review and optimise cloud spend.
- Manage multi-environment setups (dev, staging, production) with environment parity and configuration management best practices.
- Evaluate and adopt new cloud-native services that align with business and engineering goals.

Container Orchestration & Microservices

- Build and manage containerised workloads using Docker and Kubernetes (EKS, GKE, AKS, or self-managed clusters).
- Define standards for container image builds, registry management, security scanning, and image lifecycle.
- Implement and maintain service mesh, ingress, and networking solutions within Kubernetes environments.
- Drive adoption of Helm charts or Kustomize for consistent and repeatable Kubernetes deployments.

Observability & Reliability

- Build and maintain a comprehensive observability stack covering metrics, logs, and distributed tracing (e.g., Prometheus, Grafana, Datadog, ELK/EFK, Jaeger, or equivalent).
- Define SLIs, SLOs, and error budgets in partnership with engineering teams; lead SRE practices across the organisation.
- Lead incident response processes: on-call rotation, blameless post-mortems, and systematic reliability improvements.
- Proactively identify and address toil, bottlenecks, and single points of failure in the platform.

Security & Compliance

- Embed security practices into infrastructure and pipelines: secrets management (Vault, AWS Secrets Manager), least-privilege IAM, network segmentation, and vulnerability scanning.
- Ensure infrastructure compliance with relevant standards (SOC 2, ISO 27001, CIS Benchmarks, PCI-DSS where applicable).
- Collaborate with the security team on audit readiness, penetration testing remediation, and security posture improvements.
- Implement and enforce IaC security scanning (Checkov, tfsec, or similar).

Collaboration & Leadership

- Act as a technical lead and mentor for junior and mid-level DevOps/platform engineers.
- Define and document engineering standards, runbooks, architectural decision records (ADRs), and operational playbooks.
- Participate in architecture and design reviews; advocate for infrastructure and operational concerns early in the product lifecycle.
- Evaluate and manage third-party tooling and vendor relationships.

Qualifications

- 7–9 years of hands-on experience in a DevOps, Site Reliability Engineering (SRE), or Platform Engineering role.
- Strong expertise with at least one major cloud provider (AWS, GCP, or Azure), including compute, networking, storage, IAM, and managed services.
- Deep experience with CI/CD tools such as GitHub Actions, GitLab CI, Jenkins, CircleCI, or Tekton.
- Proficiency in Infrastructure-as-Code using Terraform and/or Pulumi/CloudFormation.
- Hands-on expertise with Docker and Kubernetes in production environments.
- Strong scripting and automation skills in Python, Bash, or Go.
- Solid understanding of networking fundamentals: DNS, TCP/IP, load balancing, VPNs, and CDNs.
- Experience with observability and monitoring platforms (Prometheus/Grafana, Datadog, Splunk, ELK, or equivalent).
- Familiarity with configuration management tools (Ansible, Chef, Puppet, or similar).
- Demonstrated ability to lead complex technical initiatives and mentor team members.

Requirements

- Experience with Git workflows and tools such as ArgoCD or Flux.
- Knowledge of service mesh technologies (Istio, Linkerd, Envoy).
- Exposure to FinOps practices and cloud cost optimisation tooling.
- Experience with database administration and managed data services (RDS, CloudSQL, DynamoDB, etc.).
- Familiarity with ML/AI infrastructure or data platform engineering.
- Relevant certifications: AWS Solutions Architect / DevOps Engineer, CKA/CKAD, Google Professional Cloud DevOps Engineer, HashiCorp Terraform Associate, or equivalent.
- Background in regulated or high-compliance environments (FinTech, Healthcare, eCommerce).

Location

Gurgaon/Remote

Company Description

ChargePoint is committed to fostering an inclusive workplace that welcomes and supports all qualified individuals. In alignment with this commitment, we ensure that persons with disabilities are provided with reasonable accommodations throughout the employment process. If you need a reasonable accommodation to participate in the application or interview process, to perform essential job functions, or to access any other benefits and privileges of employment, please contact us at accommodations@chargepoint.com. ChargePoint is an equal opportunity employer.

Applicants only. Recruiting agencies, do not contact.

India

Forward Deployed Engineer, Infrastructure Specialist

Cohere

At Cohere, our mission is to build machines that understand the world, and to make them safely accessible to all.

Full Time | Remote | Team 11-50 | H1B Sponsor

Role Description

This role offers a unique opportunity to shape how enterprises harness the power of AI in real-world applications. As a bridge between our core North product and our clients’ engineering teams, you’ll be at the forefront of solving complex problems and securely integrating AI into critical sectors such as finance, healthcare, and telecommunications. Our esteemed clients include industry leaders like RBC, Dell, and LG CNS.

In this role, you will:

- Lead end-to-end deployment of North in private cloud and on-premises environments, including planning, configuration, testing, and rollout.
- Partner with enterprise IT teams to assess infrastructure, security requirements, and data management practices.
- Experiment at high velocity and with a high level of quality to engage our customers and ultimately deliver solutions that exceed their expectations.
- Design and implement deployment strategies tailored to client needs, ensuring compliance with data privacy and security standards.
- Troubleshoot and resolve deployment-related technical issues, providing timely solutions to minimize downtime.

Qualifications

- You have experience with and enjoy working directly with customers.
- You have experience deploying enterprise software in private/hybrid cloud environments.
- You have proven experience administering production Kubernetes clusters and expertise with Helm.
- You are familiar with DevOps practices, CI/CD pipelines, and tools like Git for version control.
- You have strong expertise in cloud infrastructure (Azure, AWS, GCP), networking, and virtualization.
- You excel in fast-paced environments and can execute while priorities and objectives are a moving target.
- You have fluent Japanese and business-level English.

Requirements

If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider.

Benefits

- An open and inclusive culture and work environment.
- Work closely with a team on the cutting edge of AI research.
- A weekly lunch stipend of $75/£75 or equivalent in your local currency.
- Full health and dental benefits, including a separate budget for mental health.
- RRSP matching, 401K, Pension Scheme.
- 100% Parental Leave top-up for up to 6 months, for either parent.
- Annual enrichment benefits: Arts & culture, fitness/wellness, quality time, and a workspace improvement credit.
- Education & learning stipend for conferences, courses, and coaching.
- Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend.
- 6 weeks of paid vacation (30 working days!).
- Budget for traveling to other offices if you are remote, plus an annual company offsite.

Company Description

Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products that are designed to solve real-world business problems. We’re training and deploying frontier models for enterprises who are building AI systems. We believe that our work is instrumental to the widespread adoption of AI and we are looking for folks that want to be part of that. We are a global technology company co-headquartered in Toronto and San Francisco, with key offices in London, New York City, Montreal, Seoul, Germany and Paris. Join us!

Japan
Full Time | Remote | Team 1,001-5,000

Role Description

- Administer Azure and Entra ID, including identity governance and access controls
- Manage employee lifecycle processes (joiner, mover, leaver) integrated with Workday
- Implement and maintain Conditional Access, MFA, and PIM
- Maintain role-based access and least-privilege models
- Own Microsoft Intune platform (enrollment, policies, compliance, deployments)
- Standardize endpoint configurations across all sites
- Manage patching coordination and endpoint security controls
- Troubleshoot device and application deployment issues
- Support and maintain hybrid infrastructure including Azure-hosted and on-prem environments
- Administer and support Windows Server environments and Hyper‑V virtualization platforms
- Manage core services such as Active Directory, DNS, DHCP, and file services
- Monitor system performance, availability, and security across cloud and on-prem systems
- Support server lifecycle management including upgrades, migrations, and decommissioning
- Assist with workload transitions between on-prem and cloud environments where appropriate
- Support and help drive global backup and recovery strategy execution
- Ensure reliable backup coverage for critical systems across cloud and on-prem environments
- Validate backup integrity through periodic restore testing
- Partner with IT leadership to maintain recovery objectives and improve resiliency
- Maintain documentation and visibility into backup configurations and coverage
- Support and maintain site-to-site VPNs and global connectivity
- Configure and troubleshoot firewalls with a preference for SonicWall platforms
- Apply best practices for firewall rules, segmentation, and access control
- Support network standardization and secure connectivity across all sites
- Support plant automation initiatives that rely on segmented ICS (Industrial Control Systems) networks
- Work with IT and engineering teams to ensure proper separation between enterprise and ICS environments
- Assist in implementing secure connectivity models between business systems and production systems
- Support firewall layering and segmentation strategies for manufacturing environments
- Enforce security controls across identity, endpoints, and infrastructure
- Support MDR, endpoint protection, and audit readiness
- Maintain alignment with FDA, ISO 13485, and internal standards
- Act as Tier 2/3 escalation point
- Support and guide site IT teams
- Drive standardization across regions
- Develop PowerShell automation for administration and provisioning
- Maintain SOPs, runbooks, and documentation
- Support audit and operational readiness

Qualifications

- 5+ years in cloud, infrastructure, or systems engineering roles
- Strong experience with Azure, Entra ID, and Microsoft 365
- Experience supporting hybrid environments (cloud + on-prem)
- Hands-on experience with Windows Server and Hyper‑V virtualization (required)
- Experience with Microsoft Intune endpoint management
- Strong understanding of firewalls and network security (SonicWall preferred)
- Experience supporting or working alongside ICS or segmented network environments is strongly preferred
- Experience with backup systems and disaster recovery processes
- Experience with identity lifecycle management and Workday integration (preferred)
- PowerShell scripting and automation experience

Benefits

- Competitive salary and bonus compensation
- Medical, Dental, Vision, and HSA plans
- Life, AD&D, STD, LTD plans to help protect you and your loved ones
- Generous 401k plan with Employer Match
- Paid Holidays, PTO, and other leave programs to support your time off needs

United Kingdom
Job Closed

Senior Cloud Infrastructure Engineer

Helix

Helix is proud to be an equal opportunity employer, and committed to providing employment opportunities regardless of race, religious creed, color, national origin, ancestry, physical disability, mental disability, medical condition, genetic information, marital status, sex, gender, gender identity, gender expression, pregnancy, childbirth and breastfeeding, age, sexual orientation, military or veteran status, or any other protected classification, in accordance with applicable federal, state, and local laws.

Full Time | Remote | Team 51-200

Role Description

As a Senior DevOps & Infrastructure Engineer – Cloud & Automation at Helix, you will play a pivotal role in architecting and securing the critical cloud infrastructure that supports our platform. You will collaborate closely with product managers, scientists, and other engineers to innovate and evolve our cloud infrastructure while ensuring reliability, scalability, and security.

- Lead the strategic evolution of our cloud infrastructure, proactively identifying opportunities to enhance support for product teams.
- Drive innovation in cloud infrastructure strategy.
- Collaborate with product management, scientists, and engineering teams to understand intricate requirements and develop comprehensive, forward-thinking engineering designs.
- Architect and implement highly reliable and massively scalable cloud infrastructure solutions on AWS and potentially other platforms, anticipating future growth and evolving user and health system needs.
- Ensure alignment with the long-term architectural vision.
- Champion and promote Cloud Engineering best practices across teams, influencing technical direction and standards.
- Strategically lead initiatives to optimize cloud spending, demonstrating a strong understanding of budget ROI and resource utilization.
- Spearhead the development and advancement of sophisticated CI/CD pipelines for highly automated, efficient, and resilient build, test, and deployment processes.
- Define and implement comprehensive observability strategies, selecting and managing advanced tools to provide deep insights into system health and performance, and proactively predict and mitigate potential issues.
- Mentor and guide Infrastructure Engineers by providing technical expertise, constructive feedback, coaching, and support to foster their professional development and drive excellence in cloud operations.

Qualifications

- Experience integrating and utilizing AI-powered tools for proactive observability, cost optimization, and code analysis to improve cloud operations and development.
- Ability to evaluate and recommend new AI tools to enhance cloud operations and development workflows.
- A strong passion for improving people’s lives through access to better information about their DNA.
- 5+ years of progressive hands-on experience in cloud architecture, deployment, and operations, demonstrating increasing technical leadership and strategic impact.
- 5+ years of significant software development experience in one of the following: Go, Python, or a similar language, with a proven track record of designing and implementing complex cloud-native applications and infrastructure.
- 5+ years of in-depth experience in cloud computing, with at least 5 years of demonstrable expert-level experience with a broad range of AWS services and a deep understanding of cloud architectural patterns.
- Participate in a rotating on-call schedule, demonstrating ownership, providing leadership during incidents, and driving root cause analysis and preventative measures.
- Expert-level sysadmin skills in a Linux environment, including mastery of Bash and other scripting languages for advanced automation, diagnostics, and performance tuning.
- Extensive experience in designing and implementing infrastructure-as-code tooling/frameworks (e.g., Terraform and CDK) at scale, including defining best practices and standards.
- A strong affinity for and proven ability to champion an engineering culture that emphasizes Agile methodologies, DevOps principles, and continuous delivery practices across teams.
- Exceptional communication and interpersonal skills, with the proven ability to articulate complex technical concepts clearly and persuasively to diverse audiences, including executive stakeholders.

Requirements

- Direct experience with CircleCI, Rollbar, and PagerDuty.
- Familiarity with full-stack development.
- Familiarity with regulated software systems and entities.
- AWS or other cloud certifications.
- Multi-cloud experience (AWS, GCP, Azure).
- BS+ in Computer Science; coursework in genetics or bioinformatics.

Benefits

- Comprehensive Health Insurance with Date of Hire eligibility.
- Above average employer paid premium coverage.
- 12 weeks Helix Paid Parental Leave option.
- 401(k) with employer matching of up to 3% and 100% Vesting on the Date of Hire.
- Comprehensive Well-Being Benefits.
- Flexible PTO.
- Remote options for many roles and a home office stipend.

Expected Interview Process

- Initial Screen
- Second Screen
- Technical Screen
- Final Loop

Expected Pay For This Role

- Expected Helix Base: $148,000 - $190,000.
- Expected Helix Discretionary Annual Bonus: 10% of your annual salary.
- Equity: We offer generous equity at Helix. If you receive a Helix offer, your recruiter will book dedicated time with you to educate you on our equity model.

What To Expect During Your Onboarding Week

We want to make sure your first week is spent learning the foundations. You will be invited to live orientation sessions and spend time with your hiring manager developing your individual roadmap to success. This experience will include sessions that cover your benefits, security & privacy training, an introduction to the OKR framework used at Helix, and our communication tools and norms.

What To Expect During Your First 90 Days

- First 30 days: you’ll spend time learning the Helix way, completing training and onboarding for your role, and getting introduced to your team and relevant stakeholders. You’ll also gain a deeper understanding of our customers, our products, the impact we make in the lives of our communities, and how to thrive at Helix through participation in Helix U.
- Days 30-60: you’ll spend time contributing to projects, deeply familiarizing yourself with team and company processes, and developing a deeper understanding of Helix’s products, services and capabilities.
- Days 60-90: you’ll build your OKRs with your manager, start to take ownership of projects and initiatives on your team, and begin to demonstrate your impact on the Helix mission.

United States
$148K - $190K / year