Engineering new possibilities with platforms, data, and generative AI
Principal Cloud Infrastructure Architect
Location
United States
Posted
141 days ago
Salary
$200K - $220K / year
Seniority
Lead
Job Description
Principal Cloud Infrastructure Architect
Egen
• Act as a trusted technical advisor to the teams, customer executives, and technical leadership, translating complex technical roadmaps into clear business value and risk assessments • Define the multi-year, enterprise-wide strategy for cloud adoption, migration, and modernization, with a strong focus on GCP services and strategic interoperability with the second cloud platform (Pref AWS) • Establish and enforce global cloud governance frameworks, architectural standards, reference models, and reusable patterns for deployment, security, and operations across all environments • Drive advanced DevOps practices and accountability models • Oversee cloud cost optimization initiatives, and establish technical guardrails to ensure maximum ROI across the multi-cloud footprint. • Architect and lead the implementation of complex, large-scale solutions on GCP, with hands-on experience in leveraging services like GKE, Cloud Dataflow, BigQuery, Cloud Run, Vertex AI, and Cloud Spanner • Design and govern the integration patterns for hybrid and multi-cloud systems, focusing on low-latency, secure connectivity, and consistent Identity and Access Management (IAM) across platforms (e.g., using federated identity or centralized key management) • Define the DevSecOps and cloud security posture, ensuring that designs meet stringent regulatory (e.g., data residency) and global compliance requirements (e.g., GDPR, HIPAA) • Lead Proof-of-Concept (PoC) initiatives for emerging technologies like Generative AI (e.g., Google Vertex AI or AWS Bedrock), serverless computing, and define their eventual architecture and integration into the enterprise landscape • Architect a comprehensive visibility strategy to monitor system health and proactively resolve performance bottlenecks across complex cloud environments • Design automated governance guardrails that ensure every deployment is secure, compliant, and cost-optimized by default. • Mentor, coach, and grow a group of Senior/Lead Cloud Engineers, fostering a culture of engineering excellence, ownership, and continuous learning • Own and drive continuous improvement of the cloud architecture delivery pipeline, promoting GitOps and end-to-end automation across the development lifecycle • Collaborate with the Data & AI Team and Engineering Leads to ensure architectural decisions align with product roadmaps and business domain goals • Elevate the organization’s technical brand by authoring engineering blogs and whitepapers, and representing the company as a speaker at industry conferences and meetups.
Job Requirements
- Experience: At least 5-10 years in Cloud Architecture leadership roles managing enterprise-scale transformations
- GCP as a primary platform: Mastery of GCP services, networking, and security
- Second Cloud: Deep architectural experience in AWS (e.g., EC2, Lambda, S3, RDS, EKS) or Azure
- Certifications: Google Cloud Professional Cloud Architect (Preferred), Professional/Expert-level certification from the secondary cloud provider (e.g., AWS Certified Solutions Architect - Professional)
- Infrastructure as Code (IaC) using Terraform (including multi-cloud modules)
- At least one programming language (Java/Python)
Related Guides
Related Categories
Related Job Pages
More Infrastructure Engineer Jobs
Senior Data Infrastructure Engineer
Collaborative RoboticsCollaborative Robotics' mission is to create a world where humans and robots collaborate in a trusted partnership.
• Own the full ingestion path from edge to cloud, ensuring robot telemetry, sensor data, and warehouse events are reliably captured, transported, and made available for downstream systems. • Design, build, and operate scalable pipelines and foundational data layers (streaming and batch) that deliver low-latency, reliable data for analytics, AI/ML, and product features. • Build and maintain ingestion pipelines from object storage (e.g., S3) into Databricks, including raw → staged → analytics-ready layers, supporting both streaming and batch workloads. • Own the reliability and CI/CD of the data warehouse and foundational data layers, enabling safe, repeatable deployment of schema changes, transformations, and infrastructure that analytics engineers depend on. • Implement observability, monitoring, and data quality checks to ensure pipeline correctness, detect failures or drift, and maintain trust in data used by Vista, Portal, and Scoutmap. • Scale and optimize multi-tenant data infrastructure, balancing performance, reliability, and cost-efficiency as Cobot’s customer base and data volume grow. • Collaborate directly with robotics, AI/ML, product, and analytics teams to translate product requirements into resilient data systems that unlock customer-facing features. • Establish and enforce best practices for data engineering, reliability, security, and CI/CD across ingestion, staging, and warehouse layers—owning the foundations while enabling analytics engineers to ship metrics, marts, and dashboards efficiently.
Site Reliability Engineer
ProArchConsulting and technology- enabled by cloud, guided by data, fueled by apps, and secured by design.
Role Description ProArch is looking for a passionate and skilled Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. You will collaborate with various teams to optimize production environments, troubleshoot performance issues, and implement best practices for service reliability. Your contributions will be critical to improving system uptime and enhancing user satisfaction. - Monitor system performance and reliability, ensuring uptime meets organizational SLAs. - Implement and maintain observability tools to gather metrics and logs for proactive issue detection. - Troubleshoot and resolve complex production issues across various components of our infrastructure. - Collaborate with software engineering teams to design and implement scalable, fault-tolerant architectures. - Develop and maintain automation scripts for deployment, monitoring, and system management. - Participate in on-call rotation to respond to production incidents and perform root cause analysis. - Contribute to capacity planning and performance tuning to ensure optimal resource utilization. - Document infrastructure, processes, and incident responses to promote knowledge sharing. Qualifications - 8+ years of experience as a Site Reliability Engineer, DevOps Engineer, or related role. - Strong experience with cloud providers such as AWS, Azure, or GCP. - Proficiency in scripting languages such as Python, Bash, or Go. - Experience with container orchestration tools like Kubernetes. - Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI). - Experience in Snowflake. - Account Admin expertise for Snowflake. - Solid understanding of networking and security principles. - Experience with monitoring and logging tools such as Prometheus, Grafana, or ELK stack. - Excellent problem-solving skills and a proactive attitude. - Strong communication and teamwork skills, with an emphasis on collaboration. Preferred Qualifications - Experience with Infrastructure as Code (IaC) tools such as Terraform or CloudFormation. - Knowledge of service mesh architectures and modern microservices patterns. - Background in software development and familiarity with Agile methodologies.
Senior Infrastructure Engineering Manager – FedHealth, Platform
NavaBuilding simple, effective government services. Want to contribute? We're hiring!
• Guide and develop a team of 10-12 platform engineers by providing coaching, feedback, and growth opportunities, setting clear goals, managing performance, and ensuring accountability • Foster a positive, inclusive culture, support employee well-being, and lead by example, while aligning team efforts with organizational goals, removing obstacles, and enabling the team to achieve results effectively • Work closely with development, operations, security, and architect engineers to identify needs, incorporate feedback, and deliver scalable and secure platform solutions • Contribute directly to the design, architecture and implementation of platform features and capabilities • Manage platform engineering budgets, coordinate resource allocation, and optimize costs for cloud and on-premise infrastructure • Define and enforce platform engineering standards, documentation, automated workflows, and compliance requirements • Ensure thorough documentation and facilitate knowledge sharing across technical teams, promoting internal awareness and adoption of platform solutions across Nava
Cloud Infrastructure Engineer – GCP
EgenEngineering new possibilities with platforms, data, and generative AI
• Implement cloud-based IaC solutions • Develop and implement automation to support continuous delivery and continuous integration solutions • Use GCP services to deploy highly available, scalable, and secure applications • Implement workflows to automate the release and upgrade process for applications in Development, Test, and Production environments. • Implement secure integrations using GCP security and networking technologies • Administration and engineering of IAM user Role-Based Access Controls and processes • Create and update support documentation and standards. • Develop automated methodologies for deployment activities, configuration management, supporting systems, and business processes. • Investigate and contribute to solving various issues in production environments.



