UCLA Health System

UCLA Health System provides top-tier healthcare and cutting-edge medical technology to the Los Angeles, California, region and beyond. The academic medical cent

Cloud Engineer

Location

California

Posted

23 hours ago

Salary

$128.5K - $298.1K / year

Seniority

Senior

Professional Certificate

Job Description

Cloud Engineer

UCLA Health System

Title: Cloud Engineer
Location: Los Angeles, United States
Work Location: Los Angeles, CA, USA
Flexible Hybrid

Job Description

Primary Duties and Responsibilities

The Cloud Engineer will design, build, and operate infrastructure and applications supporting UCLA Health’s Analytics Platform across both on-premises and multi-cloud environments (AWS, Azure, GCP). This role focuses on enabling secure, scalable AI/ML and GenAI platforms, with an emphasis on automation, reliability, and compliance in a regulated healthcare setting.

Key Responsibilities

- Design, implement, and manage cloud and hybrid infrastructure supporting analytics and AI/ML workloads
- Build and operate MLOps capabilities, including:
  - Model training and inference platforms
  - Model and artifact management
  - CI/CD and deployment pipelines
  - Observability and monitoring solutions
  - Cost optimization controls
- Develop and maintain automation and infrastructure-as-code (IaC) solutions for provisioning and configuration
- Troubleshoot and resolve complex system and environment issues across cloud and on-prem platforms
- Establish platform guardrails to ensure secure, reliable, and compliant operations
- Collaborate with cross-functional teams to:
  - Gather requirements
  - Design and prototype solutions
  - Implement and test deployments
  - Support ongoing operations and enhancements
- Apply security, privacy, and governance controls aligned with healthcare data regulations
- Execute release, deployment, and configuration management processes

What You’ll Bring

- Strong background in cloud engineering and platform operations
- Experience with multi-cloud environments (AWS, Azure, GCP)
- Proficiency in:
  - Automation, scripting, and infrastructure-as-code
  - CI/CD pipeline development and optimization
  - Monitoring and observability tools
- Experience supporting AI/ML or data platform workloads (preferred)
- Ability to troubleshoot complex systems and drive solutions independently
- Strong collaboration skills and the ability to translate business requirements into technical solutions

Salary Range: $128,500 - $298,100 annually.

Job Qualifications

• BS/MS in Computer Science (or equivalent)
• AWS Certified Cloud Engineer, Architect, Administrator Certifications required
• 7+ years of advanced knowledge and experience as an AWS Cloud Engineer in all core services and offerings. AWS experience a plus
• 15+ years of advanced knowledge and experience of Microsoft technologies such as Windows Server and Linux-based servers, enterprise system support experience, and a strong background in systems engineering and administration for both operating systems
• 15+ years of advanced knowledge and experience with enterprise-scale Windows technologies such as Server platforms, Desktop platforms, Exchange Environments, Active Directory, IIS, Windows Clustering, Virtualization and Collaboration tools. AWS Certification or equivalent experience preferred
• Working knowledge of DevOps-like work or experience in a real-time operational role
• Advanced knowledge of analytics and AI/ML platform services across AWS, Azure, and GCP (e.g., AWS SageMaker/Bedrock, Azure Machine Learning/Azure OpenAI, Google Vertex AI) and how to operate them securely at enterprise scale.
• Experience enabling teams to build and deploy ML/AI solutions by providing reusable platform capabilities (reference architectures, templates, SDK/CLI standards, self-service onboarding, and guardrails) rather than only project-specific implementations.
• Hands-on experience operationalizing ML/AI workloads on cloud platforms (AWS/Azure/GCP): managed training/inference, batch vs real-time serving, feature/metadata management, model registry, and cost/performance optimization.
• Strong MLOps/platform engineering experience: CI/CD for ML and GenAI, automated validation gates, reproducible pipelines, environment promotion, artifact/version management, and production monitoring (drift, data quality, latency, cost) using cloud-native and/or enterprise tooling (e.g., Azure DevOps/GitHub Actions, SageMaker Pipelines, Vertex AI Pipelines, MLflow, Terraform).
• GenAI platform experience (AWS/Azure/GCP): deploying and governing LLM applications using managed services (e.g., Bedrock/Azure OpenAI/Vertex AI), RAG architectures, embeddings and vector databases/search, prompt/version management, and evaluation/guardrails for safety and groundedness.
• Responsible AI + governance experience for regulated environments: PHI/PII protections, access controls, encryption and key management, audit logging, model/endpoint risk assessments, bias/fairness considerations, and policy enforcement aligned to HIPAA and secure SDLC.
• Strong data engineering foundations that support AI platforms: standardized data ingestion/ETL/ELT, data quality/lineage, dataset and feature pipeline design, schema/version management, and integration with lake/lakehouse platforms (e.g., S3/ADLS/GCS with Spark/Databricks/BigQuery/Synapse) for feature and training data readiness.
• Experience operating scalable training/inference platforms (GPU/accelerated workloads): capacity planning/quotas, cluster or managed compute configuration, distributed training concepts, performance tuning, and chargeback/showback in cloud environments.

As a condition of employment, the final candidate who accepts an offer of employment will be required to disclose if they have been subject to any final administrative or judicial decisions within the last seven years determining that they committed any misconduct; or have filed an appeal of a finding of substantiated misconduct with a previous employer.

More Cloud Engineer Jobs

Title: Principal Genesys Cloud PS Consultant (No Visa Sponsorship)
Location: Philadelphia, PA, USA
Employees can work remotely
Full-time

Company Description

Miratech helps visionaries change the world. We are a global IT services and consulting company that brings together enterprise and start-up innovation. Today, we support digital transformation for some of the world's largest enterprises. By partnering with both large and small players, we stay at the leading edge of technology, remain nimble even as a global leader, and create technology that helps our clients further enhance their business. We are a values-driven organization and our culture of Relentless Performance has enabled over 99% of Miratech's engagements to succeed by meeting or exceeding our scope, schedule, and/or budget objectives since our inception in 1989.

Miratech has coverage across 5 continents and operates in over 25 countries around the world. Miratech retains nearly 1000 full-time professionals, and our annual growth rate exceeds 25%.

Job Description

We are looking for a Principal Genesys Cloud Professional Services Consultant to join our team and drive the delivery of innovative, enterprise-grade contact center solutions. In this role, you will lead the design and implementation of advanced multi-channel customer experience platforms, leveraging deep expertise in Genesys Cloud and modern contact center technologies. The ideal candidate combines strong technical leadership, hands-on delivery experience, and the ability to translate complex business requirements into scalable, high-quality solutions.

Work Authorization: Candidates must have valid authorization to work in the United States at the time of application. Visa sponsorship is not available for this role.

Responsibilities:

- Serve as the primary technical point of contact for customers and internal stakeholders, providing expert consultancy on Genesys Cloud solutions.
- Lead the design and delivery of end-to-end technical solutions, including facilitating workshops, gathering requirements, and producing technical design documentation with stakeholder sign-off.
- Analyze complex business and technical challenges to define scalable and effective solution architectures.
- Own and evolve the technical architecture of Genesys solutions, introducing new technologies and best practices where appropriate.
- Configure and optimize Genesys Cloud environments to align with changing business and operational needs.
- Lead the planning and execution of contact center deployments, migrations, and system upgrades.
- Act as the main escalation and coordination point for Genesys-related matters, collaborating with internal teams and external vendors.
- Support production readiness activities and ensure smooth transition of contact center solutions into live environments.

Qualifications

- 5+ years of hands-on experience supporting and optimizing large, multi-site, complex contact center environments.
- Experience in designing, implementing, and supporting Genesys Cloud solutions in enterprise settings.
- Proven expertise in building routing strategies and IVR workflows using Genesys Architect.
- Experience with CRM integration (e.g., Salesforce, MS Dynamics) with the Genesys Cloud environment.
- Strong background in infrastructure planning, solution design, deployment, and lifecycle management to ensure high availability and performance.
- Solid understanding of business processes and their alignment with customer experience technologies.
- Deep knowledge of SIP infrastructure, including SIP protocol, Session Border Controllers (SBC), and load balancing approaches.

Nice to Have

- Knowledge of Genesys Cloud Workforce Management (WFM): provide expert guidance and best practices to optimize workforce management processes within the Genesys Cloud environment.
- Genesys Cloud Training: conduct training sessions for various roles, including supervisors, administrators, users, and architects, to enhance their understanding and effective use of Genesys Cloud features.
- Experience with CX Cloud from Genesys and Salesforce.

Additional Information

We offer:

- Competitive Pay and Benefits: enjoy a comprehensive compensation and benefits package, including health insurance, and a relocation program.
- Work From Anywhere Culture: make the most of the flexibility that comes with remote work.
- Growth Mindset: reap the benefits of a range of professional development opportunities, including certification programs, mentorship and talent investment programs, internal mobility and internship opportunities.
- Global Impact: collaborate on impactful projects for top global clients and shape the future of industries.
- Welcoming Multicultural Environment: be a part of a dynamic, global team and thrive in an inclusive and supportive work environment with open communication and regular team-building company social events.
- Social Sustainability Values: join our sustainable business practices focused on five pillars, including IT education, community empowerment, fair operating practices, environmental sustainability, and gender equality.

* Miratech is an equal opportunity employer and does not discriminate against any employee or applicant for employment on the basis of race, color, religion, sex, national origin, age, disability, veteran status, sexual orientation, gender identity, or any other protected status under applicable law.

Pennsylvania

Storage & Cloud Engineer

NTT DATA Services

NTT DATA is a $30 billion business and technology services leader, serving 75% of the Fortune Global 100. We are committed to accelerating client success and positively impacting society through responsible innovation. We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI, cloud, security, connectivity, data centers, and application services. Our consulting and Industry solutions help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have experts in more than 50 countries. We also offer clients access to a robust ecosystem of innovation centers as well as established and start-up partners. NTT DATA is a part of NTT Group, which invests over $3 billion each year in R&D.

Role Description

We are currently seeking a Remote Storage & Cloud Engineer to join our team in Guadalajara, Jalisco (MX-JAL), Mexico (MX).

We are looking for a highly skilled and proactive Senior Storage & Cloud Engineer with deep expertise in Unix systems, distributed storage technologies, and cloud platforms. The role involves designing, managing, and decommissioning storage solutions across hybrid/cloud environments, while working closely with application development teams and leadership stakeholders. The ideal candidate will have strong technical depth, excellent communication skills, and the ability to manage work independently in a structured, process-driven environment.

Key Responsibilities

- Storage Architecture & Management
  - Design, implement, and manage enterprise storage solutions including:
    - NAS (Network Attached Storage)
    - NFS (Network File System)
    - Object storage platforms (e.g., S3-compatible solutions)
  - Perform storage capacity planning, performance tuning, and optimization.
  - Manage data lifecycle (provisioning, migration, archival, and decommissioning).
  - Ensure high availability, scalability, and resilience of storage platforms.
- Unix/Linux Administration
  - Administer and troubleshoot Unix/Linux systems supporting storage environments.
  - Perform filesystem management, mount configurations, NFS exports, and permissions.
  - Optimize OS-level performance for storage-intensive workloads.
- Cloud & Hybrid Infrastructure
  - Design and manage storage solutions across public cloud platforms (AWS/Azure/GCP).
  - Work with cloud-native storage services such as:
    - AWS S3 / EBS / EFS
    - Azure Blob / Files
    - GCP Cloud Storage
  - Lead or support migration of on-prem storage to cloud-based platforms.
- Kubernetes & Container Storage
  - Implement and manage Kubernetes storage constructs:
    - Persistent Volumes (PV), Persistent Volume Claims (PVC)
    - Storage Classes
    - CSI (Container Storage Interface) drivers
  - Work with container-native storage solutions (e.g., Ceph, Portworx, OpenEBS).
  - Ensure reliable storage for stateful workloads running in Kubernetes clusters.
- Decommissioning & Rationalization (Decom Plans)
  - Plan and execute storage and infrastructure decommissioning activities.
  - Collaborate with application development teams and leadership to:
    - Identify dependencies
    - Validate data migration or archival
    - Ensure zero data loss
  - Lead discussions and present decommissioning strategies clearly to stakeholders.
- Monitoring, Troubleshooting & Optimization
  - Monitor storage utilization, performance metrics, and system health.
  - Troubleshoot issues related to latency, throughput, and data availability.
  - Use tools like Splunk, Prometheus, Grafana, or native cloud monitoring services.
- Project Execution & Ownership
  - Independently manage assigned workstreams following defined processes.
  - Track deliverables, risks, and dependencies.
  - Ensure timely execution of storage-related initiatives and migrations.

Qualifications

- Expert-level knowledge of Unix/Linux systems
- Strong experience with:
  - NAS, NFS, and distributed/object storage systems
- Solid understanding of:
  - Cloud computing platforms (AWS/Azure/GCP)
  - Storage services and architecture in cloud environments
- Good working knowledge of:
  - Kubernetes and container orchestration
  - Persistent storage in containerized environments
- Familiarity with automation tools (Shell, Python, Ansible, Terraform) is a plus

Requirements

- 7+ years of experience in:
  - Storage systems AND Unix/Linux administration
- 3+ years of experience in:
  - Cloud platforms and container ecosystems (Kubernetes preferred)
  - Storage migration, rationalization, or decommissioning projects
- Experience with enterprise storage vendors (NetApp, Dell EMC, etc.)
- Knowledge of backup & disaster recovery solutions
- Certifications (optional but preferred):
  - AWS / Azure Cloud certifications
  - Kubernetes certifications (CKA/CKAD)
  - Storage-specific certifications

Benefits

- Negotiable salary
- Grocery Tickets 12% of base salary
- Saving fund
- 30 days of Christmas bonus
- 50% Vacation bonus
- Medical insurance (You and your family)
- Life insurance
- Permanent home office

Mexico
Full Time | Remote | Team 10,001+ | H1B Sponsor

Role Description

The Principal Cloud Engineer is a senior technical leader who pairs architectural vision with hands-on engineering execution. You will design and build the reference architectures, patterns, and standards that support Greystar’s global operations, then implement those designs yourself using modern cloud practices. This is not a role that stops at diagrams. This is the role that proves out patterns through working POCs, then enables the platform team to operationalize and scale them.

You will partner closely with the Data, Digital, and AI (D2AI) team, where much of Greystar’s active cloud development happens today, while keeping a broader, enterprise-wide perspective. Much of that work is AI and data intensive, so a core part of this role is building the cloud foundation that Greystar’s AI and ML workloads run on, from accelerated compute and model serving to the Databricks data plane that powers them.

We are looking for someone who thinks in cloud concepts rather than being tied to any single provider. The ideal candidate understands foundational principles - IAM, virtual networking, DNS, load balancing, compute, and storage - and can translate those concepts fluidly across Azure, AWS, or any platform our growing portfolio demands.

This is an individual contributor role with a strong mentoring component. You will guide and elevate systems administrators and engineers across the team, helping them grow into more strategic thinkers themselves.

In addition to your resume, all candidates are required to include a short video (2–5 min) demonstrating how you have used AI tools in your engineering workflow: code generation, debugging, architecture, documentation, or similar. We recommend recording with Loom (free) or uploading as an unlisted YouTube video. Please embed this link at the top of your resume. Applications without a video link will not be reviewed.

Qualifications

- Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent professional experience.
- 10+ years of progressive experience in cloud, infrastructure, or platform engineering, with at least 3 years in a senior or principal-level role.
- Strong conceptual mastery of cloud computing fundamentals - IAM, VNets/VPCs, load balancing, DNS, compute, storage.
- Deep hands-on experience with Azure private networking: hub-and-spoke topology, Private DNS zones, private endpoints, and network security controls.
- Hands-on experience with Terraform or equivalent Infrastructure as Code tools (Bicep, CloudFormation).
- Proficiency with Git-based version control and CI/CD pipeline design and management.
- Experience with containerization technologies, particularly Docker.
- Demonstrated ability to mentor and influence engineering teams without direct management authority.
- Excellent communication skills with the ability to translate complex technical concepts and architectural decisions for diverse audiences.

Requirements

- Design and build reference architectures, design patterns, and technology standards that ensure consistency, security, and scalability across all environments.
- Design, implement, and continuously improve Greystar’s private networking architecture, including hub-and-spoke topology, private endpoint strategy, DNS architecture, and network segmentation standards across all cloud environments.
- Design and build Greystar’s container platform, delivering reference patterns for AKS and containerized workloads.
- Design and implement cloud infrastructure solutions across multi-cloud environments (Azure, AWS), with an emphasis on reliability, security, and cost-efficiency.
- Design and implement the infrastructure patterns for AI workloads and the Databricks data plane.
- Partner closely with application development teams to help them move fast and safe.
- Build and curate paved-road solutions: reference implementations, Terraform modules, pipeline templates, and starter architectures.
- Champion and advance Infrastructure as Code (IaC) practices using Terraform.
- Implement and maintain reliability standards the platform team operates against.
- Build and maintain governance frameworks for cloud decisions.
- Mentor and coach cloud and platform engineers.

Benefits

- Competitive Medical, Dental, Vision, and Disability & Life insurance benefits.
- Generous Paid Time off: 15 days of vacation, 4 personal days, 10 sick days, and 11 paid holidays.
- 6-Week Paid Sabbatical after 10 years of service (and every 5 years thereafter).
- 401(k) with Company Match up to 6% of pay after 6 months of service.
- Paid Parental Leave and lifetime Fertility Benefit reimbursement up to $10,000.
- Employee Assistance Program.
- Critical Illness, Accident, Hospital Indemnity, Pet Insurance and Legal Plans.
- Charitable giving program and benefits.

United States
$160K - $190K / year
Full Time | Remote | Team 10,001+ | Since 1954 | H1B Sponsor

Role Description

Advance your career while shaping the future of cloud operations that support GDIT enterprise environments. Step into the role of Azure Cloud Engineer at GDIT, where technologists are empowered to grow meaningful careers while continually evolving their skills.

As an Azure Cloud Engineer, the work you do at GDIT directly strengthens the reliability, security, and scalability of GDIT’s cloud ecosystem. In this role, you will play a pivotal part in ensuring our operations are supported by resilient, high-performing Azure and Oracle cloud services, enabling rapid delivery, secure operations, and continuous modernization.

- Support Azure-focused cloud engineering initiatives, ensuring stable, secure, and compliant cloud operations that directly enable organizational mission success.
- Collaborate with cross-functional teams including cybersecurity, application, and infrastructure partners to ensure cloud platforms align with mission and operational requirements.
- Drive innovation in automation, cloud optimization, and operational efficiency to enhance service delivery, reduce downtime, and quickly resolve mission-impacting challenges.
- Utilize advanced cloud engineering tools and technologies such as Azure CLI, PowerShell, IaC frameworks, Kubernetes, Oracle Cloud services, and supporting AWS capabilities to modernize and sustain the cloud environment.

Qualifications

- Education: Bachelor’s degree NOT required. An additional 4 years of equivalent experience OR an Azure Certification may substitute.
- Experience: 8+ years of IT engineering experience, or 12+ years in lieu of a degree.
- 5+ years hands-on experience managing Azure cloud operations.
- Technical skills: Strong understanding of cloud architecture, security, and automation.
- Experience with PowerShell, Python, JSON, YAML, Git, and cloud-native CLIs.
- Experience working with serverless, Docker, and Kubernetes.
- Windows and Linux OS experience.
- Preferred Certs and Experience: Demonstrated experience with Oracle Gov Cloud Infrastructure (OCI), and AWS Gov infrastructure experience/certification.
- US Person required.
- Role requirements: Excellent communication and documentation skills. Ability to work in an on-call rotation as required.

Benefits

- AI-powered career tool that identifies career steps and learning opportunities.
- An internal mobility team focused on helping you achieve your career goals.
- Comprehensive benefits and wellness packages, 401K with company match, and competitive pay and paid time off.
- Full-flex work week to own your priorities at work and at home.
- Award-winning culture of innovation and a military-friendly workplace.

Company Description

We are GDIT. A global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 26,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.

United States
$129.8K - $155.3K / year