Oracle

Oracle, headquartered in Austin, Texas, is a global leader in computing solutions. The company specializes in database management systems, cloud-engineered systems, and enterprise

Principal Infrastructure & Site Reliability Engineer

Location

United States

Posted

38 days ago

Salary

$86.4K - $199.5K / year

Seniority

Lead


Job Description

Role Description

Join Oracle's Health Data Intelligence (HDI) team as a Software Engineer 4, focused on Site Reliability Engineering for large-scale healthcare analytics platforms. In this role, you will:

- Design, build, and operate highly reliable, scalable infrastructure and data pipelines that power mission-critical analytics globally.
- Contribute to the next evolution of cloud operations by advancing automation, observability, and AI-assisted reliability practices.
- Explore the use of Generative AI and intelligent automation to improve incident response, system resilience, and operational efficiency.
- Work within a collaborative team to deliver robust solutions that handle massive datasets with precision and performance.
- Continuously improve system reliability and operational excellence.

U.S. citizenship is required for this position, as the successful candidate will be required to obtain (and maintain) a U.S. government security clearance after hire.

Qualifications

- Experience building and operating high-availability, fault-tolerant systems.
- Strong understanding of distributed systems, performance monitoring, and resiliency patterns.
- Experience with incident response, root-cause analysis, and production troubleshooting.
- Hands-on experience applying Generative AI or Agentic AI (e.g., LangChain, AutoGPT, custom agents).
- Ability to design or integrate AI-driven workflows for operational efficiency and reliability.
- Familiarity with building or integrating autonomous agents for DevOps/SRE use cases.
- Strong experience with multi-cloud environments (OCI, AWS/Azure).
- Deep understanding of cloud infrastructure design, deployment, and resource optimization.
- Experience managing hybrid or cross-cloud architectures.
- Advanced competency in CI/CD pipelines (Jenkins, Kubernetes).
- Infrastructure as Code (Terraform).
- Observability tools (Prometheus, Grafana).
- Strong focus on automation-first operations.
- Proficiency in Data Warehousing platforms (e.g., Vertica, Snowflake).
- Experience with ETL frameworks and large-scale data processing.
- Understanding of columnar storage systems.
- Experience supporting or integrating BI tools (Tableau, Power BI, Oracle Analytics).
- Strong proficiency in Python, Java, or Go.
- Experience with Docker, Kubernetes, and shell scripting.
- Strong troubleshooting skills with ability to perform root-cause analysis.
- Experience resolving complex production issues in distributed systems.

Benefits

- Competitive benefits that support our people with flexible medical, life insurance, and retirement options.
- Encouragement for employees to give back to their communities through volunteer programs.

Company Description

Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. With AI embedded across our products and services, we help customers turn that promise into a better future for all.

True innovation starts when everyone is empowered to contribute. We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law.

More DevOps Engineer Jobs

Full Time | Remote | Team 11-50 | H1B No Sponsor

• Build and maintain testnets and test automation for components of the Monad blockchain
• Prototype new deployment patterns across validator and full-node topologies
• Design and run performance benchmarks and load tests to surface bottlenecks before they reach production
• Drive observability, alerting, and security across the stack
• Define best practices for node operators and support them
• Develop tooling for internal and external use
• Participate in on-call rotation for production support

United States
Job Closed

Site Reliability Engineer

LineTen

LineTen is a cloud-based, SaaS technology platform that enables businesses to aggregate technical transactions.

DevOps Engineer | 38 days ago
Full Time | Remote | Team 51-200 | H1B No Sponsor

• Ensure global coverage of our products.
• Responsible for ensuring all engineering teams have a first-class development experience.
• Drive rollout of Docker/Kubernetes across all engineering workstations.
• Ensure a top-notch observability setup is in place.
• Provide engineering support across all products.
• Work with the Architecture team on product development direction.
• Participate in post-incident reviews.

Malaysia

DevOps Mid

NEORIS

NEORIS is a Digital Accelerator that helps companies step into the future.

DevOps Engineer | 38 days ago
Full Time | Remote | Team 1,001-5,000 | H1B No Sponsor

NEORIS, now part of EPAM, is a Digital Accelerator that helps companies step into the future, with more than 20 years of experience as the Digital Partner of some of the world's most important companies. We are more than 4,000 professionals in 11 countries, with a multicultural, startup-style culture where we foster innovation, continuous learning, and the creation of high-impact solutions for our clients.

We are looking for: DevOps Semi Senior - Mid!

Main responsibilities:
• Implement, optimize, and maintain Cloud Native environments, ensuring scalability, security, and performance.
• Manage and automate CI/CD pipelines using GitHub, GitLab, or Jenkins.
• Design, deploy, and manage container-based solutions.
• Configure and maintain observability tools, ensuring end-to-end monitoring with Dynatrace, Prometheus, and Grafana.
• Collaborate with development and architecture teams to define DevOps best practices.
• Participate in the continuous improvement of processes, automation, and technical standards.

Requirements:

Required:
• A minimum of 2 to 4 years of experience in DevOps roles (semi-senior level).
• Solid knowledge of Cloud Native practices and container management.
• Experience applying observability tools such as Dynatrace, Prometheus, and Grafana.
• Proficiency with GitHub or GitLab and experience with Jenkins pipelines.
• Knowledge of quality and security tools such as Kiuwan.

Desirable:
• Experience with Kubernetes or other container orchestrators.
• DevOps or Cloud certifications.
• Experience in high-availability environments or digital transformation projects.
• Knowledge of advanced automation and Infrastructure as Code.

We offer:
• Permanent contract with a competitive salary.
• Flexible working arrangements and the possibility of remote work.
• Personalized career plan and continuous training (certifications, English, etc.).
• Participation in stable projects with a strong technical component.
• Flexible hours and a focus on work-life balance.
• Social benefits adapted to your needs.

We invite you to get to know us at http://www.neoris.com, Facebook, LinkedIn, Twitter, or Instagram: @NEORIS.

#LI-MO1

Spain

Senior DevOps Engineer

Tekhqs

TekHQS is a global technology and AI-driven solutions company delivering scalable SaaS, Cloud, AI/ML, Blockchain/Web3, DevOps, and enterprise software solutions to startups and enterprise clients worldwide. With a team of 300+ professionals across the USA, UK, UAE, Qatar, Pakistan, and India, we specialize in building high-performance digital products across Logistics, FinTech, Healthcare, and emerging technology sectors. At TekHQS, we foster a culture of innovation, ownership, and continuous growth, empowering our teams to build impactful technology that drives real business transformation.

DevOps Engineer | 38 days ago
Full Time | Remote | Team 201-500

About the Role

We are seeking a skilled DevOps Engineer to strengthen our infrastructure, automation, and CI/CD capabilities across multiple projects. The ideal candidate will drive automation, streamline CI/CD pipelines, and ensure reliable deployments across development and production environments. This role requires strong hands-on experience with AWS and/or Azure, containerization, orchestration tools, infrastructure as code, and continuous integration systems.

Key Responsibilities

Infrastructure & Cloud Management
- Design and manage cloud infrastructure on AWS and Azure.
- Deploy and maintain services such as EC2/VMs, S3/Blob Storage, RDS/Azure SQL, VPC/VNet, IAM/Azure AD, Load Balancers, and related services.
- Ensure high availability, scalability, and security of production systems.

Containerization & Orchestration
- Build and manage Docker containers.
- Deploy and maintain Kubernetes clusters (EKS/AKS preferred).
- Optimize container orchestration for performance and cost efficiency.

CI/CD & Automation
- Design and maintain CI/CD pipelines using Jenkins (experience with Azure DevOps is a plus).
- Automate build, test, and deployment processes.
- Implement Infrastructure as Code using Terraform.
- Automate configuration management using Ansible.

Monitoring & Reliability
- Implement monitoring, logging, and alerting mechanisms (CloudWatch, Azure Monitor, Prometheus, Grafana, etc.).
- Troubleshoot production issues and ensure minimal downtime.
- Improve system reliability and deployment velocity.

Security & Compliance
- Implement security best practices in infrastructure and pipelines.
- Manage IAM roles, access controls, and secrets securely across AWS/Azure environments.
- Conduct regular system audits and vulnerability checks.

Collaboration
- Work closely with development, QA, and product teams.
- Support release planning and environment readiness.
- Document processes, workflows, and infrastructure architecture.
Preferred Qualifications
- AWS and/or Azure Certifications (Associate or Professional level).
- Experience with microservices architecture.
- Experience with monitoring tools (Prometheus, Grafana, CloudWatch, Azure Monitor, etc.).
- Knowledge of security best practices and DevSecOps concepts.

Soft Skills
- Strong analytical and troubleshooting skills.
- Ability to work independently and within cross-functional teams.
- Strong documentation and communication skills.
- Proactive and ownership-driven mindset.
- Ability to manage multiple environments and deadlines efficiently.

Job Details
Experience: 5 years
Job Type: Fully Remote
Location: 9 pm to 5 am

Pakistan