Job Closed
This listing is no longer active.
Headquartered in Pleasanton, California, Veeva is a leading provider of cloud-based software and services for the life sciences industry. As an employer, Veeva
Senior Software Engineer – SRE
Location
California
Posted
56 days ago
Salary
$110K - $270K / year
Seniority
Senior
Job Description
Senior Software Engineer – SRE
Veeva
• Build Cloud Infrastructure: Rapidly build new cloud infrastructure from scratch, adhering to software development best practices • Drive Reliability & Scalability: Ensure our platform meets the scalability and reliability needs of our hundreds of global customers (across North America, Europe, and Asia) • Lead Incident Management: During an incident, effectively lead triage and mitigation efforts, potentially performing periodic on-call duty for escalations • Automate & Optimize: Develop tools and automation to eliminate manual work and reduce issue resolution times • Full-Stack Diagnostics: Proactively learn all necessary systems to provide full-stack diagnostics and determine root causes of production problems • Strategic Engineering Partnership: Strategize with engineering teams on complex problems, offering insights on what will work at scale (supporting 2M+ users) and guiding development decisions before features ship • Influence Design: Participate in engineering design reviews of new features and drive initiatives to improve operational efficiency and platform scalability • Cross-functional Collaboration: Partner effectively with Product Management, Design, and QA to deliver cutting-edge solutions and direct customer value • Backend Focus: Work across multiple layers of our technology stack, with a primary focus on backend development, and opportunities in frontend and infrastructure • Effective Communication: Communicate clearly with engineering teams, succinctly describing problems for seamless hand-offs during outages with both technical and non-technical audiences • Mentorship: Actively mentor team members, contributing to a positive and high-performing team environment
Job Requirements
- Deep Java Expertise: 5+ years of experience in Java development, with a strong preference for experience within enterprise cloud software companies
- Operational Experience: Hands-on operational experience in a high-volume or critical production service environment, including incident management and root cause analysis
- Code Quality: Proven ability to write clean, testable, readable, and maintainable code within a collaborative team setting
- Open Source Proficiency: Hands-on experience with a range of open-source technologies, such as Spring, MySQL, Hibernate, Solr, Maven, Git, Tomcat, Linux, AWS, Vagrant, Docker, and Kubernetes
- Database Mastery: 3+ years of experience in relational databases with expert-level SQL skills
- Scripting Skills: Solid scripting proficiency with languages such as Shell, Bash, Ansible, Python, Go, Ruby, etc.
- Leadership & Communication: Demonstrated history of incident management and leadership ability, with effective communication skills across all levels (individual contributors to executives)
- Mentorship: Proven record of making your team better through mentorship
- This role requires a working schedule of Monday - Friday, 2 PM - 10 PM PST, and candidates must be located in the HST or PST time zones to be considered
- Applicants must have the unrestricted right to work in the United States. Veeva will not provide sponsorship at this time
Benefits
- Medical, dental, vision, and basic life insurance
- Flexible PTO and company paid holidays
- Retirement programs
- 1% charitable giving program
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Engineer
My Personal RecruiterLeverage Today Best Technology And Methods To Help You Land Your Dream Job.
• Tightly partner with Development groups to build and deploy better services • Develop Operational targets and telemetry with Development clients • Promote strong quality control practices through continuous integration and continuous deployment implementations using tools such as Sonar • Responsible for application integration activities for monitoring tools using tools such as DataDog, SumoLogic and the alert integration with collaboration tools such as MS Teams • Analyze software applications to identify vulnerabilities using tools such as JFrog Xray, Veracode, Sonar. • Design and maintain infrastructure automation tools • Production experience with Docker/Kubernetes at scale • Design and evolve deployment systems and pipelines for reliability, security, and efficiency • Gain deep understanding of supported services • Implement system and service telemetry to improve reliability and availability • Develop deep insight into application and service performance • Execute on security best practices for cloud deployments • Ensure system security through industry best practices
Senior DevOps Engineer
Bikeleasing GruppeUmweltbewusst, engagiert und serviceorientiert: Unser Erfolg ist das Ergebnis starker Teamarbeit: An unseren Unternehmensstandorten im südniedersächsischen Uslar, im hessischen Vellmar und in Innsbruck (Österreich) sind zurzeit rund 430 Dienstrad-Begeisterte beschäftigt. Wir bringen Arbeitnehmer, Selbstständige und Freiberufler aufs Dienstrad. Aktuell nutzen mehr als 80.000 Unternehmen und Organisationen des öffentlichen Dienstes mit über 4 Millionen Angestellten die Leistungen der Bikeleasing-Service GmbH & Co. KG. Wir wachsen weiter und fördern zukunftsfähige, ökologische und sozial gerechte Mobilität, indem wir uns dafür einsetzen, dass in Zukunft noch mehr Menschen, die Vorteile des leasingfinanzierten Dienstrads nutzen können.
Role Description Die Bikeleasing Gruppe gehört zu den führenden Employer-Benefit- und Mobilitätsanbietern in Deutschland. Aktuell befinden wir uns mitten in einer umfangreichen technischen Transformation. Für diesen Weg suchen wir Dich als Senior DevOps Engineer (gn) – Lead Role. - Gestalte unsere Cloud-Infrastruktur end-to-end. - Etabliere ein nachhaltiges Operating Model für eine Multi-Stakeholder-DevOps-Funktion. - Baue Self-Service-Fähigkeiten auf, die unsere Produktteams nachhaltig befähigen. - Übernehme die vollständige Verantwortung für unsere Cloud-Infrastruktur (AWS, Azure, Terraform). - Sorge für zuverlässige Deployments durch Ownership über unsere CI/CD-Landschaft. - Entwickle Self-Service-Fähigkeiten gemeinsam mit unserem Enabler Team. - Definiere klare SLIs und SLOs über kritische Services hinweg. - Verantworte die technische Security- und Compliance-Postur. - Führe das DevOps-Team fachlich und organisatorisch. Qualifications - Mindestens 5 Jahre Erfahrung in DevOps, SRE oder Platform Engineering. - Mindestens 2 Jahre in einer Senior- oder Lead-Rolle. - Sehr starke Hands-on-Expertise mit Terraform, Kubernetes, GitOps und Observability. - Tiefe Kenntnisse in AWS, idealerweise ergänzt durch Erfahrungen in Azure oder GCP. - Sichere Kenntnisse in Infrastruktur-Security. - Fähigkeit, Infrastrukturthemen in Business Impact zu übersetzen. Requirements - Führungskompetenz in einem kleinen Team. - Konstruktive Zusammenarbeit mit internen und externen Stakeholdern. - Echtes Builder-Mindset. Benefits - Maximale Flexibilität durch modernes Gleitzeitmodell und Workation Policy. - Monatlicher 50 €-Gutschein über Probonio und 60 € jährlich zu Deinem Geburtstag. - Starke Versicherungsbedingungen für betriebliche Altersvorsorge. - Möglichkeit, bis zu zwei Fahrräder oder Pedelecs über uns zu leasen. - Moderne Tech-Umgebung (Kotlin, TypeScript, PHP, Spring Boot, Vue.js, React, NestJS, Symfony; AWS & Terraform). - Hochwertige technische Ausstattung inkl. freier Wahl des Betriebssystems. - Weiterbildung über Udemy inkl. Arbeitszeit und regelmäßigen Tech-Konferenzen. - Interne Tech-Talks für Wissensaustausch und Weiterentwicklung. - Voll remote innerhalb Deutschlands möglich.
Lead DevOps/SRE (GCP/GKE, Ukraine)
Capgemini EngineeringCapgemini Engineering, the leader in engineering and R&D services, helps clients unleash their R&D potential.
At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world’s most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and engineering services across all industries. Join us for a career full of opportunities. Where you can make a difference. Where no two days are the same. Overview We are looking for a highly skilled Senior DevOps Engineer with 6+ years of experience to design, implement, and maintain scalable infrastructure and CI/CD pipelines. The ideal candidate should have strong expertise in Kubernetes, Linux systems, and Python scripting, along with a deep understanding of cloud platforms (GCP/GDCE) and automation. Key Responsibilities Design, build, and maintain scalable and highly available infrastructure using Kubernetes in a 24x7 environment. Deploy, manage, and monitor containerized applications. Develop and maintain CI/CD pipelines for automated build, test, and deployment processes. Automate infrastructure provisioning using Infrastructure as Code (IaC) tools. Manage and optimize Linux-based systems and servers. Write efficient scripts and tools using Python for automation and monitoring. Implement logging, monitoring, and alerting solutions. Ensure system security, compliance, and reliability. Troubleshoot production issues and ensure minimal downtime. Collaborate with development and QA teams to streamline delivery processes. Required Skills and Qualifications 6+ years of experience in DevOps / SRE / Infrastructure Engineering. Manage and optimize GKE clusters and other container orchestration platforms. Strong hands-on experience with Kubernetes (deployment, scaling, troubleshooting). Solid understanding of Linux/Unix systems administration. Proficiency in Python scripting for automation. Experience with CI/CD tools (e.g., Jenkins, GitLab CI, GitHub Actions). Knowledge of containerization tools like Docker. Familiarity with Infrastructure as Code tools (Terraform, CloudFormation). Understanding of networking, security, and system architecture. Experience with monitoring tools (Prometheus, Grafana, ELK stack). Preferred Qualifications Experience with microservices architecture. Knowledge of Helm charts and Kubernetes operators. Certification in Kubernetes (CKA/CKAD) or cloud platforms (GCP preferred). Experience with configuration management tools (Ansible, Chef, Puppet). Soft Skills Strong problem-solving and analytical skills. Good communication and collaboration abilities. Ability to work in a fast-paced, agile environment. What you will love about working here? We care about all our employees and want them to feel as comfortable as possible. That's why we offer them health insurance from the first days, regardless of the probationary period. The gift from the company - Christmas holidays from 25 December to 31 December. Сooperation with Superhumans center and Veteran HUB. Capgemini Engineering has supported the launch of psychological rehabilitation department of Superhumans. Our team also donnated over UAH 500 000 prosthetics for three Ukrainian defenders. Currently, we support psychological counseling provided by the Veteran Hub, and we have implemented a internal policy making the company friendly to military and veterans with the assistance of the Hub. Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem. #LI-Remote
Job Summary The University Research Computing Facility (URCF) at Drexel University is building a new shared computing platform focused on GPU-accelerated workloads, particularly AI model training. The system includes GPU and CPU compute nodes with Nvidia H200, A100, and Grace Hopper hardware, orchestrated by Kubernetes on bare-metal, as well as a 1 PB high-performance Weka storage cluster and a 3 PB S3-compatible archival storage system with iRODS as the metadata layer. The DevOps Engineer will help build and operate this platform, working alongside the URCF’s Research Computing Specialist and collaborators in Drexel IT. The platform is under active development, and URCF is itself in the process of adopting container-native tools and workflows coming from a more traditional HPC background. This means the role involves building new things, improving what exists, and navigating some institutional learning curves alongside us. We currently use the following Technologies: - Ansible - Warewulf - Proxmox - Kubernetes (RKE2) - Cilium - Kyverno - Envoy - Kubeflow - Weka - iRODS - STORJ - Globus - Rocky Linux - Python and - Bash. PLEASE NOTE: You don’t need experience with all of these. We include the list so you can get a sense of the environment This is a grant-funded position through September 1, 2027. It is fully remote. If you’re not sure whether you’re qualified, we’d encourage you to apply anyway. This position is grant-funded; employment is contingent upon the continued availability of those funds. Essential Functions - Develop and maintain automation for provisioning, configuring, and managing the cluster (Ansible, Warewulf, Kubernetes manifests, shell scripts). - Contribute to the Kubernetes platform layer, including networking, storage integration, security policies, and workload orchestration. - Help built out storage infrastructure, including iRODS and Globus/Globus Connect Server for data transfer, as well as the integrations between these systems and the compute cluster. - Troubleshoot issues across the stack, from bare-metal boot problems to container orchestration bugs. - Write and maintain operational and user-facing documentation. - Coordinate with Drexel’s IT teams on shared infrastructure concerns (networking, DNS, firewall rules, etc.). - Contribute to web application development for a user-facing portal for project management, permissions, and usage tracking. Required Qualifications - Minimum of a Bachelor's Degree in Computer Science, Engineering, or a related field or the equivalent combination of education and work experience (Please review the Equivalency Chart for additional information). - Minimum of 1–3 years of experience. - Experience with infrastructure tooling such as Linux systems administration, configuration management, containers, or container orchestration. - Comfortable working in a terminal with tools like Git, SSH, and a text editor. - Working proficiency with at least one scripting language (Python, Bash, etc.). - Strong written communication skills. - Ability to work independently and manage your own time in a fully remote setting. Preferred Qualifications - Experience with Kubernetes. - Experience with bare-metal provisioning or HPC cluster management. - Familiarity with any of: Ansible, Warewulf, RKE2, Cilium, Kubeflow, Weka, iRODS, Globus, infrastructure-as-code tools generally. - Web application development experience (any stack). - Experience in an academic or research computing environment. Physical Demands - Typically sitting at a desk/table - Lifting demands ≤ 25lbs Location - Remote Additional Information This position is classified as Exempt, grade N. Compensation for this grade ranges from $90,430.00 - $135,64000 per year. Please note that the offered rate for this position typically aligns with the minimum to midrange of this grade, but it can vary based on the successful candidate’s qualifications and experience, department budget, and an internal equity review. Applicants are encouraged to explore the Professional Staff salary structure and Compensation Guidelines & Policies for more details on Drexel’s compensation framework. For information about benefits, please review Drexel’s Benefits Brochure. Special Instructions to the Applicant Please make sure you upload your CV/resume and cover letter when submitting your application. A review of applicants will begin once a suitable candidate pool is identified. #LI-Remote Job duties: • Develop and maintain automation for provisioning, configuring, and managing the cluster (Ansible, Warewulf, Kubernetes manifests, shell scripts). • Contribute to the Kubernetes platform layer, including networking, storage integration, security policies, and workload orchestration. • Help built out storage infrastructure, including iRODS and Globus/Globus Connect Server for data transfer, as well as the integrations between these systems and the compute cluster. • Troubleshoot issues across the stack, from bare-metal boot problems to container orchestration bugs. • Write and maintain operational and user-facing documentation. • Coordinate with Drexel’s IT teams on shared infrastructure concerns (networking, DNS, firewall rules, etc.). • Potentially contribute to web application development for a user-facing portal for project management, permissions, and usage tracking. (This isn’t the core of the role, but if you have web development experience and are interested, there’s real work to be done here.)Essential -->


