Spend is the fuel to help your company deliver performance, profitability, and purpose!
Senior DevOps Engineer
Location
Mexico
Posted
74 days ago
Salary
0
Seniority
Senior
Job Description
Senior DevOps Engineer
Coupa Software
• Administer Linux machines, web servers, application servers, and databases, and provide application and cloud infrastructure support for customer environments.
• Provide application support on Java and Ruby applications.
• Own end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence.
• Develop tools and automation to increase availability and performance.
• Ensure that data, services, and infrastructure are reliable, fault-tolerant, efficiently scalable, and cost-effective.
• Collaborate with Product and Release engineering on new product releases and maintenance.
• Coordinate incident, problem, and change management.
• Participate in the on-call rotation for after-hours and weekend emergencies.
Job Requirements
- Bachelor's Degree with 8+ years of professional experience handling large scale production systems.
- Experience with AWS or comparable cloud providers with certification.
- Experience designing new services on AWS or a comparable cloud provider, migrating services to the cloud, and deploying new services on that provider.
- Hands-on experience with Terraform and configuration management tools like Chef, Ansible or equivalent.
- Experience in application support/development on Java or Ruby.
- Hands-on scripting experience with at least one of Python or Bash.
- Excellent knowledge of large scale web applications/distributed systems.
- Experience in Kubernetes, Docker, and/or cloud deployment technologies.
- Experience with observability tools such as New Relic or Datadog.
- Expertise in problem solving and analyzing global scale distributed systems.
- Excellent written and verbal communication skills.
- Critical thinking, continuously challenging how and why we do things to help us improve.
Benefits
- Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.
- Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.
- Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.
More DevOps Engineer Jobs
DevOps Engineer
Input Output (IOHK)
IOG is one of the world's pre-eminent blockchain infrastructure research and engineering companies.
• Apply best practices for monitoring, observability, security, and infrastructure automation.
• Ensure availability of services targeting 99.99% uptime for critical blockchain infrastructure.
• Develop and maintain tools for Cardano and Midnight infrastructure in a multi-cloud or bare metal hybrid environment.
• Actively troubleshoot issues during testing and deployment, addressing protocol-level bottlenecks.
• Build and maintain CI/CD pipelines for Midnight projects.
• Participate in on-call rota for production system interruptions.
• Manage transition from federated validator to decentralized validator network.
Sr. Site Reliability Engineer - 11293
Coupa Software
Spend is the fuel to help your company deliver performance, profitability, and purpose!
Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.
Why join Coupa?
🔹 Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.
🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.
🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.
Learn more on the Life at Coupa blog and hear from our employees about their experiences working at Coupa.
The Impact of a Sr. Site Reliability Engineer at Coupa:
As a Senior Site Reliability Engineer, you will play a crucial role in the development of solutions for our Contract platform. Coupa Contract (Standard) enables customers to author, approve, and operationalize contracts, making them easily available for purchasing by employees across the organization. Contract compliance delivers savings as employees make purchases using negotiated rates and helps to mitigate risk by ensuring that appropriate terms are in place. Contract enforcement and spend visibility are provided through embedded dashboards at both the contract and summary level. Coupa Contract Advanced is an enterprise-class contract management solution to help companies improve contract visibility, risk management, and operational efficiency at scale. Contract Advanced is designed to handle the creation, storage, and optimization of any contract across any industry or department.
At a business level, together with the product management and development team, you will change the way our customers deal with the contract lifecycle management ecosystem and build best-in-class hosting infrastructure on the cloud. At a technical level, we will jointly drive scaling our Business Spend Management platform on public cloud by following Site Reliability Engineering (SRE) best practices.
What You'll Do:
• Administer Linux machines, web servers, application servers, and databases, and provide application and cloud infrastructure support for customer environments.
• Provide application support on Java and Ruby applications.
• Own end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence.
• Develop tools and automation to increase availability and performance.
• Ensure that data, services, and infrastructure are reliable, fault-tolerant, efficiently scalable, and cost-effective.
• Collaborate with Product and Release engineering on new product releases and maintenance.
• Coordinate incident, problem, and change management.
• Participate in the on-call rotation for after-hours and weekend emergencies.
What You Will Bring to Coupa:
- Bachelor's Degree with 8+ years of professional experience handling large scale production systems.
- Experience with AWS or comparable cloud providers with certification.
- Experience designing new services on AWS or a comparable cloud provider, migrating services to the cloud, and deploying new services on that provider.
- Hands-on experience with Terraform and configuration management tools like Chef, Ansible or equivalent.
- Experience in application support/development on Java or Ruby.
- Hands-on scripting experience with at least one of Python or Bash.
- Excellent knowledge of large scale web applications/distributed systems.
- Experience in Kubernetes, Docker, and/or cloud deployment technologies.
- Experience with observability tools such as New Relic or Datadog.
- Expertise in problem solving and analyzing global scale distributed systems.
- Excellent written and verbal communication skills.
- Critical thinking, continuously challenging how and why we do things to help us improve.
#LI-REMOTE #LI-AA2
Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees.
Please be advised that inquiries or resumes from recruiters will not be accepted.
By submitting your application, you acknowledge that you have read Coupa’s Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.
• You will be the first person who owns this area entirely.
• Your mission is to make MAIA's platform reliable, secure, auditable, and developer-friendly, at a stage where every decision you make has lasting impact.
• You take full ownership of our infrastructure and establish the standards that will govern how it grows - Hetzner first, AWS, GCP, and Azure for selected services.
• You own and evolve our CI/CD pipelines (GitHub Actions) and deployment workflows - improving rollout strategies, versioning, and rollback procedures from where they stand today.
• You build out and mature our observability stack (Grafana, Loki, Sentry, PostHog) so that problems surface before customers notice them.
• You implement and own our security fundamentals: IAM, secrets management, TLS, vulnerability scanning, and patch management.
• You drive the technical controls required for our ISO 27001 certification and build the systems that produce auditable evidence continuously.
• Defining the reliability architecture for Akamai's AI compute and platform services, including SLO frameworks, fault tolerance patterns, and capacity planning models
• Hands-on building of automation and tooling that reduces operational toil and scales the SRE team's impact
• Designing observability strategy by leveraging Akamai's existing platform to build the telemetry, dashboards, alerts, and GPU-specific monitoring needed for AI workloads
• Architecting deployment safety practices including progressive rollouts, canary analysis, rollback automation, and change safety processes
• Influencing product engineering architecture and design decisions, embedding reliability into the development lifecycle at the system level
• Mentoring and elevating other SREs through design reviews, code reviews, and hands-on problem-solving, setting the technical bar for the team



