Input Output (IOHK)

IOG is one of the world's pre-eminent blockchain infrastructure research and engineering companies.

DevOps Engineer

DevOps Engineer | Full Time | Remote | Senior | Team 201-500 | Since 2015 | H1B No Sponsor

Location

India

Posted

74 days ago

Salary

0

Seniority

Senior

Job Description

  • Apply best practices for monitoring, observability, security, and infrastructure automation.
  • Ensure availability of services targeting 99.99% uptime for critical blockchain infrastructure.
  • Develop and maintain tools for Cardano and Midnight infrastructure in a multi-cloud or bare metal hybrid environment.
  • Actively troubleshoot issues during testing and deployment, addressing protocol-level bottlenecks.
  • Build and maintain CI/CD pipelines for Midnight projects.
  • Participate in on-call rota for production system interruptions.
  • Manage transition from federated validator to decentralized validator network.

Job Requirements

  • 5+ years in a DevOps/SRE role with direct experience in blockchain infrastructure.
  • Strong experience with Linux-based systems and administration.
  • Working knowledge of container environments and Kubernetes orchestration.
  • Proficient with Grafana, Prometheus, or similar tools for real-time monitoring.
  • Expert in shell scripting, with knowledge of systems languages, Rust, Go and Python.
  • Extensive experience configuring, scaling, monitoring, and tuning services in AWS/GCP.
  • Proven ability to write clear operational designs and blameless post-mortems.
  • Experience with continuous integration and continuous delivery tools such as Jenkins and GitHub Actions.

More DevOps Engineer Jobs

Sr. Site Reliability Engineer - 11293

Coupa Software

Spend is the fuel to help your company deliver performance, profitability, and purpose!

DevOps Engineer | 74 days ago
Full Time | Remote | Team 1,001-5,000 | Since 2006 | H1B Sponsor

Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.

Why join Coupa?

🔹 Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.
🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.
🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.

Learn more on the Life at Coupa blog and hear from our employees about their experiences working at Coupa.

The Impact of a Sr. Site Reliability Engineer at Coupa:

As a Senior Site Reliability Engineer, you will play a crucial role in the development of solutions for our Contract platform. Coupa Contract (Standard) enables customers to author, approve, and operationalize contracts, making them easily available for purchasing by employees across the organization. Contract compliance delivers savings as employees make purchases using negotiated rates and helps to mitigate risk by ensuring that appropriate terms are in place. Contract enforcement and spend visibility are provided through embedded dashboards at both the contract and summary level. Coupa Contract Advanced is an enterprise-class contract management solution to help companies improve contract visibility, risk management, and operational efficiency at scale. Contract Advanced is designed to handle the creation, storage, and optimization of any contract across any industry or department.

At a business level, together with the product management and development teams, you will change the way our customers deal with the contract life cycle management ecosystem and build best-in-class hosting infrastructure on cloud. At a technical level, we will jointly drive the scaling of our Business Spend Management platform on public cloud by following site reliability engineering (SRE) best practices.

What You'll Do:

  • Administration of Linux machines, web servers, application servers, and databases.
  • Application and cloud infrastructure support for customer environments.
  • Provide application support on Java and Ruby applications.
  • Own end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence.
  • Tools development and automation to increase availability and performance.
  • Ensuring the data, services, and infrastructure are reliable, fault-tolerant, efficiently scalable, and cost-effective.
  • Collaborate with Product and Release engineering for new product releases and maintenance.
  • Coordinate incident, problem, and change management.
  • Participate in on-call rotation for after-hours and weekend emergencies.

What You Will Bring to Coupa:

  • Bachelor's Degree with 8+ years of professional experience handling large-scale production systems.
  • Experience with AWS or comparable cloud providers with certification.
  • Experience in designing new services on AWS or a comparable cloud provider, migrating services to the cloud, and deploying new services on AWS or a comparable cloud provider.
  • Hands-on experience with Terraform and configuration management tools like Chef, Ansible, or equivalent.
  • Experience in application support/development on Java or Ruby.
  • Hands-on scripting experience with any one of these: Python or Bash.
  • Excellent knowledge of large-scale web applications/distributed systems.
  • Experience in Kubernetes, Docker, and/or cloud deployment technologies.
  • Experience in observability tools like NewRelic, Datadog, etc.
  • Expertise in problem solving and analyzing global-scale distributed systems.
  • Excellent written and verbal communication skills.
  • Critical thinking, continuously challenging how and why we do things to help us improve.

Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees.

Please be advised that inquiries or resumes from recruiters will not be accepted.

By submitting your application, you acknowledge that you have read Coupa’s Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.

Mexico

DevOps / Platform Engineering Lead

MAIA

Empowering the Mittelstand through AI-powered SaaS Solutions.

DevOps Engineer | 74 days ago
Full Time | Remote | Team 1-10 | Since 2021 | H1B Sponsor

  • You will be the first person who owns this area entirely.
  • Your mission is to make MAIA's platform reliable, secure, auditable, and developer-friendly, at a stage where every decision you make has lasting impact.
  • You take full ownership of our infrastructure and establish the standards that will govern how it grows - Hetzner first, AWS, GCP, and Azure for selected services.
  • You own and evolve our CI/CD pipelines (GitHub Actions) and deployment workflows - improving rollout strategies, versioning, and rollback procedures from where they stand today.
  • You build out and mature our observability stack (Grafana, Loki, Sentry, PostHog) so that problems surface before customers notice them.
  • You implement and own our security fundamentals: IAM, secrets management, TLS, vulnerability scanning, and patch management.
  • You drive the technical controls required for our ISO 27001 certification and build the systems that produce auditable evidence continuously.

Germany
€70K - €80K / year
Job Closed
Full Time | Remote | Team 5,001-10,000 | H1B Sponsor

  • Defining the reliability architecture for Akamai's AI compute and platform services, including SLO frameworks, fault tolerance patterns, and capacity planning models
  • Hands-on building of automation and tooling that reduces operational toil and scales the SRE team's impact
  • Designing observability strategy by leveraging Akamai's existing platform to build the telemetry, dashboards, alerts, and GPU-specific monitoring needed for AI workloads
  • Architecting deployment safety practices including progressive rollouts, canary analysis, rollback automation, and change safety processes
  • Influencing product engineering architecture and design decisions, embedding reliability into the development lifecycle at the system level
  • Mentoring and elevating other SREs through design reviews, code reviews, and hands-on problem-solving, setting the technical bar for the team

Poland
Full Time | Remote | Team 5,001-10,000 | H1B Sponsor

  • Lead the team responsible for reliability across Akamai's AI compute and platform services
  • Build the team, owning hiring strategy, candidate evaluation, and interview coordination for AI SRE roles
  • Partner with product engineering teams to embed reliability into the development lifecycle
  • Define and implement SRE practices for Akamai's AI compute and platform services
  • Ensure operational readiness for AI products by establishing quality gates, on-call rotations, runbooks, and escalation paths for AI infrastructure failure modes
  • Scale operations through software and automation, reducing toil and driving the team toward programmatic solutions over manual intervention
  • Own incident management integration for AI workloads, including post-incident analysis and driving systemic improvements that prevent recurrence

Poland
Job Closed