Boomi is the platform for intelligent connectivity and automation. Connect everyone to everything, anywhere.
Senior Site Reliability Engineer
Location
Ireland + 1 moreAll locations: Ireland | United Kingdom
Posted
51 days ago
Salary
0
Seniority
Senior
No structured requirement data.
Job Description
Senior Site Reliability Engineer
Boomi
About Boomi and What Makes Us Special Are you ready to work at a fast-growing company where you can make a difference? Boomi aims to make the world a better place by connecting everyone to everything, anywhere. Our award-winning, intelligent integration and automation platform helps organizations power the future of business. At Boomi, you’ll work with world-class people and industry-leading technology. We hire trailblazers with an entrepreneurial spirit who can solve challenging problems, make a real impact, and want to be part of building something big. If this sounds like a good fit for you, check out boomi.com or visit our Boomi Careers page to learn more. The Boomi Managed Cloud Services Team is looking for a Cloud Operations Engineer with a passion for delivering customer excellence. The Cloud Operations Engineer is responsible for providing a world class support experience, managing customer expectations, and resolving challenging issues for customers of the Boomi Managed Cloud Service, based in the United Kingdom. This role is key to delivering customer excellence and a world-class support experience for our Managed Cloud Service customers. The engineer will be responsible for managing customer expectations and resolving complex technical issues. The Role (What you need): - We're looking for a Cloud Operations Engineer for the Boomi Managed Cloud Services Team. - The job is all about giving awesome support and fixing tough issues for customers using the standard Site Reliability tools on the Boomi Managed Cloud Service. - You need to be based in the United Kingdom and have all the relevant documentation to legally live and work in the UK. What makes a successful candidate: - You're big on Site Reliability stuff. - You genuinely love working with customers and internal teams. - You're a detective when it comes to figuring out the root cause of issues (installation, config, performance, both infrastructure and app layers). - You're super curious and can learn fast. - You're a team player—ready to teach and learn from others. - You're into using AI methods to solve problems and build tools. - You're comfortable and confident operating in a technical IT environment while also managing direct customer-facing responsibilities, simultaneously. What you'll be doing: - Building Boomi Clouds using Ansible (predefined configurations). - Giving remote tech support for the Boomi Managed Cloud Service. - Dealing directly with customer issues related to Networking, Infrastructure, and Boomi application errors. - Trying to recreate customer problems to figure them out. - Using diagnostic skills to find issues and recommend fixes. - Building cool tools using Claude Code and AWS infrastructure. - Documenting problems and solutions in the support database. Must-Haves (Technical Requirements): - Experience with monitoring production systems, performance tuning, and advanced troubleshooting. - Experience supporting production Java runtimes (JVMs) on Cloud platforms (AWS, Azure). - Familiar with Ansible, Python, Harness, and Jenkins. - Intermediate to advanced with Linux (RHEL preferred)—you're comfortable at the command line! - Solid understanding of computer architecture, cloud tech, virtual computing, and networking basics (TCP/IP, SSH, NFS or NetApp). - Several years of experience in DevOps or Technical Cloud Support. Nice-to-Haves (Desirable): - Experience with Observability tools like New Relic, Datadog, or Splunk. - Know-how in troubleshooting Kubernetes or similar containerized services (like AWS EKS, Azure AKS). - A Bachelor’s degree in a relevant technical field. - Certification and proficiency in Boomi Runtime Architecture and Systems Administration. #LI-TS1 Be Bold. Be You. Be Boomi. We take pride in our culture and core values and are committed to being a place where everyone can be their true, authentic self. Our team members are our most valuable resources, and we look for and encourage diversity in backgrounds, thoughts, life experiences, knowledge, and capabilities. All employment decisions are based on business needs, job requirements, and individual qualifications. Boomi strives to create an inclusive and accessible environment for candidates and employees. If you need accommodation during the application or interview process, please submit a request to talent@boomi.com. This inbox is strictly for accommodations, please do not send resumes or general inquiries.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Site Reliability Engineer II ( Remote )
LivePersonLivePerson is an online engagement solutions company, which means that it works with clients to provide their customers with real, live assistance and advice. The company was found
LivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are powered by our Conversational Cloud each month. You'll be successful at LivePerson if you are excited to build something from the ground up. You excel by finding daily opportunities to grow at the same pace as the technology we're building, and you build partnerships that improve our business. Likewise, you're someone who sees feedback as a chance to learn and grow and believe decisions powered by data are the norm. You care about the well-being of others and yourself. Job Description: Site Reliability Engineer (Platform Engineer) Mid Level (L2) Location: India (Remote) Overview: We are seeking a Mid-Level Site Reliability Engineer (SRE) to join our global Platform Engineering team. As an SRE, your primary responsibility is to ensure that our platform is reliable, scalable, and performant. You’ll be the bridge between development and operations — designing automation, improving observability, and maintaining the health of our production systems. You should have what it takes to ask the right questions, identify potential risks early, and raise flags when necessary to maintain a culture of reliability and continuous improvement. You will: - Collaborate closely with Developers, QA, and Product teams during sprint planning to understand release plans, dependencies, and infrastructure requirements. - Participate in the application release cycle, ensuring deployments are automated, consistent, and reliable. - Manage and operate Kubernetes clusters in Google Kubernetes Engine (GKE) and Amazon Elastic Kubernetes Service (EKS). - Develop and manage Terraform modules for provisioning and configuring cloud infrastructure across GCP and AWS. - Standardize service deployments using Helm for templating and versioned releases. - Build and enhance observability with Prometheus, Grafana, and Datadog to monitor application and platform performance. - Design, implement, and maintain GitLab CI/CD pipelines for build, test, and deployment automation. - Drive an automation-first culture by developing scripts and tooling in Python, Go, or Shell to minimize manual effort and improve efficiency. - Participate in a 24/7 on-call rotation, ensuring quick detection, mitigation, and resolution of incidents. - Perform root cause analysis (RCA) and contribute to post-incident reviews to prevent recurrence. - Proactively identify reliability or scalability gaps, raise early warnings, and partner with teams to address systemic risks. You have: - 5-8 years of experience as a Site Reliability Engineer, Platform Engineer, or DevOps Engineer. - Hands-on experience managing Kubernetes clusters (GKE, EKS) in GCP and AWS. - Strong knowledge of Terraform, Helm, and GitLab CI/CD pipelines. - Proficiency in Python, Go, or Shell scripting for automation and tooling. - Experience implementing and managing observability stacks (Prometheus, Grafana, Datadog). - Deep understanding of Linux systems, cloud networking, and container orchestration concepts. - Experience working in Agile/Scrum environments and partnering closely with developers. - Excellent analytical skills with a proactive attitude — able to question assumptions and escalate potential risks early. Good to Have - Experience with ArgoCD or Flux (GitOps-based workflows). - Familiarity with service mesh (Istio, Linkerd) or API gateways. - Knowledge of cloud cost optimization, autoscaling, or security best practices. - Experience with incident management tools such as PagerDuty, ServiceNOW Why Join Us - Build and operate modern cloud-native platforms using Kubernetes, Terraform, GitLab, Datadog, and Grafana. - Be part of a global SRE team that values automation, reliability, and innovation. - Work in a collaborative culture that encourages ownership, learning, and continuous improvement. - Enjoy flexible working arrangements, competitive compensation, and career growth opportunities including certifications and mentorship. Why you’ll love working here: As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. And, we're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace. - Benefits: 15 Days PTO + Casual & Sick Leave - Insurance: 8 Lakhs Family Floater Coverage; Personal Accident & Life Insurance: 3x of Gross Annual Salary* The talent acquisition team at LivePerson has recently been notified of a phishing scam targeting candidates applying for our open roles. Scammers have been posing as hiring managers and recruiters in an effort to access candidates' personal and financial information. This phishing scam is not isolated to only LivePerson and has been documented in news articles and media outlets.Please note that any communication from our hiring teams at LivePerson regarding a job opportunity will only be made by a LivePerson employee with an @liveperson.com email address. LivePerson does not ask for personal or financial information as part of our interview process, including but not limited to your social security number, online account passwords, credit card numbers, passport information and other related banking information. If you have any questions and or concerns, please feel free to contact recruiting-lp@liveperson.com
Senior DevOps Engineer
accessoOur team is on a mission to improve the guest experience with technology. We support some of the world's top attractions and leisure & entertainment venues by creating innovative technology solutions that enhance the guest journey from start to finish. Currently, accesso® employs over 500 team members around the globe, many of whom come from the industries we serve. From ticketing and eCommerce to virtual queuing and more, we understand firsthand what makes our clients and their guests smile, and we’re constantly developing new solutions to enhance the guest experience while helping our clients streamline operations and drive revenue.
Position Overview: Do you love building ⚙️ products people love? We are looking for a smart and motivated Senior Devops Engineer to join our accesso Freedom team. You will assist the development team to fully automate, deploy and test new infrastructure and applications to ensure operational reliability. Input and implement deployment strategies with principle and architects. Collaborate with team members and own implementations. You will join a growing team supporting a new cloud native SaaS product collaborating with development and operational teams such as SRE’s, DevSecOps, and support to improve the platform and deliver requirements consistently & incrementally. As a Senior member of the team, you will be expected to lead technical decisions, and help set standards and best practices across the infrastructure discipline. Location: This role can be performed 100% remotely anywhere in the UK, at our office in Twyford, or a hybrid version of in-office and remote. Reports to: IT Manager, Engineering Travel ✈️ Requirement: None What you’ll be working on: - Implement and manage cloud resources to support SaaS application deployments which are designed to be scalable, secure, performant, manageable and cost effective - Ensure that all aspects of the implementation are scripted and repeatably deployable through Infrastructure as Code practices - Share the responsibility for maintaining clean, efficient CI/CD pipelines with Developers and QA - Use an established release process to release to Staging and Production environments, Bi-Monthly, outside of normal working hours - Share the responsibility for building observability into the platform with Developers, QA and Site Reliability Engineers - Identification and Resolution of platform and pipeline issues/problems as they occur as well as proactively looking to improve the reliability and performance - Participate in retrospectives after events occur ensuring plans are put in place to prevent repeat occurrences - Work with security team to identify and resolve infrastructure vulnerabilities and deployment issues - Ensure data storage and services are deployed in accordance with company security policies and cloud best practices - Enhance and participate in DR dry runs - Share the responsibility for building observability into the platform with Developers, QA and Site Reliability Engineers using the Grafana stack - Work with other members of the teams to exchange and build knowledge - Work in an agile, rapid development environment where effective communication is paramount - Collaborate to build and maintain an effective backlog to plan and track tasks according to prioritization - Vulnerability scanning and remediation across container images, clusters and IaC - Regular upgrades of components and touchpoints - Chaos engineering practices - Participate in the change management process - Focus on lean & improving DORA metrics - Evaluate, integrate, and operationalize AI-powered tooling (e.g., AI coding assistants, agentic AI workflows) to accelerate development and deployment processes - Champion the responsible adoption of AI-assisted automation across the engineering organization - Approach all work with Automation first approach, to reduce TOIL across all tasks and processes - Engage with architects, product owners, and business stakeholders to translate requirements into infrastructure solutions What you bring to the role: - 5+ years of current experience at both design and hands-on levels with Azure cloud infrastructure - Note: ‘years of experience’ may not always be the best measure for your ability to succeed in this role. If the below bullet points feel like you, please consider applying. - Professional experience with RBAC, key vault, VMs, VNETs, VPNs, NSGs, Git, Ubuntu WSL2, PowerShell, Azure CLI and Data platforms (SQL, ADF) - Kubernetes deployment and management experience, with associated toolsets such as Kubernetes CLI, Helm, terraform, packer and ansible. - Network management within Kubernetes with NGINX, service mesh, policy management, WAF - Infrastructure monitoring with the Grafana stack (Loki, Prometheus, Tempo). - Scripting and automation with Bash and/or Python in addition to PowerShell - Experience in load/ performance testing and tuning - Vulnerability management, such as Trivy or Kubescape - Azure Container registry management - Experience using AI-powered development tools (e.g., GitHub Copilot, Claude Code, Cursor) to improve engineering productivity - Experience with High Availability, DR/Business Continuity strategies - Must possess a positive attitude at work and make contributions to a healthy company culture - Strong problem-solving skills and logical troubleshooting approach - Required to join on call rotation (1 week in 4) ⭐️ Bonus points if you have: - Microsoft Azure Administrator / Architect or other relevant certification - Kubernetes Certified Administrator - Familiarity with agentic AI patterns - orchestrating autonomous AI agents for infrastructure management, incident response, or automated remediation - Secret management beyond Azure Key Vault - Understanding of POS, kiosk hardware, peripherals including OPOS drivers *If you don’t have all the qualifications listed, don’t worry! We understand everyone’s career path is unique and still encourage you to apply if you feel this role is aligned with your career trajectory. Perks & Benefits: - Competitive compensation package including an annual bonus opportunity, because your hard work deserves recognition; - 8-days of paid bank holiday leave and 26-days of paid annual leave (paid leave increases with tenure) – so you can go “OOO” and take that vacation you’ve been dreaming of 😎; - 8 hours of paid Volunteer Time Off to contribute to causes close to your heart. Making a difference, made easy. - Inclusive Family Benefits, including a $7,500 benefit, or currency equivalent, for surrogacy, adoption, and fertility. Because family planning should come with support 🫶; - Robust health insurance scheme with the opportunity to participate in private medical scheme after satisfactory performance; - Matching pension scheme (up to 8%) for a secure financial future; - Gain unlimited access to LinkedIn Learning to support ongoing learning and career development; - Enjoy a flexible work schedule that aligns with your team’s schedule⏰. LIFE at accesso: At accesso, we believe that fun is a fundamental part of the workday! From our tech to our passion for attractions, we infuse fun into everything we do, and our culture is no different. We’ve created a virtual environment with no shortage of connection – so share memes and high fives 🙌 with teammates, or break up your day with virtual escape quests, “Online Office Olympics” and more! Work-life balance is important here too, so you’ll have flexibility in choosing the work setting and hours that fit your life best (so long as your work permits). We believe that diversity is vital to innovation and that when we celebrate what makes each of us unique, we create a more inclusive environment where you can truly thrive🌱. Our people are our most treasured asset, and we are proud to have such talented, passionate and tech-savvy professionals on our team💚. We are dedicated to providing equal opportunities for all, and any hiring decisions will be assessed on qualifications, merit and business need. If there are any accommodations you may need throughout the hiring process, please feel free to email us at careers@accesso.com so that we can set you up for success. Learn more about Diversity & Inclusion at accesso. You can review our candidate privacy statement here: Candidate Privacy Statement ABOUT accesso: Our team is on a mission to improve the guest experience with technology. We support some of the world's top attractions and leisure & entertainment venues 🏟🎡🎢🚢🎻 by creating innovative technology solutions that enhance the guest journey from start to finish. Currently, accesso® employs over 500 team members around the globe 🌎, many of whom come from the industries we serve. From ticketing and eCommerce to virtual queuing and more, we understand firsthand what makes our clients and their guests smile, and we’re constantly developing new solutions to enhance the guest experience while helping our clients streamline operations and drive revenue.
Senior Cloud & DevOps Engineer
Blend360Optimizing business performance through people, data, tech & analytics
Company Description Blend is a premier AI services provider, committed to creating meaningful impact for its clients through the power of data science, AI, technology, and people. We help organisations solve complex business challenges by combining deep domain understanding with modern data and AI capabilities. Our teams work across strategy, analytics, engineering, and product delivery to create scalable, high-value solutions that improve decision-making, efficiency, and growth. Job Description We are looking for an experienced Senior Cloud & DevOps Engineer to support the build and production readiness of a foundational Azure data platform for a large telecommunications client. This role will focus on provisioning and operating the core Azure infrastructure, including Azure Data Factory, Data Lake Storage, data warehousing solutions and establishing the CI/CD pipelines, environment management, monitoring, and operational controls needed to take the platform through Dev, Test, and Production. The ideal candidate will have strong expertise in Azure-native architecture, infrastructure-as-code (Terraform), release engineering, observability, and secure platform operations in regulated environments. This person will work closely with Data Engineers, BI Consultants, and Governance leads to ensure the platform is deployable, scalable, secure, and aligned with enterprise and PIPEDA compliance standards. Responsibilities - Design and implement Azure cloud infrastructure and deployment patterns for the data platform, including Entra ID design, subscription hierarchy, naming conventions, and tagging standards. - Build and maintain CI/CD pipelines to support repeatable, controlled releases across Development, Test, and Production environments. - Provision and configure Azure infrastructure as code (Terraform), including Data Factory, Data Lake, ExpressRoute/VPN, network topology, and firewall rules to connect on-premises source systems. - Configure Azure DevOps and Databricks or Snowflake Git integration to enforce version-controlled deployments. - Support deployment of backend services, orchestration components, data services, and front-end applications. - Enable monitoring, logging, alerting, and telemetry for both platform health and end-user usage feedback loops. - Define and implement operational controls for reliability, performance, scalability, and incident response. - Implement and enforce secure access patterns using Entra ID, Azure Key Vault for secrets management, and RBAC, including column-level and row-level security controls required for PIPEDA compliance. - Ensure the solution aligns with architecture, security, and service transition requirements. - Support non-functional testing, release readiness, and path-to-production activities. - Produce comprehensive operational runbooks, platform documentation, and a full IaC handover package enabling the client’s internal IT team to take ownership of platform operations at programme close. - Support cost management, network performance tuning, and security hardening of the Azure platform; contribute to cost optimisation reporting and assist with backup and disaster recovery planning. Qualifications - Strong hands-on experience with CI/CD tooling and release automation. - Experience with infrastructure-as-code using Terraform or similar tools. - Hands-on experience deploying and operating cloud-native workloads in Microsoft Azure, including Data Factory, Databricks, Snowflake, Data Lake Storage, and Entra ID. - Strong understanding of containerisation, serverless and managed compute services, and environment promotion strategies. - Experience with observability tooling covering logging, monitoring, alerting, and service health. - Knowledge of security best practices including IAM, RBAC, secrets management, and policy-driven access control. - Experience supporting production-grade data platforms in enterprise environments, ideally in regulated sectors with compliance requirements such as PIPEDA or equivalent. - Familiarity with Git-based workflows and collaborative engineering practices. - Strong troubleshooting, communication, and stakeholder management skills. Nice to Have - Experience with specific Azure services including Azure Data Factory (including Self-Hosted Integration Runtime), Azure Databricks (Unity Catalog, Repos, Medallion architecture), Snowflake, Azure Data Lake Storage Gen2, Azure Key Vault, and Azure Monitor. - Familiarity with Azure DevOps pipelines and Power BI deployment pipelines for dev/test/prod environment promotion of both infrastructure and BI assets. - Experience with pipeline observability and data quality monitoring in Medallion architectures, including alerting on ingestion failures and SLA-driven orchestration schedules. - Understanding of Canadian data privacy requirements (PIPEDA) and how they translate into platform controls such as column-level security, PII tagging, RBAC design, and audit logging in Azure and data warehouse environments. - Experience supporting service transition into managed support models. - Exposure to QA automation and non-functional testing in cloud-native systems. What about languages? Advanced English proficiency required. How much experience must I have? 5+ years of experience in Cloud Engineering, DevOps, or Platform Engineering roles. Additional Information Our benefits: Learning Opportunities: - Certifications in AWS (we are AWS Partners), Databricks, and Snowflake. - Access to AI learning paths to stay up to date with the latest technologies. - Study plans, courses, and additional certifications tailored to your role. - Access to Udemy Business, offering thousands of courses to boost your technical and soft skills. - English lessons to support your professional communication. 👩🏫 Mentoring and Development: - Career development plans and mentorship programs to help shape your path. 🎁 Celebrations & Support: - Special day rewards to celebrate birthdays, work anniversaries, and other personal milestones. - Company-provided equipment. ⚖️ Flexible working options to help you strike the right balance. Other benefits may vary according to your location in LATAM. For detailed information regarding the benefits applicable to your specific location, please consult with one of our recruiters. So what are the next steps? Our team is eager to learn about you! Send us your resume or LinkedIn profile below and we’ll explore working together!
• Define and drive the DevOps Vision and using Agile best practices • Set direction, standards, and best practices for the team • Lead the design of scalable, secure, and reliable infrastructure and delivery pipelines • Establish and maintain CI/CD pipelines for multiple applications and services • Align DevOps initiatives with engineering, product, and business goals • Ensure high-quality engineering is demonstrated across the team • Design, deploy and maintain cloud infrastructure (Azure) • Mentor engineers and promote knowledge sharing • Facilitate clear communication between the different departments • Advocate DevOps culture across the teams looking to shift-left wherever possible




