Every day, ANDRITZ continues to deliver successful innovative solutions to our customers globally. Why are we so successful? Because we are passionate and love what we do! We are at the forefront of future engineering technologies, with solutions that ensure the success of our clients in key industries that are shaping the future of the world we live in. ANDRITZ representa Paixão, Parceria, Perspectivas e Versatilidade, valores fundamentais com os quais a empresa está comprometida. Paixão: Você ama o que faz? Busca extrair o melhor de você? Parceria: Você está procurando uma posição onde possa ser acessível e autêntico com todos os nossos parceiros? Perspectivas: Você está disposto a encontrar novos caminhos, tecnologias e soluções promissoras para o futuro da ANDRITZ? Versatilidade: Você está disposto a enfrentar novos desafios e lidar com eles de forma flexível e criativa?
Reliability Specialist
Location
Northern America
Posted
3 days ago
Salary
0
Seniority
Mid Level
No structured requirement data.
Job Description
Reliability Specialist
ANDRITZ AG
Role Description This position is responsible for executing maintenance assessments, performance studies, and action plans. Key responsibilities include: - Developing pre-engineering activities - Managing startups, shutdowns, and overhauls - Reliability engineering, installation, and online monitoring of field instrumentation and process performance - Implementing new maintenance and performance tools - Supporting instrumentation and automation requirements within maintenance contracts - Collaborating with engineers from the Pulp & Paper divisions and customer teams both inside and outside the Andritz Group This role requires excellent reporting and presentation skills. Qualifications - Minimum of 2 years of project management experience - CMMS system experience - Excellent communication skills required - Pulp and Paper experience preferred - Maintenance Experience preferred - Planning Experience preferred - Reliability Engineering Experience preferred Requirements - Physical condition for field inspection in Pulp and Paper Mills - Ability to express themselves with correctness, certainty, and objectivity, transmitting security and confidence - Ability to present a calm and balanced temperament, with self-control of their actions Travel Percentage - 10% travels in North America - *Please note, previous DUI convictions can result in being inadmissible to Canada Company Description Every day, ANDRITZ continues to deliver successful innovative solutions to our customers globally. Why are we so successful? Because we are passionate and love what we do! We are at the forefront of future engineering technologies, with solutions that ensure the success of our clients in key industries that are shaping the future of the world we live in.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior Cloud DevOps Engineer
OneStream SoftwareA comprehensive cloud-based platform to modernize the Office of the CFO.
• Develop and maintain Infrastructure-as-Code such as Terraform, PowerShell, ARM, Bicep, Bash, and YAML languages • Deliver high-quality implementations in a timely manner • Design and maintain CI/CD pipelines supporting secure, reliable, and repeatable deployments • Update technical documentation, workflows, and knowledge base articles • Build knowledge in focused areas of the OneStream platform and deployment stack • Participate in collaborative engineering, peer reviews, and knowledge sharing initiatives • Collaborate with other teams to define, estimate, and implement requirements for new automations or services needed for development • Apply software engineering best practices to infrastructure and automation development • Optimize cloud environments for scalability, reliability, and cost efficiency • Participate in troubleshooting and resolution of complex production issues across cloud platforms and services • Work with Compliance and Security teams to ensure compliance with required controls
• Champion a security-first mindset within Engineering to help set the security posture of our platform infrastructure — supply chain hardening, secrets management, IAM/IRSA, container image integrity, and vulnerability remediation across our AWS/EKS environment • Design and build automation that makes compliance evidence continuous, not manual — translating HITRUST controls into passing tests and structured outputs that flow into our compliance tooling (Vanta) • Embed security into the platform by default: make the secure path the easy path for application engineers, through guardrails, policy-as-code, and well-documented patterns • Partner with our Security team to translate threat assessments and control gaps into engineering proposals with clear scope, tradeoffs, and recommended paths forward • Lead platform security initiatives from design to operationalization — requirements, technical design, code and code review, deployment, and documentation • Contribute hands-on to the broader platform: CI/CD pipelines, container orchestration, observability, and developer tooling — this is an IC role, not a governance role • Participate in on-call rotation and own the systems you build, including production incidents • Mentor engineers on security practices and raise the security baseline across the team
Senior Reliability Operations Engineer
Serve RoboticsMeet the future of sustainable, self-driving delivery.
• Serve as the primary incident lead during your region’s daytime hours, coordinating technical investigations, centralizing communication, and engaging the appropriate engineering and SRE teams when escalation is required. • Respond to escalations from Tier 1 support, using runbooks, metrics, logs, and system diagnostics to investigate and remediate issues or determine when escalation to Tier 3 is necessary. • Develop and update runbooks, workflows, and operational documentation to ensure consistent and reliable responses to recurring issues, collaborating with product teams to expand coverage over time. • Write, maintain, and enhance automation scripts and tools that streamline common remediation steps, improve response times, and reduce manual operational overhead. • Use metrics, logs, and tracing tools (Grafana/Prometheus, GCP Monitoring, OpenTelemetry) to proactively identify problems, validate system behavior, and support continuous improvement of detection mechanisms. • Act as the central point of communication during active incidents, ensuring timely updates and clear routing to the correct product engineering and SRE stakeholders. • Collaborate with reliability and product teams to share insights, recommend improvements, and help refine processes that enhance the stability and operability of our systems. • Participate in a shared weekend on-call rotation to help maintain operational coverage for production systems, responding to incidents and escalations as needed and coordinating with engineering teams when issues arise. • Help establish operational best practices, refine workflows, and prepare the foundation for a broader reliability operations function.
• Lead incident investigations during your region’s daytime hours, providing timely updates, escalating appropriately, and supporting senior engineers leading the response. • Respond to escalations from Tier 1 support using established runbooks, metrics, logs, and diagnostics to remediate issues or escalate to Tier 3 when needed. • Update runbooks and operational documentation based on new issues, discoveries, and feedback, ensuring clarity and consistency across all procedures. • Run existing automations and collaborate with senior team members to enhance tooling and scripts that streamline troubleshooting and remediation tasks • Use observability tools such as Grafana/Prometheus, GCP Monitoring, and OpenTelemetry to interpret metrics, logs, and traces, helping identify anomalies and validate system performance. • Provide concise, accurate updates during incidents, ensuring information reaches the correct engineering and SRE contacts and supporting structured incident coordination. • Participate in discussions around root causes, share operational insights, and contribute to process improvements that enhance system stability and supportability. • Participate in a shared weekend on-call rotation to help maintain operational coverage for production systems, responding to incidents and escalations as needed and coordinating with engineering teams when issues arise. • Proactively strengthen workflows, adopt best practices, and build the foundation of the Reliability Operations function as it evolves.


