Reliability Engineer

Location

United States

Posted

45 days ago

Salary

$115K - $150K / year

Seniority

Mid Level

Job Description

Reliability Engineer

Cordia LLC

Job Summary: This position is Remote. The Cordia team is seeking a Reliability Engineer to support maintenance, reliability, and optimization initiatives across its fleet. Reporting directly to the Operations Support Manager, this role focuses on identifying, explaining, and reducing equipment and system reliability losses through data analysis, engineering judgment, and follow-through. The Reliability Engineer helps develop the fleet reliability program and partners closely with plant teams to turn operating and maintenance data into actionable improvements that support safe, reliable service for customers across the region. Principle Responsibilities/Key Results Areas: · Help develop, improve, and execute a robust reliability program and contribute to a strategic fleet roadmap. · Support the establishment and continuous improvement of a standard work management and maintenance planning process, working with plant managers to provide tools and guidance to properly prioritize maintenance tasks in alignment with business objectives. · Utilize technical documentation, design drawings, operational manuals, and historical data to create clear, usable maintenance procedures. · Continuously enhance the functionality of the computerized maintenance management system (CMMS), Fiix, to monitor schedule adherence, optimize asset & spare parts management, and improve the quality of maintenance tasks being performed. · Conduct assessments of current assets to identify emerging equipment health risks, potential failure points, and opportunities for improvement. · Drive the implementation of the Fiix CMMS system for new plants or assets in coordination with plant and corporate stakeholders as part of the region’s growth initiatives · Propose and implement improved practices, methods, and procedures to increase safety, efficiency, and quality of maintenance work · Lead root cause analysis reviews for equipment downtime and support development and execution of corrective actions to reduce the risk of repeat failures · Implement reporting of key performance metrics for the reliability program and leverage data analysis to formulate focused, prioritized improvement plans · Provide support and training to technicians and plant managers on reliability centered maintenance practices, fostering a culture of continuous improvement and skill development · Ensure strict compliance with industry standards, regulatory requirements, and best engineering practices for asset management · Support the continuous improvement mindset for safety, reliability, maintenance, and project management initiatives Skills and Qualifications: · High School Diploma · A minimum of five (5) years of experience in an operations or maintenance environment · Experience with industrial or energy systems, rotating equipment, thermal systems, or process operations OR a strong demonstrated interest in applied reliability work · Experience spending time in both data and the field to understand plant/system/equipment behavior · Proven ability to prioritize work streams to determine the highest impact on overall company objectives · Track record of owning issues end-to-end, including follow-up after implementation · Willingness to lead and train others to increase the maintenance planning skillset throughout the organization · Familiarity with CMMS (Fiix) computer programs or the ability to quickly learn its functionalities · Comfortable working across multiple facilities and coordinating with diverse stakeholders · Excellent multitasking skills, capable of managing numerous projects and activities concurrently · Excellent written and verbal communication skills with the ability to convey technical concepts effectively · High proficiency in computer-based programs including the Microsoft Suite as well as project scheduling tools Preferred Qualifications: · Associate or bachelor's degree in a related field (engineering, construction management, etc.) · 8 (8) years of experience in an operations or maintenance environment · Experience in contributing to or helping build a comprehensive reliability program · Experience developing and improving plant and system processes, authoring clear and effective SOPs, and leading the implementation of process and system changes across multiple operating facilities. · Fiix maintenance management program and/or NetSuite experience · Aveva PI operational data platform experience · Exposure to data analysis, controls, or instrumentation · Knowledge of industry best reliability practices (root cause analysis, equipment health scores, predictive maintenance technology, etc.)

Related Categories

Related Job Pages

More Engineer Jobs

Noda logo

Senior QA Engineer

Noda

We’re Hiring | Better Buildings for a Better World

Engineer45 days ago
Full TimeRemoteTeam 51-200Since 2018H1B No Sponsor

Hi! At Noda we’re building seamless Open Banking solutions that make payments smarter, faster, and safer. We're currently on the lookout for an experienced Senior QA Engineer to join Noda, If you're fueled by fintech excitement and seek a global career journey, keep reading! Your mission: We’re growing the team behind our core solution - a complex and business-critical system - currently supported by 4 skilled testers. This role calls for a highly experienced and dependable professional, as the responsibilities will include: - Designing and executing test cases across functional, integration, regression, UI, and end-to-end testing; - Performing manual API testing (Postman) and validating complex payment flows and integrations; - Testing web (DevTools, proxy tools) and mobile applications; - Identifying, documenting, and communicating bugs with clear reports and usability feedback; - Collaborating with developers, product managers, and stakeholders on testing strategy; - Investigating production issues using logs and monitoring tools (e.g. Kibana, Elasticsearch); - Analyzing risks, alerts, and system behavior, including antifraud-related flows; - Working effectively with incomplete documentation and supporting release processes. What are your skills and experience: - 3–5+ years of experience in Manual QA (strong Middle or Senior level); - Proven experience in fintech or payment systems (highly preferred); - Strong hands-on experience in API testing, including tools like Postman or similar; - Solid understanding of web application testing, including use of browser DevTools and proxy tools (e.g. Charles, Fiddler); - Experience testing mobile applications (iOS/Android); - Practical experience working with complex payment flows and financial transactions (a strong advantage); - Good knowledge of databases (SQL and/or NoSQL); - Experience working with logs and monitoring tools (e.g. Elasticsearch, Kibana); - Strong skills in test design, including ability to cover edge cases and complex scenarios; - Experience testing complex integrations and systems with non-trivial business logic (e.g. calculations, financial flows); - Ability to analyze alerting/reporting systems and understand system behavior based on logs and metrics; - Experience testing antifraud systems or risk-related features is a strong plus; - High level of attention to detail and analytical thinking; - Ability to work with incomplete or ambiguous documentation, proactively identifying gaps and predicting system behavior; - Strong problem-solving skills and ownership mindset. Nice to have: - Experience with Microsoft services like: Microsoft 365, Microsoft Entra ID, Microsoft Power BI, Azure DevOps, Dynamics 365. - Experience with AI-driven testing tools and agents What do we offer? - 100% Remote. Work from anywhere. Let the world be your office - work from one of our headquarters, or remotely. We span borders and continents, and promote the nomad lifestyle - it’s up to you to decide - Employee Learning and Development. Professional growth. We encourage our employees to constantly develop and grow within their field of expertise by covering the training and education fees. - Team buildings. Doesn’t matter where you work from - we’ll find a way to get the Team together. Take a part in a range of online and offline activities, as dinners, hikes, bike/cart ridings, karaoke nights, boat-trips, etc. - Tech. We provide or compensate all necessary hardware. - Non-toxic environment. We love what we do, we are proud to be #nodapeople and we are working together to achieve Noda goals! What happens once you apply: - Your CV will undergo careful review, and we'll quickly update you on the next step in our recruitment journey. - Get ready for an easy-going 45-minute screening call with our HR Manager, where you'll delve into the heart of our company, product, and team dynamics while sharing your own experiences and aspirations. - If the vibe matches and expectations align, the next step will be two additional interview rounds (technical and final). - And if everything clicks, anticipate a thrilling job offer landing in your inbox soon! We do our best to close interview rounds within 3 weeks, although sometimes it might take slightly longer. Send your application our way, we look forward to meeting you!

Poland
GoDaddy logo

Senior Site Reliability Engineer

GoDaddy

GoDaddy is a web services platform that helps individuals and businesses worldwide start, grow, and manage their online presence. GoDaddy employs team members a

Engineer45 days ago

Location Details: BC or ON, Canada, remote. At GoDaddy, the future of work looks different for each team. Some teams work in the office full-time; others have a hybrid arrangement (they work remotely some days and in the office some days) , and some work entirely remotely.​ This is a remote position, so you’ll be working remotely from your home. You may occasionally visit a GoDaddy office to meet with your team for events or meetings. Join our team GoDaddy is seeking a Senior Site Reliability Engineer to join our team and play a crucial role in building and maintaining our cutting-edge eCommerce platform. As a Senior Site Reliability Engineer (SRE), you will collaborate with multinational development teams to ensure the smooth deployment and ongoing maintenance of our platform. What you'll get to do... - Design, implement, and manage robust CI/CD pipelines to automate application and platform deployments. - Own and continually enhance the release process to ensure scalability, efficiency, and high quality across distributed environments. - Deploy and support new infrastructure components, and provide operational support for production updates and hotfixes across products such as Poynt terminals and cloud services. - Collaborate with engineering, product, and business teams to coordinate release schedules and ensure timely, reliable delivery. - Mentor junior team members and participate in on-call rotations to maintain system reliability and operational excellence. Your experience should include... - 4+ years of experience with Kubernetes (EKS/AKS/GKE/Fargate, etc.). - 4+ years of hands-on experience working with AWS, both in cloud-native and agnostic capacities. - 4+ years of expertise in Linux administration. - Strong coding skills in languages such as Go, Python, Ruby etc. - Strong experience in coding infrastructure as code (Pulumi, Terraform, Ansible, CDK, etc.). You might also have... - Strong understanding of CI/CD concepts, version control systems, and testing tools (e.g., Jenkins, Gradle, Maven). - Hands-on experience with SQL databases, especially MySQL or PostgreSQL. - Proven experience in building and managing distributed infrastructure and systems. - Solid understanding of networking principles and a strong commitment to cybersecurity best practices. - Ability to work independently and perform effectively under pressure. We've got your back...  We offer a range of total rewards that may include paid time off, retirement savings (e.g., 401k, pension schemes), bonus/incentive eligibility, equity grants, participation in our employee stock purchase plan, competitive health benefits, and other family-friendly benefits including parental leave. GoDaddy’s benefits vary based on individual role and location and can be reviewed in more detail during the interview process. We also embrace our diverse culture and offer a range of Employee Resource Groups (Culture). Have a side hustle? No problem. We love entrepreneurs! Most importantly, come as you are and make your own way. We encourage you to apply even if your experience or skillset doesn’t align perfectly with every requirement. We value a wide range of backgrounds and transferable skills, and we are excited to support learning and growth. About us... GoDaddy is empowering everyday entrepreneurs around the world by providing the help and tools to succeed online, making opportunity more inclusive for all. GoDaddy is the place people come to name their idea, build a professional website, attract customers, sell their products and services, and manage their work. Our mission is to give our customers the tools, insights, and people to transform their ideas and personal initiative into success. To learn more about the company, visit About Us. At GoDaddy, we know diverse teams build better products—period. Our people and culture reflect and celebrate that sense of diversity and inclusion in ideas, experiences and perspectives. But we also know that’s not enough to build true equity and belonging in our communities. That’s why we prioritize integrating diversity, equity, inclusion and belonging principles into the core of how we work every day—focusing not only on our employee experience, but also our customer experience and operations. It’s the best way to serve our mission of empowering entrepreneurs everywhere, and making opportunity more inclusive for all. To read more about these commitments, as well as our representation and pay equity data, check out our Diversity and Pay Parity annual report which can be found on our Diversity Careers page. GoDaddy is proud to be an equal opportunity employer. GoDaddy will consider for employment qualified applicants with criminal histories in a manner consistent with local and federal requirements. Refer to our full EEO policy. Our recruiting team is available to assist you in completing your application. If they could be helpful, please reach out to myrecruiter@godaddy.com. GoDaddy doesn’t accept unsolicited resumes from recruiters or employment agencies.

BC + 2 moreAll locations: BC | ON | Canada
Akvelon, Inc. logo

MLOps Engineer (Relocation to Serbia)

Akvelon, Inc.

Custom-Built Software Engineering Teams

Engineer45 days ago
Full TimeRemoteTeam 1,001-5,000Since 2000H1B No Sponsor

This engagement is focused on building an internal AI platform that enables developers to ship AI-powered services efficiently. Scope includes model connectivity, prompt testing and evaluation, monitoring/observability, and the underlying AI infrastructure layer. The objective is to improve DevEx and reduce time-to-market for AI features. Location: Serbia (relocation support available), Croatia, Poland, Portugal Tasks - Build and operate the AI platform infrastructure enabling developers to ship LLM-based services faster. - Implement and maintain Kubernetes-based runtime environments (incl. AKS) for AI workloads. - Manage infrastructure as code with Terraform (modules, environments, CI/CD automation). - Support LLM workflows: RAG, agents, prompt experimentation, evaluations, and deployment patterns. - Integrate and operate tooling such as Azure AI Foundry, LiteLLM, Langfuse, MLflow. - Orchestrate pipelines using Kubeflow Pipelines and/or Argo Workflows (build, deploy, evaluate). - Improve platform reliability and observability (monitoring, logging, tracing, cost/perf signals). - Collaborate closely with developers to streamline DX (APIs, templates, docs, golden paths, automation). Requirements - Strong hands-on experience with Kubernetes in production (preferably AKS). - Solid Terraform expertise (IaC best practices, multi-env setups). - Practical experience supporting ML/LLM workloads in a platform or DevOps/MLOps context. - Proficiency in Python for automation, scripting, and supporting APIs/evaluation tooling. - Understanding of CI/CD, release processes, and production-grade operations. - Ability to work under tight timelines and deliver pragmatically. Nice to Have - Experience building internal developer platforms or “paved roads” for engineering teams. - Familiarity with LLM evaluation frameworks, prompt testing workflows, and LLM observability. - Exposure to RAG architectures, vector databases, and agentic patterns. - Experience with Kubeflow, Argo, and ML lifecycle tooling. Engagement Type - Long-term B2B contract. Team - You will join a team of 5, with 3 AI Platform Engineers being added. Location / Timezone - Remote work from Croatia, Poland, Portugal, and Serbia. - European working hours. - Occasionally available for meetings up to 10:00 AM PST (US overlap).

Kazakhstan
Job Closed
Akvelon, Inc. logo

MLOps Engineer (Relocation to Serbia)

Akvelon, Inc.

Custom-Built Software Engineering Teams

Engineer45 days ago
Full TimeRemoteTeam 1,001-5,000Since 2000H1B No Sponsor

This engagement is focused on building an internal AI platform that enables developers to ship AI-powered services efficiently. Scope includes model connectivity, prompt testing and evaluation, monitoring/observability, and the underlying AI infrastructure layer. The objective is to improve DevEx and reduce time-to-market for AI features. Location: Serbia (relocation support available), Croatia, Poland, Portugal Tasks - Build and operate the AI platform infrastructure enabling developers to ship LLM-based services faster. - Implement and maintain Kubernetes-based runtime environments (incl. AKS) for AI workloads. - Manage infrastructure as code with Terraform (modules, environments, CI/CD automation). - Support LLM workflows: RAG, agents, prompt experimentation, evaluations, and deployment patterns. - Integrate and operate tooling such as Azure AI Foundry, LiteLLM, Langfuse, MLflow. - Orchestrate pipelines using Kubeflow Pipelines and/or Argo Workflows (build, deploy, evaluate). - Improve platform reliability and observability (monitoring, logging, tracing, cost/perf signals). - Collaborate closely with developers to streamline DX (APIs, templates, docs, golden paths, automation). Requirements - Strong hands-on experience with Kubernetes in production (preferably AKS). - Solid Terraform expertise (IaC best practices, multi-env setups). - Practical experience supporting ML/LLM workloads in a platform or DevOps/MLOps context. - Proficiency in Python for automation, scripting, and supporting APIs/evaluation tooling. - Understanding of CI/CD, release processes, and production-grade operations. - Ability to work under tight timelines and deliver pragmatically. Nice to Have - Experience building internal developer platforms or “paved roads” for engineering teams. - Familiarity with LLM evaluation frameworks, prompt testing workflows, and LLM observability. - Exposure to RAG architectures, vector databases, and agentic patterns. - Experience with Kubeflow, Argo, and ML lifecycle tooling. Engagement Type - Long-term B2B contract. Team - You will join a team of 5, with 3 AI Platform Engineers being added. Location / Timezone - Remote work from Croatia, Poland, Portugal, and Serbia. - European working hours. - Occasionally available for meetings up to 10:00 AM PST (US overlap).

United States
Job Closed