At Graphcore, we’re building the future of AI compute. We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacenter scale. As part of the SoftBank Group, backed by significant long-term investment, we are delivering key technology into the fast-growing SoftBank AI ecosystem. To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world. We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence.
Reliability Engineer
Location
Taiwan
Posted
11 days ago
Salary
0
Seniority
Lead
Job Description
Reliability Engineer
Graphcore
About GraphcoreAt Graphcore, we’re building the future of AI compute.We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacenter scale.As part of the SoftBank Group, backed by significant long-term investment, we are delivering key technology into the fast-growing SoftBank AI ecosystem.To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world.We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence. Job Summary Responsible for system-level reliability of AI servers with liquid cooling and HVDC architectures, owning reliability validation, shock & vibration robustness, and failure analysis from board to rack level to ensure safe transport, deployment, and long-term datacenter operation. Key Responsibilities and skills - Plan and execute reliability validation across board, server, and rack levels. - Define and run environmental, accelerated, and mechanical tests, including thermal/power cycling, humidity, corrosion, shock & vibration, and HALT/HASS. - Lead shock & vibration validation for transportation, handling, seismic, and operational conditions. - Assess reliability risks for liquid cooling systems (leakage, fatigue, pump life, corrosion, coolant stability). - Evaluate HVDC mechanical and electrical robustness (busbars, connectors, power interfaces). - Perform reliability prediction and life data analysis (Weibull, MTBF). - Lead cross-functional design reviews and drive risk mitigation. - Conduct failure analysis and RCA using standard FA methodologies. - Define and maintain reliability and S&V test specifications (JEDEC, Telcordia GR-63, JESD22, MIL-STD-810, ISTA, ASHRAE, UL, IEC). - Implement On-going Reliability Test (ORT) for production quality. - Document results and support customer audits and certifications. Qualifications - Bachelor’s or Master’s degree in Mechanical, Electrical, Reliability, Materials, or related Engineering. - 10+ years of reliability engineering experience in AI servers, datacenter systems, HPC, or complex electronics. - Hands-on experience with environmental, shock, and vibration testing. - Strong knowledge of reliability methodologies and statistical analysis. - Practical experience with liquid cooling and HVDC systems. - Proven failure analysis and RCA capability. - Strong communication skills in English; Mandarin a plus. Preferred Experience - AI server architecture and large-scale liquid cooling systems. - FEA/modal analysis and test correlation. - Datacenter, telecom, and transportation standards knowledge. - Reliability certification (e.g., ASQ CRE). Benefits In addition to a competitive salary, Graphcore offers a competitive benefits package. We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.
Related Guides
Related Categories
Related Job Pages
More Engineer Jobs
Engineering Evaluator - Domain Expert - AI Trainer
MercorCincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from Mercor. While opportunities may be discovered through Mercor's platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.
Role Description - Evaluate AI-generated artifacts against domain-specific quality rubrics. - Identify factual, aesthetic, and presentation errors in documents, spreadsheets, and slide decks. - Provide clear, structured written feedback to improve AI model outputs. - Apply deep subject-matter expertise to grade outputs for accuracy and rigor. - Work independently and asynchronously to meet deadlines and enhance domain quality. Qualifications - 5+ years of relevant professional experience in Engineering / manufacturing / technical operations. - Native or professional fluency in English. - Highly proficient in Microsoft Office and Google Workspace, especially Slides. - Advanced degree (Master's or higher) from a reputable institution (preferred). Company Description Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Sr. Structural Analysis Engineer
Lanteris Space SystemsLanteris Space Systems is an aerospace company based in Palo Alto, California, specializing in the design and building of satellites and space systems for a var
Role Description Lanteris Space is currently seeking a Sr. Structural Analysis Engineer to join our team in Mountain View, CA. This position can also be performed remotely within the U.S. You'll be part of a challenging and fast-paced team, working on spacecrafts operating in earth orbits, deep space missions, interplanetary missions and much more. In this role, you will be responsible for performing structural analysis on spacecraft structures and subsystem assemblies. - Perform structural analyses which includes static, dynamics and thermal stress analysis - Conduct finite element analysis (FEA) and classical hand calculations to evaluate metal, non-metallic, and composite structures - Provide technical inputs to mechanical designs and support manufacturing related structural issues - Develop comprehensive load cases for on-orbit operations, launch environments, test conditions and ground handling scenarios - Design and execute structural test programs, including static and dynamics testing, to validate analytical predictions and design integrity - Collaboration with design engineers, manufacturing team and cross-disciplinary subsystems to solve complex problems - Consolidate analysis results into clear technical documentation - Write test reports and present in technical reviews Qualifications - Must be a U.S. citizen or permanent resident - Bachelor's degree in mechanical, civil or other engineering discipline or a related field. Four additional years of experience may be substituted for a degree - 5 years of experience. A master’s degree may account for two years of experience - Proficiency with FEMAP or Patran - Knowledge of structural testing, and basic structural analysis concepts such as strength of materials, stress analysis, and structural dynamics - Familiarity with advanced composite materials Requirements - Experience with Project management and team leadership roles - Excellent problem solving and analytical skills - Strong communication and interpersonal skills - Familiarity with Spacecraft structures - Familiarity with fracture analysis - Static and dynamics test experience - Local candidates for hybrid work Benefits - Comprehensive package of benefits including paid time off - Health and welfare insurance - 401(k) to eligible employees
Reliability Engineer
Kohl'sIt’s no secret that our associates love #LifeAtKohls and we know you will too.
Role Description As Reliability Engineer, you will ensure the resilience and availability of Kohl’s systems and applications and collaborate closely with development teams to review designs, conduct risk assessments and implement robust monitoring and failover mechanisms. - Drive incident response efforts, perform root cause analysis and implement preventative measures to enhance system reliability. - Establish consistent practices that elevate Kohl’s operational excellence through automation and process improvements. - Follow software lifecycle and drive reliability, observability and efficiency across product teams within an assigned domain. - Identify repeated toil and find opportunities for automation and risk reduction. - On-call on a rotation to respond to production incidents and conduct blameless retros and root-cause analyses (RCAs) to drive a culture of continuous improvements. - Proactively identify failures before they cause outages using chaos engineering techniques such as edge cases, failure modes and design review. - Advise on capacity planning and provide continuous assessments on systems behavior and consumption. - Work with product managers to identify and prioritize work for reliability best practices (i.e., leveraging SLIs/SLOs/Error Budgets). - Additional tasks may be assigned. Qualifications - Bachelor's Degree or equivalent in MIS, Computer Science or related field. - 2+ years of experience in software development. - Strong programming skills in one or more languages (Java, Python, Go or Node.js). - Working knowledge of systems architecture, operating system internals and network fundamentals. - Experience working with one cloud platform (e.g., GCP, AWS, or Azure). Requirements - Experience with monitoring techniques and tools (e.g., CloudWatch, Grafana, Prometheus, OpenTelemetry, Tracing). - Working knowledge around containerization and container orchestration (e.g., Docker, Kubernetes, Rancher).
Structural Analysis Engineer
Lanteris Space SystemsLanteris Space Systems is an aerospace company based in Palo Alto, California, specializing in the design and building of satellites and space systems for a var
Role Description Lanteris Space is currently seeking a Structural Analysis Engineer to join our team in Mountain View, CA. This position can also be performed remotely within the U.S. You'll be part of a challenging and fast-paced team, working on spacecrafts operating in earth orbits, deep space missions, interplanetary missions and much more. In this role, you will be responsible for performing structural analysis on spacecraft structures and subsystem assemblies. - Perform structural analyses which includes static, dynamics and thermal stress analysis - Conduct finite element analysis (FEA) and classical hand calculations to evaluate metal, non-metallic, and composite structures - Provide technical inputs to mechanical designs and support manufacturing related structural issues - Develop comprehensive load cases for on-orbit operations, launch environments, test conditions and ground handling scenarios - Design and execute structural test programs, including static and dynamics testing, to validate analytical predictions and design integrity - Collaboration with design engineers, manufacturing team and cross-disciplinary subsystems to solve complex problems - Consolidate analysis results into clear technical documentation - Write test reports and present in technical reviews Qualifications - Must be a U.S. citizen or permanent resident - Bachelor's degree in mechanical, civil or other engineering discipline or a related field. Four additional years of experience may be substituted for a degree - 2 years of experience. A master’s degree may account for two years of experience - Proficiency with FEMAP or Patran - Knowledge of structural testing, and basic structural analysis concepts such as strength of materials, stress analysis, and structural dynamics - Familiarity with advanced composite materials Requirements - Experience with project management and team leadership roles - Excellent problem solving and analytical skills - Strong communication and interpersonal skills - Familiarity with spacecraft structures - Familiarity with fracture analysis - Static and dynamics test experience - Local candidates for hybrid work Benefits - Comprehensive package of benefits including paid time off - Health and welfare insurance - 401(k) to eligible employees
