
AMD · 1 day ago

Board System Reliability Engineer

AMD is a leading company focused on building innovative products that enhance computing experiences across various domains. The Board System Reliability Engineer will lead reliability efforts for AMD's MI accelerators, engaging with design and manufacturing teams to ensure product reliability and performance.

AI Infrastructure, Artificial Intelligence (AI), Cloud Computing, Computer, Embedded Systems, GPU, Hardware, Semiconductor
Growth Opportunities
No H1B

Responsibilities

As a Senior Member of Technical Staff (SMTS), you will lead reliability efforts for next-generation AMD MI accelerators and air/liquid-cooled systems, including product qualifications of accelerator modules, kits, and systems
Engage early with design, platform, and manufacturing teams to build in DfR (design for reliability)
Conduct NUDD (new, unique, different, difficult) analysis of new HW designs and utilize FMEA/FEA to define reliability assessment plans
Lead DfM (design for manufacturability) and process-FMEA reviews with cross-functional teams and contract manufacturing partners
Conduct reliability risk assessments and characterization DoE to ensure assembly processes are robust and optimized
Drive efficient execution of qualification plans across different teams/geographies, facilitating bring-up/debug support, telemetry analysis, and collaborative issue resolution
Quantify reliability risks using applied statistical methods

Qualifications

Electronics/platform reliability, Statistical analysis, Design for Reliability (DfR), Failure analysis methods, PCB design/layout tools, Certified Reliability Engineer (CRE), Analytical/problem-solving skills, Attention to detail, Collaboration skills

Required

Technical expert in electronics/platform reliability with broad knowledge across materials, mechanical, and electrical domains and the ability to deep-dive into each
Sound knowledge of material behavior, physics of failure, electronics failure mechanisms, solder reliability, electrochemical migration, burn-in screening, and operational field issues
Proven competence in statistical analysis, DoE formulations, and DPPM risk quantification
Strong analytical/problem-solving skills, highly organized, with attention to detail
A self-starter and leader who handles ambiguity well and takes strong accountability
Adept at delivering results under time constraints in a technical, fast-paced, solution-driven work environment, and at collaborating across international time zones
MS/PhD in Mechanical, Electrical, or Mechatronics Engineering

Preferred

Extensive experience in a quality/reliability engineering role, leading product validation/analysis and delivering reliability risk assessments/results to key stakeholders and customers
Experience in electronics board manufacturing/NPI, assembly/system integration, DfM, and reliability under various product lifecycle loads
Well versed in issue triage/debug, devising experiments to solve varied manufacturing challenges, and associated failure analysis methods
Datacenter platform validation experience, lifecycle warranty analysis, and knowledge of networking and optics
Basic knowledge of PCB design and layout tools such as Allegro, Gerber, and CAD tools
Familiarity with JEDEC, IPC, AEC, IEC, and ASHRAE industry standards
Certified Reliability Engineer (CRE) certification

Benefits

AMD benefits at a glance.

Company

Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.

Funding

Current Stage: Public Company
Total Funding: unknown
Key Investors: OpenAI, Daniel Loeb
2025-10-06: Post-IPO Equity
2023-03-02: Post-IPO Equity
2021-06-29: Post-IPO Equity

Leadership Team

Lisa Su, Chair & CEO
Mark Papermaster, CTO and EVP
Company data provided by Crunchbase