Sr. Quality & Reliability Engineer, Trainium Servers and Systems Manufacturing jobs in United States
cer-icon
Apply on Employer Site
company-logo

Amazon Web Services (AWS) · 1 day ago

Sr. Quality & Reliability Engineer, Trainium Servers and Systems Manufacturing

Amazon Web Services (AWS) is a leading provider of cloud services, and they are seeking a Senior Reliability Engineer for their Trainium Manufacturing, Quality and Reliability Team. The role involves engaging with cross-disciplinary teams to design infrastructure technologies, drive manufacturing process improvements, and ensure product reliability through testing and validation.

ConsultingDevOpsInformation TechnologySoftwareWeb Development
check
H1B Sponsor Likelynote

Responsibilities

Be responsible for the test validation of future technologies
Drive manufacturing process improvements to address reliability issues and concerns
Qualify manufacturing lines and mechanisms for mass production
You will have a fundamental understanding of Reliability statistics/Reliability tests and/or solid understanding of computer systems to influence design for reliability
Lead identifying and validating product/component risks and work with design teams to mitigate them and define the test methodology and test coverage to assure product reliability
Deep-dive in technologies aligned with product roadmap
Provide technical leadership and mentor engineers
Perform Reliability prediction of failure mechanisms, products under development and products in the field
Working with multiple vendors and ODMs to standardize component manufacturing and reliability expectations
Responsible for defining reliability tests to be implemented during manufacturing
Drive manufacturing process improvements to address reliability issues and concerns
Perform Reliability prediction of failure mechanisms, products under development and products in the field
Working with multiple vendors and ODMs to standardize component manufacturing and reliability expectations

Qualification

Reliability EngineeringServer Compute PlatformsFailure AnalysisReliability StatisticsTest Plan DevelopmentMaterials CharacterizationAnalytical SkillsTeam CollaborationCommunication SkillsProblem Solving

Required

Bachelor's or Master's degree in Reliability Engineering, Physics or related field, or equivalent experience
7+ years of Reliability Engineering work experience with server compute platforms or on high-tech hardware

Preferred

Master's Degree or PhD in Reliability Engineering or related field
Demonstrated ability to uncover systemic issues prior to NPI
Working understanding of server subcomponents (CPU, memory, HDD, SSD, motherboard, thermal system, peripherals, etc.)
Analytical, test plan, and test procedure development experience related to server compute platforms or with high-tech hardware
Demonstrated ability to achieve stretch goals
Ability to thrive in a startup-like environment
Demonstrated ability to drive failure analysis activities to root cause swiftly and accurately
Able to work in a diverse team
Reliability modeling and materials characterization experience
Ability to influence development teams, procurement and external partners
Meets/exceeds Amazon's leadership principles requirements for this role
Meets/exceeds Amazon's functional/technical depth and complexity for this role

Benefits

Equity
Sign-on payments
Full range of medical, financial, and/or other benefits

Company

Amazon Web Services (AWS)

company-logo
Launched in 2006, Amazon Web Services (AWS) began exposing key infrastructure services to businesses in the form of web services -- now widely known as cloud computing.

H1B Sponsorship

Amazon Web Services (AWS) has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)

Funding

Current Stage
Late Stage
Total Funding
unknown
Key Investors
BIRD Foundation
2025-01-22Grant

Leadership Team

leader-logo
Matt Garman
Chief Executive Officer
linkedin
leader-logo
Anand Desikan
CTO, CXO Advisor, and Enterprise Technologist
linkedin
Company data provided by crunchbase