Amazon Web Services (AWS) · 1 day ago
Sr. Quality & Reliability Engineer, Trainium Servers and Systems Manufacturing
Amazon Web Services (AWS), a leading provider of cloud services, is seeking a Senior Reliability Engineer for its Trainium Manufacturing, Quality and Reliability Team. The role involves working with cross-disciplinary teams to design infrastructure technologies, drive manufacturing process improvements, and ensure product reliability through testing and validation.
Consulting, DevOps, Information Technology, Software, Web Development
Responsibilities
Be responsible for the test validation of future technologies
Drive manufacturing process improvements to address reliability issues and concerns
Qualify manufacturing lines and mechanisms for mass production
Apply a fundamental understanding of reliability statistics and reliability tests, and/or a solid understanding of computer systems, to influence design for reliability
Lead the identification and validation of product and component risks, work with design teams to mitigate them, and define the test methodology and test coverage needed to assure product reliability
Deep-dive into technologies aligned with the product roadmap
Provide technical leadership and mentor engineers
Perform reliability prediction for failure mechanisms, products under development, and products in the field
Work with multiple vendors and ODMs to standardize component manufacturing and reliability expectations
Define reliability tests to be implemented during manufacturing
Qualifications
Required
Bachelor's or Master's degree in Reliability Engineering, Physics or related field, or equivalent experience
7+ years of Reliability Engineering work experience with server compute platforms or on high-tech hardware
Preferred
Master's Degree or PhD in Reliability Engineering or related field
Demonstrated ability to uncover systemic issues prior to NPI
Working understanding of server subcomponents (CPU, memory, HDD, SSD, motherboard, thermal system, peripherals, etc.)
Analytical, test plan, and test procedure development experience related to server compute platforms or with high-tech hardware
Demonstrated ability to achieve stretch goals
Ability to thrive in a startup-like environment
Demonstrated ability to drive failure analysis activities to root cause swiftly and accurately
Able to work in a diverse team
Reliability modeling and materials characterization experience
Ability to influence development teams, procurement and external partners
Meets/exceeds Amazon's leadership principles requirements for this role
Meets/exceeds Amazon's functional/technical depth and complexity for this role
Benefits
Equity
Sign-on payments
Full range of medical, financial, and/or other benefits
Company
Amazon Web Services (AWS)
Launched in 2006, Amazon Web Services (AWS) began exposing key infrastructure services to businesses in the form of web services -- now widely known as cloud computing.
H1B Sponsorship
Amazon Web Services (AWS) has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)
Funding
Current Stage
Late Stage
Total Funding
unknown
Key Investors
BIRD Foundation
2025-01-22 Grant
Company data provided by crunchbase