Lambda · 2 months ago
Hardware Quality Engineer
Lambda is a company focused on building Gigawatt-scale AI Factories for Training and Inference. The Hardware Quality Engineer will be responsible for managing quality issues in the data center, performing root cause analysis, and collaborating with cross-functional teams to ensure the reliability and performance of AI-driven infrastructure.
AI InfrastructureArtificial Intelligence (AI)Cloud ComputingData CenterGPUMachine Learning
Responsibilities
Track, log, and manage all quality issues arising in the data center during deployment and production environment
Perform root cause analysis (RCA) for every failure (hardware, software, process)
Analyze production system metrics and quality data to detect trends, anomalies, or weak points
Improve turnaround time (TAT) for Return Merchandise Authorization (RMA) processes
Design, monitor, and drive corrective and preventive actions (CAPA)
Implement and verify containment actions to keep systems operational until permanent fixes are applied
Collaborate with operations, hardware, engineering, supply chain, and vendors to resolve quality issues
Capture and upload failure analysis (FA) reports and related data into Quality Management Systems (QMS)
Verify quality of spares (incoming and outgoing) to avoid repeat failures
Define and track quality KPIs / SLAs and report on quality performance to leadership
Oversee MRB (Material Review Board) inventory, rework, disposal decisions
Ensure the quality management system (QMS) is up to date, with necessary training rolled out
Work cross-functionally during hardware ramp, deployments, and upgrades to ensure quality gates
Up to 30% travel may be required for this role
Qualification
Required
Have experience working with hardware / data center / infrastructure systems
Are strong at data analysis, statistics, and metrics (you can turn raw data into insight)
Are skilled in root cause analysis methods (5 Whys, fishbone, 8D, A3, etc.)
Are comfortable managing cross-team communication, stakeholder expectations, and conflict resolution
Are detail-oriented, process-driven, and quality-minded
Have experience working with quality tools or QMS software (e.g. audit modules, ERP, defect tracking)
Communicate clearly in English (both written and verbal)
Preferred
Experience in the machine learning / AI infrastructure / GPU / HPC / computer hardware industry
Exposure to data center standards, certifications (e.g. ISO, Uptime Institute, etc.)
Experience working on vendor quality, supply chain quality, or incoming inspections
Understanding of firmware, embedded systems, reliability engineering
Familiarity with scripting or automation (Python, SQL, etc.) to help with data processing
Exposure to cloud or hyperscaler infrastructure operations
Experience with 'manufacturing-like' quality concepts applied to compute hardware
Benefits
Health, dental, and vision coverage for you and your dependents
Wellness and Commuter stipends for select roles
401k Plan with 2% company match (USA employees)
Flexible Paid Time Off Plan that we all actually use
Company
Lambda
Lambda is a cloud-based platform that provides high-performance GPU hardware and cloud infrastructure for AI model training and inference.
H1B Sponsorship
Lambda has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (16)
2024 (1)
2023 (3)
2022 (2)
2021 (2)
2020 (3)
Funding
Current Stage
Late StageTotal Funding
$3.19BKey Investors
TWG GlobalJP MorganMacquarie Group
2025-11-18Series E· $1.5B
2025-08-19Debt Financing· $275M
2025-02-19Series D· $480M
Recent News
2026-01-11
2026-01-09
Company data provided by crunchbase