Senior Lead Storage & Server Test Engineer (Austin) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Celestica · 2 days ago

Senior Lead Storage & Server Test Engineer (Austin)

Celestica is a leader in design, manufacturing, hardware platform and supply chain solutions. The Senior Lead Storage and Server Test Engineer will be responsible for designing, developing, and executing comprehensive test strategies for AI data center's storage and server infrastructure, while also mentoring junior engineers and driving test automation.

ElectronicsManufacturingProduct DesignSupply Chain Management
check
H1B Sponsor Likelynote

Responsibilities

Define, develop, and implement comprehensive test plans and strategies for all storage and server hardware, firmware, and software components within the AI data center environment
Lead the test team in designing, executing, and analyzing complex test cases, including functional, performance, reliability, stress, and endurance testing
Mentor and provide technical guidance to junior test engineers, fostering a culture of technical excellence and continuous improvement
Design and implement automated test frameworks and scripts using languages like Python, Go, or similar, to improve efficiency and coverage of testing
Conduct in-depth performance analysis and bottleneck identification for storage systems (e.g., NVMe, SSD, HDD arrays, distributed storage, SAN/NAS) and server platforms (e.g., CPU, GPU, memory, PCIe, networking), and OpenBMC interfaces/features
This includes debugging issues related to BMC functionality and its interaction with server hardware
Develop and maintain robust testbeds and infrastructure for continuous integration and validation
Utilize open-source and commercial test tools relevant to storage, server, and OpenBMC validation
Collaborate closely with hardware design, software development, infrastructure, and AI/ML engineering teams to understand requirements and integrate testing throughout the product lifecycle
Communicate test progress, results, and critical issues effectively to stakeholders, including executive leadership
Develop specialized test methodologies to validate performance and reliability under heavy AI/ML workloads (e.g., large model training, inference at scale, data ingestion)
Understand and test the interactions between GPU-accelerated computing, high-speed networking, and storage systems

Qualification

Enterprise storage systemsServer architecturesTest automationPerformance analysisScripting languagesLinux operating systemsNetworking conceptsTest methodologiesProblem-solving skillsCommunication skillsInterpersonal skills

Required

Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related technical field
7+ years of experience in hardware and/or software testing, with at least 5 years focused on enterprise-level storage and server systems
3+ years of experience in a lead or senior technical role, mentoring junior engineers or leading test initiatives
Proven experience in a lead or senior technical role, mentoring and guiding other engineers
Deep expertise in various storage technologies including NVMe, SAS/SATA SSDs/HDDs, RAID, distributed file systems (e.g., Ceph, Lustre, GPFS), SAN, and NAS
Strong understanding of server architectures (x86, ARM, GPU servers), CPU/memory subsystems, PCIe, and power management
Strong understanding of server architectures (x86, ARM, GPU servers), CPU/memory subsystems, PCIe, power management, and Baseband Management Controllers (BMC) functionality
Proficiency in scripting languages (e.g., Python, Bash) for test automation and data analysis
Experience with Linux operating systems (e.g., Ubuntu, CentOS, RHEL) and command-line tools
Familiarity with networking concepts (Ethernet, TCP/IP, InfiniBand) and network testing methodologies
Experience with test methodologies such as performance testing, reliability testing, stress testing, and fault injection
Excellent problem-solving, analytical, and debugging skills
Strong communication and interpersonal skills, with the ability to collaborate effectively across diverse teams

Preferred

Familiarity with OCP (Open Compute Project)
Experience with cloud environments (AWS, Azure, GCP) and virtualization technologies
Knowledge of containerization technologies (Docker, Kubernetes)
Familiarity with AI/ML frameworks (e.g., TensorFlow, PyTorch) and their infrastructure requirements
Experience with performance profiling tools (e.g., fio, Iometer, Perf, VTune)
Contributions to open-source projects related to storage, servers, or testing
Certifications in relevant technologies (e.g., NetApp, Dell EMC, HPE, NVIDIA)

Company

Celestica

company-logo
Celestica is a manufacturing firm that provides design, hardware platform, and supply chain solutions to a multitude of industries.

H1B Sponsorship

Celestica has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (13)
2024 (3)
2023 (6)
2020 (3)

Funding

Current Stage
Public Company
Total Funding
$1.47B
2024-06-20Post Ipo Debt· $657.67M
2023-06-05Post Ipo Secondary· $148.8M
2021-09-30Post Ipo Debt· $660.4M

Leadership Team

leader-logo
Robert Mionis
President and CEO
linkedin
Company data provided by crunchbase