Etched · 1 week ago
MTS - Supercomputing (Test)
Etched is building the world’s first AI inference system purpose-built for transformers, delivering unprecedented performance and cost efficiency. They are seeking a Supercomputing Software Engineer (Test) to ensure the reliability of their high-performance inference server hardware and software through comprehensive testing and collaboration with engineering teams.
AI InfrastructureArtificial Intelligence (AI)ComputerHardwareSemiconductor
Responsibilities
Test Development: Design, develop, and implement automated burn-in test suites using common scripting languages (Python, Go, Bash) and test frameworks across all aspects of System Operation including: boot sequences, root-of-trust, system management, workload deployment and performance
Test Execution: Execute burn-in tests on server hardware, monitor system performance and health, and analyze test results
Failure Analysis: Investigate and debug hardware and software failures identified during testing, providing detailed reports and mitigation plans
Collaboration: Collaborate with internal and external hardware and software engineering teams to identify root causes of failures and implement corrective actions
Test Infrastructure: Contribute to the development and maintenance of the burn-in testing infrastructure, including portable test environments and automation tools runable in any environment
Documentation: Create and maintain comprehensive documentation for test plans, test cases, and test results
Performance Analysis: Analyze system performance metrics to identify potential bottlenecks and areas for optimization
Continuous Improvement: Participate in continuous improvement efforts to enhance the efficiency and effectiveness of the burn-in testing process
Qualification
Required
Proficiency in at least one scripting language (e.g., Python, Bash, Go)
Experience with software testing methodologies and tools
Strong understanding of operating systems (Linux preferred) and server hardware architectures
Ability to analyze complex technical problems and provide effective solutions
Excellent communication and collaboration skills
Ability to work independently and as part of a team
Experience with version control systems (e.g., Git)
Experience with reading and interpreting hardware logs
Preferred
Experience with hardware burn-in testing or reliability testing
Knowledge of server virtualization and cloud computing concepts
Experience with performance testing and benchmarking tools
Familiarity with hardware diagnostic tools and techniques
Experience with containerization technologies (e.g., Docker, Kubernetes)
Experience with CI/CD pipelines
Knowledge of low level hardware communication protocols (i2c, etc.)
Experience with data analysis tools and techniques
Benefits
Full medical, dental, and vision packages, with generous premium coverage
Housing subsidy of $2,000/month for those living within walking distance of the office
Daily lunch and dinner in our office
Relocation support for those moving to West San Jose
Company
Etched
Building the hardware for superintelligence
H1B Sponsorship
Etched has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9)
2024 (11)
2023 (1)
Funding
Current Stage
Growth StageTotal Funding
$125.36MKey Investors
Primary Venture Partners
2024-06-25Series A· $120M
2023-05-16Seed· $5.36M
Recent News
Soma Capital
2026-01-07
Google Patent
2025-04-02
Google Patent
2024-12-04
Company data provided by crunchbase