Principal AI/ML System Software Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

d-Matrix · 5 months ago

Principal AI/ML System Software Engineer

d-Matrix is focused on unleashing the potential of generative AI to power the transformation of technology. They are seeking a Principal AI/ML System Software Engineer to develop and maintain next-generation AI deployment software, working closely with a team of system software experts.

AI InfrastructureArtificial Intelligence (AI)Cloud InfrastructureData CenterSemiconductor
check
H1B Sponsor Likelynote

Responsibilities

The role requires you to be part of the team that helps productize the SW stack for our AI compute engine
As part of the software team, you will be responsible for the development, enhancement, and maintenance of the next-generation AI deployment software
You have had past experience working across all aspects of the full-stack toolchain and understand the nuances of what it takes to optimize and trade-off various aspects of hardware-software co-design
You are able to build and scale software deliverables in a tight development window
You will work with a team of system software experts to build out the deployment infrastructure, working closely with other software (ML, compilers) and hardware experts in the company

Qualification

C/C++/Python developmentMachine learning fundamentalsDistributed systemsDeep learning frameworksInference servers/model servingSoftware testing fundamentalsMLOps toolsLeadershipTeam playerSelf-motivated

Required

BS in Computer Science, Engineering, Math, Physics, or related degree with 12+ years of industry software development experience
Strong grasp of system software, data structures, computer architecture, and machine learning fundamentals
Proficient in C/C++/Python development in Linux environment and using standard development tools
Experience with distributed, high-performance software design and implementation
Self-motivated team player with a strong sense of ownership and leadership

Preferred

MS or PhD in Computer Science, Electrical Engineering, or related fields
Experience with inference servers/model serving frameworks (such as TensorRT-LLM, vLLM, SGLang, etc.)
Experience with deep learning frameworks (such as PyTorch and TensorFlow)
Experience with deep learning runtimes (such as ONNX Runtime, TensorRT, etc.)
Experience with distributed systems collectives such as NCCL and OpenMPI
Experience with software testing fundamentals
Experience deploying ML workloads (LLMs, VLMs, NLP, etc.) on distributed systems
Experience with Kubernetes, Ray, or other MLOps tools and techniques used from definition to deployment
Prior startup, small team, or incubation experience
Work experience at a cloud provider or AI compute/subsystem company

Company

d-Matrix

twittertwittertwitter
company-logo
D-Matrix is a platform that enables data centers to handle large-scale generative AI inference with high throughput and low latency.

H1B Sponsorship

d-Matrix has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (20)
2024 (15)
2023 (8)
2022 (7)

Funding

Current Stage
Growth Stage
Total Funding
$429M
Key Investors
Temasek HoldingsTSVC
2025-11-12Series C· $275M
2023-09-06Series B· $110M
2022-04-20Series A· $44M

Leadership Team

leader-logo
Peter Buckingham
Senior Vice President, Software Engineering
linkedin
Company data provided by crunchbase