Senior AI Systems Performance Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

SambaNova · 2 days ago

Senior AI Systems Performance Engineer

SambaNova is a company specializing in generative AI platforms for enterprise and government organizations. They are seeking a talented ML performance engineer to optimize and scale state-of-the-art foundation models on their dataflow platform, working hands-on with advanced models to enhance throughput, latency, and efficiency.

AnalyticsArtificial Intelligence (AI)Machine LearningSemiconductorSoftware
check
H1B Sponsor Likelynote

Responsibilities

Bring up and optimize cutting-edge foundation models (e.g., DeepSeek, Llama, Qwen, and others) on the SambaNova platform through the SambaNova software stack
Profile and enhance model performance across compiler, runtime, and hardware layers to achieve SOTA throughput and latency
Collaborate with machine learning, compiler, runtime, and hardware teams to deliver co-designed, high-performance AI applications
Integrate the latest advances in model architecture, quantization, scheduling, and memory optimization from both academia and industry
Develop robust, scalable, and efficient end-to-end inference solutions aligned with customer needs
Identify performance bottlenecks and propose dataflow or scheduling optimizations for both single-node and distributed systems

Qualification

Deep learning optimizationCompiler optimizationSoftware-hardware co-designPythonC++ML framework experienceGPU programmingQuantization techniquesAnalytical skillsCollaboration skillsProblem-solving skills

Required

Bachelor's or higher degree in computer science, electrical engineering, or a related field (e.g., applied mathematics, physics, or statistics)
3+ years of experience in one or more of the following areas: Deep learning model development and performance optimization, Compiler, runtime, or kernel-level optimization, Software–hardware co-design or systems performance tuning
Proficiency in Python or C++, with strong foundations in algorithms, data structures, and numerical computing
Experience with at least one major ML framework — PyTorch, TensorFlow, or JAX
Demonstrated ability to analyze and optimize performance in real-world ML pipelines

Preferred

Hands-on experience with LLM or multimodal model training and inference
Background in large-scale distributed training, continuous batching, and high-throughput inference systems
Familiarity with quantization, graph optimization, kernel fusion, and model partitioning
Experience with frameworks such as DeepSpeed, Megatron, vLLM, or TensorRT
Strong GPU programming skills (CUDA, Triton, or OpenCL); experience with cuDNN, cuBLAS, or similar libraries is a plus
Knowledge of memory hierarchy optimization, caching, and scheduling for large-scale model execution
Publication record or open-source contributions in ML systems or performance optimization is a plus

Benefits

95% premium coverage for employee medical insurance
77% premium coverage for dependents
Health Savings Account (HSA) with employer contribution
Dental insurance
Vision insurance
Short/Long term Disability insurance
Basic Life insurance
Voluntary Life insurance
AD&D insurance plans
Flexible Spending Account (FSA) options like Health Care, Limited Purpose, and Dependent Care
Full subscription to Headspace
Gympass+ membership with access to physical gyms
One Medical membership
Counseling services with an Employee Assistance Program

Company

SambaNova

twittertwittertwitter
company-logo
SambaNova develops software and hardware for artificial intelligence and machine learning applications.

H1B Sponsorship

SambaNova has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (31)
2024 (27)
2023 (37)
2022 (41)
2021 (35)
2020 (29)

Funding

Current Stage
Late Stage
Total Funding
$1.14B
Key Investors
SoftBank Vision FundBlackRockIntel Capital
2023-10-01Secondary Market
2021-04-13Series D· $676M
2020-02-25Series C· $250M

Leadership Team

leader-logo
Rodrigo Liang
Founder & CEO
linkedin
leader-logo
Annie Weckesser
CMO
linkedin
Company data provided by crunchbase