Full Stack LLM Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Cerebras · 2 months ago

Full Stack LLM Engineer

Cerebras Systems builds the world's largest AI chip, providing industry-leading training and inference speeds for machine learning applications. The role involves bringing up state-of-the-art models on Cerebras CSX systems, requiring a system-minded engineer comfortable with the entire software stack.

AI InfrastructureArtificial Intelligence (AI)ComputerHardwareRISCSemiconductorSoftware
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Contribute to the end-to-end bring up of ML models on Cerebras CSX systems
Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning
Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization
Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups

Qualification

PythonC/C++Deep learning frameworksCompiler developmentOptimization techniquesDebugging skillsPerformance profilingModel architecture translationRuntime integration

Required

Bachelor's, Master's, or PhD in Computer Science, Engineering, or a related field
Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc
Strong debugging skills across performance, numerical accuracy, and runtime integration
Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion)
Proficiency in C/C++ programming and experience with low-level optimization
Proven experience in compiler development, particularly with LLVM and/or MLIR
Strong background in optimization techniques, particularly those involving NP-hard problems

Benefits

Competitive salary and benefits package.
Opportunities for professional growth and career advancement.
A dynamic and innovative work environment.
The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

Company

Cerebras

twittertwittertwitter
company-logo
Cerebras Systems is the world's fastest AI inference. We are powering the future of generative AI.

H1B Sponsorship

Cerebras has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (31)
2024 (16)
2023 (18)
2022 (17)
2021 (34)
2020 (23)

Funding

Current Stage
Late Stage
Total Funding
$1.82B
Key Investors
Alpha Wave VenturesVy CapitalCoatue
2025-12-03Secondary Market
2025-09-30Series G· $1.1B
2024-09-27Series Unknown

Leadership Team

leader-logo
Andrew Feldman
CEO & Founder
linkedin
leader-logo
Bob Komin
Chief Financial Officer
linkedin
Company data provided by crunchbase