Principal Inference Stack Engineer @ Groq | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
External
0
Principal Inference Stack Engineer jobs in Mountain View, CA
Be an early applicantLess than 25 applicants
company-logo

Groq · 7 hours ago

Principal Inference Stack Engineer

ftfMaximize your interview chances
Artificial Intelligence (AI)Electronics
check
H1B Sponsor Likelynote

Insider Connection @Groq

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

Analyze latest ML workloads from Groq partners or Cloud and develop optimization roadmap and strategies to improve inference performance and operating efficiency of workload
Design, develop, and maintain optimizing compiler for Groq's LPU
Expand Groq runtime API to simplify execution model of Groq LPUs
Benchmark and analyze output produced by optimizing compiler and runtime, and drive enhancements to improve its quality-of-results when measured on the Groq LPU hardware.
Manage large multi-person and multi-geo projects and interface with various leads across the company
Mentor junior compiler engineers and collaborate with other senior compiler engineers on the team.
Review and accept code updates to compiler passes and IR definitions.
Work with HW teams and architects to drive improvements in architecture and SW compiler
Publish novel compilation techniques to Groq's TSP at top-tier ML, Applications, Compiler, and Computer Architecture conferences.

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

C/C++LLVMML frameworksHPC workloadsDeep LearningSpatial architecturesDistributed systemsFunctional programmingML IR representations

Required

10+ years of experience in the area of computer science/engineering or related
5+ years of direct experience with C/C++ and runtime frameworks
Knowledge of LLVM and compiler architecture
Experience with mapping HPC, ML, or Deep Learning workloads to accelerators
Knowledge of spatial architectures such as FPGA or CGRAs an asset
Knowledge with distributed systems and disaggregated compute desired
Knowledge of functional programming an assert
Experience with ML frameworks such as TensorFlow or PyTorch desired
Knowledge of ML IR representations such as ONNX and Deep Learning

Preferred

Knowledge of spatial architectures such as FPGA or CGRAs an asset
Knowledge with distributed systems and disaggregated compute desired
Knowledge of functional programming an assert
Experience with ML frameworks such as TensorFlow or PyTorch desired

Benefits

Equity
Benefits

Company

Groq

twittertwittertwitter
company-logo
Groq radically simplifies compute to accelerate workloads in artificial intelligence, machine learning, and high-performance computing.

H1B Sponsorship

Groq has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (4)
2022 (6)
2021 (18)
2020 (2)

Funding

Current Stage
Growth Stage
Total Funding
$362.55M
Key Investors
Social Capital
2024-08-05Secondary Market· undefined
2024-06-20Secondary Market· undefined
2021-04-14Series C· $300M

Leadership Team

leader-logo
Jonathan Ross
CEO and Founder
linkedin
leader-logo
Stuart C. Pann
COO
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot