Groq · 4 hours ago
Principal Inference Stack Engineer
Maximize your interview chances
Artificial Intelligence (AI)Electronics
H1B Sponsor Likely
Insider Connection @Groq
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Analyze latest ML workloads from Groq partners or Cloud and develop optimization roadmap and strategies to improve inference performance and operating efficiency of workload
Design, develop, and maintain optimizing compiler for Groq's LPU
Expand Groq runtime API to simplify execution model of Groq LPUs
Benchmark and analyze output produced by optimizing compiler and runtime, and drive enhancements to improve its quality-of-results when measured on the Groq LPU hardware.
Manage large multi-person and multi-geo projects and interface with various leads across the company
Mentor junior compiler engineers and collaborate with other senior compiler engineers on the team.
Review and accept code updates to compiler passes and IR definitions.
Work with HW teams and architects to drive improvements in architecture and SW compiler
Publish novel compilation techniques to Groq's TSP at top-tier ML, Applications, Compiler, and Computer Architecture conferences.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
10+ years of experience in the area of computer science/engineering or related
5+ years of direct experience with C/C++ and runtime frameworks
Knowledge of LLVM and compiler architecture
Experience with mapping HPC, ML, or Deep Learning workloads to accelerators
Knowledge of spatial architectures such as FPGA or CGRAs an asset
Knowledge with distributed systems and disaggregated compute desired
Knowledge of functional programming an assert
Experience with ML frameworks such as TensorFlow or PyTorch desired
Knowledge of ML IR representations such as ONNX and Deep Learning
Preferred
Knowledge of spatial architectures such as FPGA or CGRAs an asset
Knowledge with distributed systems and disaggregated compute desired
Knowledge of functional programming an assert
Experience with ML frameworks such as TensorFlow or PyTorch desired
Benefits
Equity
Benefits
Company
Groq
Groq radically simplifies compute to accelerate workloads in artificial intelligence, machine learning, and high-performance computing.
H1B Sponsorship
Groq has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (4)
2022 (6)
2021 (18)
2020 (2)
Funding
Current Stage
Growth StageTotal Funding
$362.55MKey Investors
Social Capital
2024-08-05Secondary Market· undefined
2024-06-20Secondary Market· undefined
2021-04-14Series C· $300M
Recent News
Idaho Business Review
2024-11-20
2024-11-19
Crunchbase News
2024-10-18
Company data provided by crunchbase