Be an early applicantLess than 25 applicants

Company

Groq · 7 hours ago

Principal Inference Stack Engineer

Mountain View, CA

Full-time

Remote

Lead/Staff

$249K/yr - $407K/yr

10+ years exp

Maximize your interview chances

Artificial Intelligence (AI)Electronics

H1B Sponsor Likely

Insider Connection @Groq

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

Analyze latest ML workloads from Groq partners or Cloud and develop optimization roadmap and strategies to improve inference performance and operating efficiency of workload

Design, develop, and maintain optimizing compiler for Groq's LPU

Expand Groq runtime API to simplify execution model of Groq LPUs

Benchmark and analyze output produced by optimizing compiler and runtime, and drive enhancements to improve its quality-of-results when measured on the Groq LPU hardware.

Manage large multi-person and multi-geo projects and interface with various leads across the company

Mentor junior compiler engineers and collaborate with other senior compiler engineers on the team.

Review and accept code updates to compiler passes and IR definitions.

Work with HW teams and architects to drive improvements in architecture and SW compiler

Publish novel compilation techniques to Groq's TSP at top-tier ML, Applications, Compiler, and Computer Architecture conferences.

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

C/C++LLVMML frameworksHPC workloadsDeep LearningSpatial architecturesDistributed systemsFunctional programmingML IR representations

Required

10+ years of experience in the area of computer science/engineering or related

5+ years of direct experience with C/C++ and runtime frameworks

Knowledge of LLVM and compiler architecture

Experience with mapping HPC, ML, or Deep Learning workloads to accelerators

Knowledge of spatial architectures such as FPGA or CGRAs an asset

Knowledge with distributed systems and disaggregated compute desired

Knowledge of functional programming an assert

Experience with ML frameworks such as TensorFlow or PyTorch desired

Knowledge of ML IR representations such as ONNX and Deep Learning

Preferred

Knowledge of spatial architectures such as FPGA or CGRAs an asset

Knowledge with distributed systems and disaggregated compute desired

Knowledge of functional programming an assert

Experience with ML frameworks such as TensorFlow or PyTorch desired

Benefits

Equity

Benefits

Company

Groq

Groq radically simplifies compute to accelerate workloads in artificial intelligence, machine learning, and high-performance computing.

Founded in 2016

Mountain View, California, USA

51-200 employees

http://groq.com

H1B Sponsorship

Groq has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2023 (4)

2022 (6)

2021 (18)

2020 (2)