SF Tensor · 1 week ago

Founding GPU Kernel Engineer

SF Tensor builds software and infrastructure for modern AI and high-performance computing. The company is seeking a Founding GPU Kernel Engineer to hand-optimize GPU kernels for machine learning workloads and to develop automated compiler passes that improve performance across a range of GPU architectures.

Artificial Intelligence (AI) · Cloud Computing · Machine Learning · Software

Responsibilities

Write and hand-optimize GPU kernels for ML workloads (matmuls, attention, normalization, etc.) to set the performance ceilings
Profile at the microarchitectural level: look into SM utilization, warp stalls, memory bank conflicts, register pressure, instruction throughput
Debug performance issues by digging deep into things like clock speeds, thermal throttling, driver behavior, hardware errata
Turn your hand-optimization insights into automated compiler passes (working closely with our compiler team)
Develop performance models that predict how kernels will behave across different GPU architectures
Build tools and methods for systematic kernel optimization
Work with NVIDIA, AMD, and emerging AI accelerators, understanding what is common across vendors and what is vendor-specific
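To make the first responsibility concrete, here is a minimal sketch (illustrative only, not SF Tensor's code) of the kind of kernel this role would start from before tuning for bank conflicts, register pressure, and occupancy: a shared-memory tiled SGEMM. The kernel name, tile size, and the assumption that matrix dimensions are multiples of the tile size are all choices made for this example.

```cuda
#include <cuda_runtime.h>

#define TILE 32  // tile size chosen for illustration; real tuning is per-architecture

// C = A * B, row-major; M x K times K x N.
// Assumes M, N, K are multiples of TILE to keep the sketch short.
__global__ void sgemm_tiled(const float* A, const float* B, float* C,
                            int M, int N, int K) {
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;
    float acc = 0.0f;

    for (int t = 0; t < K; t += TILE) {
        // Cooperative loads into shared memory, coalesced along threadIdx.x
        As[threadIdx.y][threadIdx.x] = A[row * K + t + threadIdx.x];
        Bs[threadIdx.y][threadIdx.x] = B[(t + threadIdx.y) * N + col];
        __syncthreads();

        // Inner product over the tile held in shared memory
        for (int k = 0; k < TILE; ++k)
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();
    }
    C[row * N + col] = acc;
}
```

A kernel like this is roughly where hand-optimization begins; closing the gap to cuBLAS then involves register tiling, vectorized loads, double buffering, and inspecting the generated SASS, which is exactly the profiling and microarchitectural work the role describes.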

Qualifications

GPU architecture · C++ · CUDA · Low-level profiling tools · Kernel optimization · PTX/SASS · Distributed training systems · ML operations mapping · HPC background · Driver development · MLIR · Publications in GPU optimization

Required

Deep expertise in GPU architecture
Proven track record of hand-writing kernels that match or beat vendor libraries (cuBLAS, cuDNN, CUTLASS)
Strong skills with low-level profiling tools: Nsight Compute, Nsight Systems, rocprof, or equivalents
Experience reading and reasoning about PTX/SASS or GPU assembly
Solid systems programming in C++ and CUDA (or ROCm/HIP)
Good understanding of how high-level ML operations map to hardware execution
Experience with distributed training systems: collective ops like all-reduce and all-gather, NCCL/RCCL, multi-node communication patterns

Preferred

HPC background: experience with large-scale scientific computing, MPI, or work in supercomputing
Background in electrical engineering, computer architecture, or hardware design
Driver development experience (NVIDIA, AMD, or other accelerators)
Experience with MLIR, LLVM, or compiler backends
Deep knowledge of distributed ML training: gradient accumulation, activation checkpointing, pipeline/tensor parallelism, ZeRO-style optimizations
Familiarity with custom accelerators: TPUs, Trainium, Inferentia, or similar
Knowledge of high-speed interconnects: NVLink, NVSwitch, InfiniBand, RoCE
Publications or contributions in GPU optimization, HPC, or ML systems
Experience at NVIDIA, AMD, a national lab, or an AI hardware/infrastructure company

Benefits

Bonus
Equity
Benefits
Relocation assistance

Company

SF Tensor

The San Francisco Tensor Company is reinventing the software and infrastructure stack for modern AI and HPC.

Funding

Current Stage
Early Stage
Total Funding
$0.5M
Key Investors
Y Combinator
2025-10-01 · Pre Seed · $0.5M
Company data provided by Crunchbase