Software Engineering Manager - Meta Superintelligence Labs - Infra: Optimizations Team jobs in United States
cer-icon
Apply on Employer Site
company-logo

Meta · 2 months ago

Software Engineering Manager - Meta Superintelligence Labs - Infra: Optimizations Team

Meta is seeking hands-on engineering managers to join the Meta SuperIntelligence Lab, making direct contributions to the next generation of Generative AI models. The role involves leading a high-performance team to develop and optimize cutting-edge AI infrastructure and systems.

Computer Software
check
Comp. & Benefits

Responsibilities

Lead and support the team that develops various kernels including but not limited to GEMMs, Attention mechanisms etc. Also, contribute to enabling performance at scale of our inference and training of next generation GenAI (Llama) models
Enable the growth of individual contributors, driving the technical roadmap along with technical leads and expand the impact of the team by growing new skill-sets and capabilities
Lead a high performance team of engineers to deliver new capabilities and efficient compute systems for our fleet
Technical management
Experience in systems architecture, performance, workload-analysis and large scale distributed systems
Work cross-functionally across hardware and software/services team to drive engineering efforts

Qualification

GPU/ASIC kernel developmentDistributed systemsHigh performance computingSystems architectureCUDAPyTorchKernel optimizationsQuantizationTechnical managementPerformance analysisTeam leadership

Required

MS or BS in Computer Science or Electrical/Electronics Engineering or equivalent
3+ years of experience of directly managing or leading a team of engineers with varied skill levels
Experience in leading teams working on high performance computing (HPC) and AI/ML systems
GPU/ASIC-based kernel development and optimization (e.g. CUDA)
Distributed systems for large scale training and serving
Systems Architecture + Performance
Large scale distributed systems
Experience running a large-scale program and dealing with ambiguity

Preferred

Familiarity with the latest techniques in optimizing GenAI workloads
Using frameworks like PyTorch, TorchTriton to develop custom kernels
Understands Kernel enablement and optimizations, including experience working on attention kernels
Understanding GPU memory hierarchy and computation capabilities
Understands low-level CUDA kernel optimizations for inference and training
Experience with Quantization and structure sparsity for low precision training & inference
Understands Optimizers such as Adam, Shampoo, Muon

Benefits

Bonus
Equity
Benefits

Company

Meta's mission is to build the future of human connection and the technology that makes it possible.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Kathryn Glickman
Director, CEO Communications
linkedin
leader-logo
Christine Lu
CTO Business Engineering NA
linkedin
Company data provided by crunchbase