Member of Technical Staff, LLM Inference - MAI Superintelligence Team jobs in United States
cer-icon
Apply on Employer Site
company-logo

Microsoft · 5 days ago

Member of Technical Staff, LLM Inference - MAI Superintelligence Team

Microsoft is dedicated to advancing Copilot and other consumer AI products and research. The Member of Technical Staff in the LLM Inference team will work alongside researchers and engineers to implement frontier AI research ideas and improve model inference performance through the introduction of new systems and tools.

Agentic AIApplication Performance ManagementArtificial Intelligence (AI)Business DevelopmentDevOpsInformation ServicesInformation TechnologyManagement Information SystemsNetwork SecuritySoftware
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Work alongside researchers and engineers to implement frontier AI research ideas
Introduce new systems, tools, and techniques to improve model inference performance
Build tools to help debug performance bottlenecks, numeric instabilities, and distributed systems issues
Build tools and establish processes to enhance the team’s collective productivity
Find ways to overcome roadblocks and deliver your work to users quickly and iteratively
Enjoy working in a fast-paced, design-driven product development cycle
Embody our Culture and Values

Qualification

Generative AIDistributed computingGPU kernel programmingPython expertiseLarge scale production inferenceProfilingOptimizing PyTorchOpen source inference frameworksC programmingC++ programmingC# programmingJava programmingJavaScript programming

Required

Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience

Preferred

Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Experience with generative AI
Experience with distributed computing
Python and Python ecosystem (eg. uv, pybind/nanobind, FastAPI) expertise
Experience with large scale production inference
Experience with GPU kernel programming
Experience benchmarking, profiling, and optimizing PyTorch generative AI models
Experience with open source inference frameworks like vLLM and SGLang
Working experience and conversant with the material in the JAX scaling book

Benefits

Certain roles may be eligible for benefits and other compensation.

Company

Microsoft

company-logo
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.

H1B Sponsorship

Microsoft has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)

Funding

Current Stage
Public Company
Total Funding
$1M
Key Investors
Technology Venture Investors
2022-12-09Post Ipo Equity
1986-03-13IPO
1981-09-01Series Unknown· $1M

Leadership Team

leader-logo
Satya Nadella
Chairman and CEO
linkedin
leader-logo
Vukani Mngxati
Chief Executive Officer - Microsft South Africa
linkedin
Company data provided by crunchbase