Microsoft · 5 days ago
Member of Technical Staff, LLM Inference - MAI Superintelligence Team
Microsoft is dedicated to advancing Copilot and other consumer AI products and research. The Member of Technical Staff in the LLM Inference team will work alongside researchers and engineers to implement frontier AI research ideas and improve model inference performance through the introduction of new systems and tools.
Agentic AIApplication Performance ManagementArtificial Intelligence (AI)Business DevelopmentDevOpsInformation ServicesInformation TechnologyManagement Information SystemsNetwork SecuritySoftware
Responsibilities
Work alongside researchers and engineers to implement frontier AI research ideas
Introduce new systems, tools, and techniques to improve model inference performance
Build tools to help debug performance bottlenecks, numeric instabilities, and distributed systems issues
Build tools and establish processes to enhance the team’s collective productivity
Find ways to overcome roadblocks and deliver your work to users quickly and iteratively
Enjoy working in a fast-paced, design-driven product development cycle
Embody our Culture and Values
Qualification
Required
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Preferred
Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Experience with generative AI
Experience with distributed computing
Python and Python ecosystem (eg. uv, pybind/nanobind, FastAPI) expertise
Experience with large scale production inference
Experience with GPU kernel programming
Experience benchmarking, profiling, and optimizing PyTorch generative AI models
Experience with open source inference frameworks like vLLM and SGLang
Working experience and conversant with the material in the JAX scaling book
Benefits
Certain roles may be eligible for benefits and other compensation.
Company
Microsoft
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.
H1B Sponsorship
Microsoft has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)
Funding
Current Stage
Public CompanyTotal Funding
$1MKey Investors
Technology Venture Investors
2022-12-09Post Ipo Equity
1986-03-13IPO
1981-09-01Series Unknown· $1M
Leadership Team
Recent News
2026-01-16
Morningstar.com
2026-01-16
Company data provided by crunchbase