Microsoft · 3 hours ago
Member of Technical Staff, High Performance Computing Engineer - MAI SuperIntelligence Team
Microsoft AI is looking for experienced Member of Technical Staff, High Performance Computing Engineers to help build and scale the infrastructure that trains their frontier models. This role involves working on large scale supercomputers to enhance the capabilities of personal AI, Copilot, while collaborating with various teams to ensure optimal performance and user experience.
Agentic AIApplication Performance ManagementArtificial Intelligence (AI)Business DevelopmentDevOpsInformation ServicesInformation TechnologyManagement Information SystemsNetwork SecuritySoftware
Responsibilities
5+ years of hands on High Performance Compute (HPC) engineering experience
Production experience with HPC schedulers such as SLURM or k8s
Expertise in one of the HPC technologies like GPUs, storage, networking or any other aspects and day to day maintenance of massive clusters
Strong scripting skills in bash or Python
Work with researchers to solve issues in using HPC clusters and triaging job failures if needed
Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
Enjoy working in a fast-paced, design-driven, product development cycle
Embody our Culture and Values
Qualification
Required
Do you have a Bachelor's degree in computer science, or related technical field AND 4+ years technical engineering experience with deploying or operating on-premise or cloud high-performance clusters, AND 4+ years experience working with high-scale training clusters (ex. working with frameworks/tools such as nvidia InfiniBand clusters, SLURM, Kubernetes, Ray, etc.), AND 4+ years experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP, + OR equivalent experience?
Preferred
Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with deploying or operating on-premise or cloud high-performance clusters, AND 6+ years experience working with high-scale training clusters (ex. working with frameworks/tools such as nvidia InfiniBand clusters, SLURM, Kubernetes, Ray, etc.), AND 6+ years experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP, + OR equivalent experience
Experience with LLM training clusters
Experience working with AI platforms, frameworks, and APIs
Experience using Machine Learning frameworks, including experience using, deploying, and scaling language learning models, either personally or professionally
Experience working with large-scale HPC or GPU systems (ex. NVIDIA H100/GB200 or equivalent)
Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience
Dedication to writing clean, maintainable, and well-documented code with a focus on application quality, performance, and security
Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers, designers, and other engineers
Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders
Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies
Ability to work in a fast-paced environment, manage multiple priorities, and adapt to changing requirements and deadlines
Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
Company
Microsoft
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.
H1B Sponsorship
Microsoft has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)
Funding
Current Stage
Public CompanyTotal Funding
$1MKey Investors
Technology Venture Investors
2022-12-09Post Ipo Equity
1986-03-13IPO
1981-09-01Series Unknown· $1M
Leadership Team
Recent News
2026-01-16
Morningstar.com
2026-01-16
Company data provided by crunchbase