Member of Technical Staff, High Performance Computing Engineer - MAI SuperIntelligence Team jobs in United States

Microsoft · 3 hours ago

Member of Technical Staff, High Performance Computing Engineer - MAI SuperIntelligence Team

Microsoft AI is looking for experienced Member of Technical Staff, High Performance Computing Engineers to help build and scale the infrastructure that trains its frontier models. The role involves working on large-scale supercomputers to enhance the capabilities of Copilot, its personal AI, while collaborating with various teams to ensure optimal performance and user experience.
Agentic AI, Application Performance Management, Artificial Intelligence (AI), Business Development, DevOps, Information Services, Information Technology, Management Information Systems, Network Security, Software
Growth Opportunities
H1B Sponsor Likely

Responsibilities

5+ years of hands-on High Performance Compute (HPC) engineering experience
Production experience with HPC schedulers such as SLURM or Kubernetes (k8s)
Expertise in at least one HPC technology area, such as GPUs, storage, or networking, and in the day-to-day maintenance of massive clusters
Strong scripting skills in bash or Python
Work with researchers to solve issues in using HPC clusters and triage job failures as needed
Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
Enjoy working in a fast-paced, design-driven, product development cycle
Embody our Culture and Values

Qualifications

High Performance Computing, HPC schedulers, Cloud infrastructure, Scripting bash, Scripting Python, Large-scale HPC systems, Machine Learning frameworks, Passion for learning, Interpersonal skills, Communication skills, Adaptability

Required

Bachelor's degree in Computer Science or related technical field AND 4+ years of technical engineering experience deploying or operating on-premises or cloud high-performance clusters, AND 4+ years of experience working with high-scale training clusters (e.g., frameworks/tools such as NVIDIA InfiniBand clusters, SLURM, Kubernetes, or Ray), AND 4+ years of experience building scalable services on top of public cloud infrastructure such as Azure, AWS, or GCP, OR equivalent experience

Preferred

Master's degree in Computer Science or related technical field AND 6+ years of technical engineering experience deploying or operating on-premises or cloud high-performance clusters, AND 6+ years of experience working with high-scale training clusters (e.g., frameworks/tools such as NVIDIA InfiniBand clusters, SLURM, Kubernetes, or Ray), AND 6+ years of experience building scalable services on top of public cloud infrastructure such as Azure, AWS, or GCP, OR equivalent experience
Experience with LLM training clusters
Experience working with AI platforms, frameworks, and APIs
Experience using Machine Learning frameworks, including experience using, deploying, and scaling large language models, either personally or professionally
Experience working with large-scale HPC or GPU systems (e.g., NVIDIA H100/GB200 or equivalent)
Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience
Dedication to writing clean, maintainable, and well-documented code with a focus on application quality, performance, and security
Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers, designers, and other engineers
Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders
Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies
Ability to work in a fast-paced environment, manage multiple priorities, and adapt to changing requirements and deadlines
Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team

Company

Microsoft

Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.

H1B Sponsorship

Microsoft has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)

Funding

Current Stage
Public Company
Total Funding
$1M
Key Investors
Technology Venture Investors
2022-12-09 Post-IPO Equity
1986-03-13 IPO
1981-09-01 Series Unknown · $1M

Leadership Team

Satya Nadella
Chairman and CEO
Vukani Mngxati
Chief Executive Officer - Microsoft South Africa
Company data provided by Crunchbase