Microsoft · 18 hours ago
Principal Software Engineer - Azure AI Inferencing
Microsoft is a leading technology company focused on empowering every person and organization on the planet. They are seeking a Principal Software Engineer to drive the design, optimization, and scaling of their Azure AI Inferencing platform, ensuring efficient operation of large AI models. The role involves leading engineering efforts to enhance inference performance and collaborating with various teams to deliver cutting-edge AI solutions.
Agentic AIApplication Performance ManagementArtificial Intelligence (AI)Business DevelopmentDevOpsInformation ServicesInformation TechnologyManagement Information SystemsNetwork SecuritySoftware
Responsibilities
Lead the design and implementation of core inference infrastructure for serving frontier AI models in production
Identify and drive improvements to end-to-end inference performance and efficiency of OpenAI and other state-of-the-art LLMs
Lead the design and implementation of efficient load scheduling and balancing strategies, by leveraging key insights and features of the model and workload
Scale the platform to support the growing inferencing demand and maintain high availability
Deliver critical capabilities required to serve the latest and greatest Gen AI models such as GPT5, Realtime audio, Sora, and enable fast time to market for them
Drive generic features to cater to the needs of customers such as GitHub, M365, Microsoft AI and third-party companies
Collaborate with our partners both internal and external
Mentor engineers on distributed inference best practices
Embody Microsoft's Culture and Values
Qualification
Required
Bachelor's degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or Golang + OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Preferred
4+ years' practical experience working on high scale, reliable online systems
Technical background and foundation in software engineering principles, distributed computing and architecture
Experience in real-time online services with low latency and high throughput
Experience working with L7 network proxies and gateways
Knowledge in Network architecture and concepts (HTTP and TCP Protocols, Authentication and Sessions etc)
Knowledge and experience in OSS, Docker, Kubernetes, C++, Golang, or equivalent programming languages
Cross-team collaboration skills and the desire to collaborate in a team of researchers and developers
Ability to independently lead projects
Benefits
Certain roles may be eligible for benefits and other compensation.
Company
Microsoft
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.
H1B Sponsorship
Microsoft has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)
Funding
Current Stage
Public CompanyTotal Funding
$1MKey Investors
Technology Venture Investors
2022-12-09Post Ipo Equity
1986-03-13IPO
1981-09-01Series Unknown· $1M
Leadership Team
Recent News
2026-01-14
2026-01-14
Company data provided by crunchbase