Microsoft · 1 day ago
Principal Software Engineer, CoreAI
Microsoft is at the forefront of redefining how software is built and experienced, particularly through its CoreAI division. The Principal Engineer on the Observability team will shape the architecture and strategy for monitoring and scaling AI training workloads, driving the execution of the observability platform at supercomputer scale.
Application Performance ManagementArtificial Intelligence (AI)Business DevelopmentData ManagementDevOpsInformation ServicesInformation TechnologyManagement Information SystemsNetwork SecuritySoftware
Responsibilities
Set the roadmap and drive the execution of the Observability platform built for AI workloads at a supercomputer scale
Deliver deep insights that empower customers to troubleshoot and optimize their large-scale AI workloads
Leverage production telemetry to influence next-generation infrastructure design, boosting efficiency, reliability, and performance
Mentor and guide engineering teams, elevating technical excellence and championing a customer-focused approach to system design
Qualification
Required
Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field
6+ years of experience building or operating distributed systems, with a strong focus on reliability, scalability, and performance
Proficiency in one or more programming languages such as C#, C++, Go, or Python
Strong understanding of Docker, Kubernetes, scalable architectures, and automation for production systems
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Preferred
Excellent analytical and problem-solving skills, with the ability to extract customer pain points, synthesize ambiguous requirements, and design clear, scalable solutions
Expertise with distributed observability technologies (e.g., Prometheus, OpenTelemetry, Grafana) and 2+ years of experience designing or scaling telemetry pipelines for high-throughput production systems
Advanced, hands-on experience with production ML systems
Company
Microsoft
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.
H1B Sponsorship
Microsoft has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)
Funding
Current Stage
Public CompanyTotal Funding
$1MKey Investors
Technology Venture Investors
2022-12-09Post Ipo Equity
1986-03-13IPO
1981-09-01Series Unknown· $1M
Leadership Team
Recent News
MarketScreener
2026-01-06
2026-01-06
Company data provided by crunchbase