Microsoft · 2 days ago
Research Intern - Post-Training
Microsoft is a leading technology company seeking a Research Intern for their Human Superintelligence Post-Training team. The role involves designing datasets, advancing model training, and developing data infrastructure while collaborating with global teams on innovative AI projects.
Agentic AIApplication Performance ManagementArtificial Intelligence (AI)Business DevelopmentDevOpsInformation ServicesInformation TechnologyManagement Information SystemsNetwork SecuritySoftware
Responsibilities
Design & evaluate datasets: build high-quality datasets/benchmarks; run ablations to measure impact and improve data effectiveness
Advance model training: contribute to pre-training, post-training, and RL for language and multimodal models
Develop data infrastructure: extend pipelines for ingest, preprocess, filter, and annotate large, heterogeneous data
Data quality & analysis: assess text, image, video, audio, and code data for quality, diversity, and relevance; propose improvements
Tooling & workflows: create lightweight tools for dataset auditing, visualization, and versioning to speed iteration
Research & collaboration: work with researchers/engineers to push research and product boundaries with measurable impact
Qualification
Required
Currently enrolled in a BS/MS/PhD program in computer science, AI/ML, data science, electrical engineering, or a related field
Must have at least one additional quarter/semester of school remaining following the completion of the internship
Candidate must be enrolled in a full time bachelor's, masters, MBA, or PhD program in area relevant for the role during the academic term immediately before their internship
Effective coding skills in Python and modern data/ML libraries (NumPy, Pandas, PyTorch/JAX/TF)
Familiarity with training/evaluating ML models and with basic data-pipeline concepts
Preferred
First-author publication(s) at top-tier AI venues (e.g., NeurIPS, ICML, ICLR, CVPR) or equivalent journals; or demonstrably comparable research impact (e.g., widely used open-source, SOTA results, benchmark wins)
Experience with distributed data or training frameworks (Spark, Ray, Beam; PyTorch DDP/FSDP) and cloud ecosystems (Azure; data lakes)
Exposure to large-scale, un/semi-structured datasets (images, video, audio, code)
Prior work on LLMs, RL/RLHF, post-training, or multimodal models
Contributions to open-source tooling or reproducible research
Clear communication, self-motivated, curiosity, and a bias for hands-on experimentation
Company
Microsoft
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.
H1B Sponsorship
Microsoft has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)
Funding
Current Stage
Public CompanyTotal Funding
$1MKey Investors
Technology Venture Investors
2022-12-09Post Ipo Equity
1986-03-13IPO
1981-09-01Series Unknown· $1M
Leadership Team
Recent News
MarketScreener
2026-01-06
2026-01-06
Company data provided by crunchbase