Research Intern - Post-Training jobs in United States
cer-icon
Apply on Employer Site
company-logo

Microsoft · 2 days ago

Research Intern - Post-Training

Microsoft is a leading technology company seeking a Research Intern for their Human Superintelligence Post-Training team. The role involves designing datasets, advancing model training, and developing data infrastructure while collaborating with global teams on innovative AI projects.

Agentic AIApplication Performance ManagementArtificial Intelligence (AI)Business DevelopmentDevOpsInformation ServicesInformation TechnologyManagement Information SystemsNetwork SecuritySoftware
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design & evaluate datasets: build high-quality datasets/benchmarks; run ablations to measure impact and improve data effectiveness
Advance model training: contribute to pre-training, post-training, and RL for language and multimodal models
Develop data infrastructure: extend pipelines for ingest, preprocess, filter, and annotate large, heterogeneous data
Data quality & analysis: assess text, image, video, audio, and code data for quality, diversity, and relevance; propose improvements
Tooling & workflows: create lightweight tools for dataset auditing, visualization, and versioning to speed iteration
Research & collaboration: work with researchers/engineers to push research and product boundaries with measurable impact

Qualification

PythonML librariesData pipeline conceptsDistributed training frameworksLarge-scale datasetsCuriosityHands-on experimentationClear communicationSelf-motivated

Required

Currently enrolled in a BS/MS/PhD program in computer science, AI/ML, data science, electrical engineering, or a related field
Must have at least one additional quarter/semester of school remaining following the completion of the internship
Candidate must be enrolled in a full time bachelor's, masters, MBA, or PhD program in area relevant for the role during the academic term immediately before their internship
Effective coding skills in Python and modern data/ML libraries (NumPy, Pandas, PyTorch/JAX/TF)
Familiarity with training/evaluating ML models and with basic data-pipeline concepts

Preferred

First-author publication(s) at top-tier AI venues (e.g., NeurIPS, ICML, ICLR, CVPR) or equivalent journals; or demonstrably comparable research impact (e.g., widely used open-source, SOTA results, benchmark wins)
Experience with distributed data or training frameworks (Spark, Ray, Beam; PyTorch DDP/FSDP) and cloud ecosystems (Azure; data lakes)
Exposure to large-scale, un/semi-structured datasets (images, video, audio, code)
Prior work on LLMs, RL/RLHF, post-training, or multimodal models
Contributions to open-source tooling or reproducible research
Clear communication, self-motivated, curiosity, and a bias for hands-on experimentation

Company

Microsoft

company-logo
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.

H1B Sponsorship

Microsoft has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)

Funding

Current Stage
Public Company
Total Funding
$1M
Key Investors
Technology Venture Investors
2022-12-09Post Ipo Equity
1986-03-13IPO
1981-09-01Series Unknown· $1M

Leadership Team

leader-logo
Satya Nadella
Chairman and CEO
linkedin
leader-logo
Vukani Mngxati
Chief Executive Officer - Microsft South Africa
linkedin
Company data provided by crunchbase