Microsoft · 5 hours ago
Research Intern - AI Frameworks (Network Systems and Tools)
Microsoft offers dynamic Research Internships in a range of scientific and technical disciplines. The Research Intern will explore next-generation AI systems through performance modeling and architectural analysis, contributing to innovative research and development efforts.
Agentic AIApplication Performance ManagementArtificial Intelligence (AI)Business DevelopmentDevOpsInformation ServicesInformation TechnologyManagement Information SystemsNetwork SecuritySoftware
Responsibilities
Investigate and evaluate emerging disaggregated KV cache architectures
Implement a hierarchical storage architecture with multiple tiers
GPU Memory: Active working set of KV caches currently used by the model
CPU DRAM: Hot cache for recently used KV chunks using pinned memory for efficient GPU-CPU transfers
Local Storage: Large-scale local caching (NVMe, local disk)
Build Peer-to-Peer (P2P) service KV cache sharing architecture that enables direct, high-performance cache transfer between multiple LLM serving instances without requiring centralized cache servers
Qualification
Required
Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, or a related field
Preferred
Research experience in areas such as computer architecture, AI/ML systems, performance modeling, distributed systems, or hardware–software co-design
Programming skills in Python, C/C++ with experience building prototypes, simulators, or performance analysis tools
Familiarity with modern AI workloads and/or deep learning frameworks (e.g., PyTorch)
Demonstrated ability to define and pursue original research directions in AI systems or architecture
Ability to collaborate effectively with researchers across disciplines and work in cross-group, cross-cultural environments
Proficient communication and presentation skills for sharing complex technical insights
Ability to think creatively and approach system and architecture challenges with unconventional or innovative solutions
Experience with PyTorch, CUDA, Triton, or performance-simulation tools
Background in large-scale system design, AI inference bottleneck analysis, or modeling cost/performance tradeoffs. Understanding of accelerator, memory-system, or interconnect design principles
Company
Microsoft
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.
H1B Sponsorship
Microsoft has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)
Funding
Current Stage
Public CompanyTotal Funding
$1MKey Investors
Technology Venture Investors
2022-12-09Post Ipo Equity
1986-03-13IPO
1981-09-01Series Unknown· $1M
Leadership Team
Recent News
2026-01-16
Morningstar.com
2026-01-16
Company data provided by crunchbase