Modular · 11 hours ago
Software Engineering Intern, Cloud Inference
Modular is on a mission to revolutionize AI infrastructure by rebuilding the AI software stack. The Software Engineering Intern on the Cloud Inference team will contribute to building a platform for serving foundation models efficiently and effectively.
AI Infrastructure, Artificial Intelligence (AI), Generative AI, Machine Learning, Software
Responsibilities
Contribute directly to the core components of Mammoth
Design high-throughput, low-latency inference services with features such as KV-aware routing and disaggregated inference
Develop a distributed KV-cache manager, KV-cache offloading, and other optimizations that improve cache utilization
Solve the challenges of running large frontier models (e.g., DeepSeek R1) across multiple nodes
Extend Kubernetes APIs and build controllers to support multi-model, multi-node, and multi-cluster deployments
Qualifications
Required
Currently pursuing a Bachelor's or Master's degree in Computer Science, Software Engineering, Mathematics, or a related field
Strong programming skills in any programming language
Interest in distributed systems, cloud infrastructure, or machine learning systems
Curiosity, problem-solving mindset, and ability to learn quickly in a fast-moving environment
Preferred
Familiarity with Kubernetes and cloud-native technologies
Strong programming skills in Go
Experience building efficient, scalable distributed systems
Understanding of LLMs and common serving optimizations
Benefits
Competitive Compensation.
Team Building Events.
Company
Modular
Modular provides AI infrastructure for deployment, serving, and programming GPUs.
H1B Sponsorship
Modular has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (10)
2024 (6)
2023 (8)
2022 (4)
Funding
Current Stage: Growth Stage
Total Funding: $380M
Key Investors: US Innovative Technology Fund, General Catalyst, Google Ventures
2025-09-24 · Series C · $250M
2023-08-24 · Series B · $100M
2022-06-30 · Seed · $30M
Recent News
General Catalyst
2026-01-14
Greylock
2025-12-29
Company data provided by Crunchbase