Software Engineer - Model Serving Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

Anyscale · 5 hours ago

Software Engineer - Model Serving Infrastructure

Anyscale is on a mission to democratize distributed computing and make it accessible to software developers. They are seeking talented engineers to contribute to the development of next-generation, high-performance machine learning serving systems, focusing on building infrastructure that powers AI applications for millions of users worldwide.

AI InfrastructureArtificial Intelligence (AI)Developer PlatformInformation TechnologyMachine LearningOpen Source
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design and implement intelligent request routing systems that dynamically balance load across thousands of model replicas while maintaining strict latency SLAs
Build sophisticated traffic management systems that seamlessly transition between model versions at scale, handling terabytes of inference requests without dropping a single query
Create reactive systems that predict traffic patterns and scale model replicas from 1 to 10,000+ instances based on real-time demand signals
Architect frameworks for complex ML pipelines where dozens of models need to communicate, share resources, and maintain end-to-end latency guarantees
Build deep introspection tools that make it trivial to debug distributed ML applications—because "works on my laptop" doesn't cut it at scale

Qualification

Distributed SystemsMachine LearningSystems ProgrammingCloud-Native InfrastructureProduction ReliabilityCode QualityPerformance OptimizationOpen Source ContributionsOwnership Mindset

Required

Strong Systems Fundamentals: You understand operating systems, networking, concurrency, and distributed systems at a deep level
Production Experience: You've built and maintained systems that serve real users at scale
Code Quality: You write clean, tested, well-documented code that other engineers love to work with
Ownership Mindset: You take responsibility for your code in production—from design to deployment to incident response

Preferred

Experience with distributed systems frameworks (gRPC, Ray)
Background in ML/AI systems or serving infrastructure
Contributions to major open source projects
Experience with performance optimization and profiling
Knowledge of cloud-native technologies (Kubernetes, Istio, etc.)

Company

Anyscale

twittertwittertwitter
company-logo
Anyscale accelerates the development and productionization of any AI app, on any cloud, at any scale.

H1B Sponsorship

Anyscale has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (33)
2024 (14)
2023 (10)
2022 (10)
2021 (4)
2020 (1)

Funding

Current Stage
Growth Stage
Total Funding
$259M
Key Investors
New Enterprise AssociatesAndreessen Horowitz
2022-08-23Series C· $99M
2021-12-07Series C· $100M
2020-10-21Series B· $40M

Leadership Team

leader-logo
Keerti Melkote
Chief Executive Officer
linkedin
leader-logo
Robert Nishihara
Co-Founder
linkedin
Company data provided by crunchbase