GenAI Systems Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Modular · 16 hours ago

GenAI Systems Engineer

Modular is on a mission to revolutionize AI infrastructure by rebuilding the AI software stack. As a GenAI Systems Engineer, you'll architect robust frameworks and optimize advanced inference processes to enhance the Modular platform for AI model deployment.

AI InfrastructureArtificial Intelligence (AI)Generative AIMachine LearningSoftware
check
H1B Sponsor Likelynote

Responsibilities

Leverage a broad understanding of available libraries and concurrency techniques to inform high impact architecture decisions
Identify and implement architecture-level optimizations in complex distributed systems
Architect and implement building blocks and APIs to accelerate the development of advanced distributed optimizations
Lead cross-functional projects spanning multiple teams and multiple layers of a deep tech stack
Build beautiful abstractions to seamlessly weave async RESTful layers with intensive data processing layers
Collaborate with cloud inference team to maximize flexibility in scalable cluster deployments
Develop extensible customization interfaces to support open source community models and features
Develop detailed and intuitive metrics, logging, and profiling tools

Qualification

PythonSystems programmingPerformance optimizationDistributed systemsSoftware architectureConcurrency techniquesLow-latency applicationsAsync programmingCollaborationCreativity

Required

Expert-level Python programming with deep understanding of asyncio and event loops
5+ years of systems programming experience with focus on performance and concurrency
Hands on experience with robust low-latency applications running production workloads
Extensive experience designing software architecture, interfaces, and collaboration
Deep understanding of the fundamentals of profiling, benchmarking, and performance optimization
Creativity and curiosity for learning and solving complex distributed systems problems

Preferred

Experience working inside high-performance ML inference systems (e.g. vLLM, SGLang, etc.)
Experience with Kubernetes, containers, microservices, and cloud-native architectures
Experience with graph based (e.g. dataflow, actors) programming models and runtimes
Experience with distributed runtimes such as Ray, Open MPI, Dask, Spark, etc

Benefits

Premier insurance plans
Up to 5% 401k matching
Flexible paid time off
Stock options
Annual target bonus
Equity
Benefits

Company

Modular

twittertwittertwitter
company-logo
Modular provides AI infrastructure for deployment, serving, and programming GPUs.

H1B Sponsorship

Modular has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (10)
2024 (6)
2023 (8)
2022 (4)

Funding

Current Stage
Growth Stage
Total Funding
$380M
Key Investors
US Innovative Technology FundGeneral CatalystGoogle Ventures
2025-09-24Series C· $250M
2023-08-24Series B· $100M
2022-06-30Seed· $30M

Leadership Team

leader-logo
Chris Lattner
CEO + Co-Founder
linkedin
leader-logo
Tim Davis
Co-Founder & President
linkedin

Recent News

Company data provided by crunchbase