Software Development Engineer AI/ML, Inference Serving, AWS Neuron jobs in United States
cer-icon
Apply on Employer Site
company-logo

Amazon · 3 months ago

Software Development Engineer AI/ML, Inference Serving, AWS Neuron

Amazon, through its subsidiary Annapurna Labs, is seeking a Software Development Engineer for its AWS Neuron team, which focuses on developing infrastructure for machine learning models. The role involves leading the design of distributed ML serving systems, optimizing performance, and collaborating with cross-functional teams to enhance inference capabilities for generative AI applications.

Artificial Intelligence (AI)DeliveryE-CommerceFoundational AIRetail
check
H1B Sponsor Likelynote

Responsibilities

Architect and lead the design of distributed ML serving systems optimized for generative AI workloads
Drive technical excellence in performance optimization and system reliability across the Neuron ecosystem
Design and implement scalable solutions for both offline and online inference workloads
Lead integration efforts with frameworks such as vLLM, SGLang, Torch XLA, TensorRT, and Triton
Develop and optimize system components for tensor/data parallelism and disaggregated serving
Implement and optimize custom PyTorch operators and NKI kernels
Mentor team members and provide technical leadership across multiple work streams
Drive architectural decisions that impact the entire Neuron serving stack
Collaborate with customers, product owners, and engineering teams to define technical strategy
Author technical documentation, design proposals, and architectural guidelines

Qualification

Machine Learning FrameworksDistributed Systems DesignPerformance OptimizationSoftware Development Life CycleObject-Oriented DesignTechnical LeadershipMentoringCollaboration

Required

5+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience
5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
5+ years of non-internship professional software development experience
Experience as a mentor, tech lead or leading an engineering team

Preferred

Master's degree in computer science or equivalent
Deep expertise in ML Frameworks/Libraries such as JAX, PyTorch, vLLM, SGLang, Dynamo, TorchXLA, TensorRT

Benefits

Equity
Sign-on payments
A full range of medical, financial, and/or other benefits

Company

Amazon is a tech firm with a focus on e-commerce, cloud computing, digital streaming, and artificial intelligence.

H1B Sponsorship

Amazon has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)

Funding

Current Stage
Public Company
Total Funding
$8.11B
Key Investors
AmazonKleiner Perkins
2023-01-03Post Ipo Debt· $8B
2001-07-24Post Ipo Equity· $100M
1997-05-15IPO

Leadership Team

leader-logo
Douglas J. Herrington
CEO, Worldwide Amazon Stores
linkedin
leader-logo
Werner Vogels
VP & CTO
linkedin
Company data provided by crunchbase