Sr SDE, AGI Inference- GenAI jobs in United States
cer-icon
Apply on Employer Site
company-logo

Amazon · 7 hours ago

Sr SDE, AGI Inference- GenAI

Amazon is seeking a Senior Software Development Engineer for their Sensory Inference team at AGI, focusing on innovative multi-modal inference solutions. The role involves developing high-performance inference software, optimizing performance across various platforms, and collaborating with research scientists to implement next-generation neural models.

Artificial Intelligence (AI)DeliveryE-CommerceFoundational AIRetail
check
H1B Sponsor Likelynote

Responsibilities

Develop high-performance inference software for a diverse set of neural models, typically in C/C++
Design, prototype, and evaluate new inference engines and optimization techniques
Participate in deep-dive analysis and profiling of production code
Optimize inference performance across various platforms (on-device, cloud-based CPU, GPU, proprietary ASICs)
Collaborate closely with research scientists to bring next-generation neural models to life
Partner with internal and external hardware teams to maximize platform utilization
Work in an Agile environment to deliver high-quality software against tight schedules
Hold a high bar for technical excellence within the team and across the organization

Qualification

C/C++ programmingInference frameworksPerformance optimizationKernel programmingModel compression techniquesAgile environmentMentoringTeam leadership

Required

5+ years of non-internship professional software development experience
5+ years of programming with at least one software programming language experience
5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
Experience as a mentor, tech lead or leading an engineering team
Experience with inference frameworks such as PyTorch, TensorFlow, ONNXRuntime, TensorRT, LLaMA.cpp, etc

Preferred

5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
Experience with inference frameworks such as PyTorch, TensorFlow, ONNXRuntime, TensorRT, LLaMA.cpp
Proficiency in performance optimization for CPU, GPU, or AI hardware
Proficiency in kernel programming for accelerated hardware using programming models such as (but not limited to) CUDA, OpenMP, OpenCL, Vulkan, and Metal
Experience with latency-sensitive optimizations and real-time inference
Knowledge of model compression techniques (quantization, pruning, distillation, etc.)
Experience with LLM efficiency techniques like speculative decoding and long context

Benefits

Health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
401(k) matching
Paid time off
Parental leave

Company

Amazon is a tech firm with a focus on e-commerce, cloud computing, digital streaming, and artificial intelligence.

H1B Sponsorship

Amazon has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)

Funding

Current Stage
Public Company
Total Funding
$8.11B
Key Investors
AmazonKleiner Perkins
2023-01-03Post Ipo Debt· $8B
2001-07-24Post Ipo Equity· $100M
1997-05-15IPO

Leadership Team

leader-logo
Douglas J. Herrington
CEO, Worldwide Amazon Stores
linkedin
leader-logo
Werner Vogels
VP & CTO
linkedin
Company data provided by crunchbase