Sr. Software Development Engineer, FAR (Frontier AI & Robotics) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Amazon · 2 weeks ago

Sr. Software Development Engineer, FAR (Frontier AI & Robotics)

Amazon's Frontier AI & Robotics team is seeking a Senior Software Development Engineer to work on groundbreaking foundation models and robotics applications. In this role, you will optimize large-scale transformer architectures and collaborate with scientists to enhance model performance, leveraging your expertise in CUDA and TensorRT.

Artificial Intelligence (AI)DeliveryE-CommerceFoundational AIRetail
check
H1B Sponsor Likelynote

Responsibilities

Drive inference optimization strategies for large-scale foundation models using TensorRT, CUDA, and other NVIDIA tools
Collaborate closely with scientists to influence model architectures for optimal hardware utilization
Design and implement efficient compilation pipelines for complex transformer architectures
Develop comprehensive benchmarking frameworks to measure and optimize model performance
Build robust monitoring solutions to ensure reliable model serving at scale
Explore and evaluate emerging optimization techniques including ONNX Runtime and other ML compilers
Maintain high engineering standards through proper testing, documentation, and code review practices
Optimize transformer blocks using custom CUDA kernels and TensorRT optimization techniques
Partner with scientists to analyze model architectures and propose efficiency improvements
Implement and benchmark various optimization strategies for large-scale models
Debug performance bottlenecks using NVIDIA profiling tools
Participate in technical discussions about new model architectures with the science team
Design and maintain performance monitoring systems for production deployment
Prototype new acceleration approaches using emerging compilation frameworks

Qualification

PythonC++CUDATensorRTML optimization frameworksNVIDIA ML stackML compilersTransformer model optimizationPerformance profilingMonitoring systemsLarge-scale ML servingDesign patternsScalingMentoringTeam leadershipReliability

Required

Bachelor's degree in computer science or equivalent
5+ years of non-internship professional software development experience
5+ years of programming with at least one software programming language experience
5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
Experience as a mentor, tech lead or leading an engineering team
Strong expertise in Python, C++ and CUDA programming
Experience with TensorRT or similar ML optimization frameworks
Track record of optimizing ML models for production

Preferred

Expertise in NVIDIA's ML stack (cuDNN, CUDA Graph, etc.)
Experience with ML compilers (ONNX Runtime, TVM, etc.)
Experience with transformer model optimization
Background in performance profiling and optimization
Experience working directly with research teams
Track record of building robust monitoring systems
Experience with large-scale ML serving systems

Benefits

Equity
Sign-on payments
Full range of medical, financial, and/or other benefits

Company

Amazon is a tech firm with a focus on e-commerce, cloud computing, digital streaming, and artificial intelligence.

H1B Sponsorship

Amazon has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)

Funding

Current Stage
Public Company
Total Funding
$8.11B
Key Investors
AmazonKleiner Perkins
2023-01-03Post Ipo Debt· $8B
2001-07-24Post Ipo Equity· $100M
1997-05-15IPO

Leadership Team

leader-logo
Douglas J. Herrington
CEO, Worldwide Amazon Stores
linkedin
leader-logo
Werner Vogels
VP & CTO
linkedin
Company data provided by crunchbase