Machine Learning - Compiler Engineer II, AWS Neuron, Annapurna Labs jobs in United States
cer-icon
Apply on Employer Site
company-logo

Amazon Web Services (AWS) · 2 weeks ago

Machine Learning - Compiler Engineer II, AWS Neuron, Annapurna Labs

Amazon Web Services (AWS) is a leader in cloud computing, and they are seeking a Machine Learning - Compiler Engineer II for their AWS Neuron team. This role involves building the next generation Neuron compiler to optimize ML models for deployment on AWS Inferentia and Trainium servers, solving complex optimization problems to enhance performance and usability.

Agentic AIConsultingDevOpsInformation TechnologySoftwareWeb Development
check
H1B Sponsor Likelynote

Responsibilities

You will design, implement, test, deploy and maintain innovative software solutions to transform Neuron compiler’s performance, stability and user-interface
You will work side by side with chip architects, runtime/OS engineers, scientists and ML Apps teams to seamlessly deploy state of the art ML models from our customers on AWS accelerators with optimal cost/performance benefits
You will have opportunity to work with open-source software (e.g., StableHLO, OpenXLA, MLIR) to pioneer optimizing advanced ML workloads on AWS software and hardware
You will also work on building innovative features that will deliver best possible experiences for our customers – developers across the globe
As you design and code solutions to help our team drive efficiencies in compiler architecture, you’ll create compiler optimization and verification passes, build features surface features and peculiarities of AWS accelerators to developers, implement tools to analyze numerical errors, and resolve the root cause of compiler defects
You’ll also participate in design discussions, code review, and communicate with internal (other Neuron SDK and Amazon wide teams) and external stakeholders (open-source communities)
Lastly, work in a startup-like development environment, where you’re always working on the most important stuff

Qualification

C++JavaCompiler designDeep learning modelsPyTorchOpenXLAStableHLOMLIRTechnical communicationTeam collaboration

Required

3+ years of non-internship professional software development experience
2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
Experience programming with at least one software programming language

Preferred

Master's degree or PhD in Computer Science, or a related technical field
3+ years of experience writing production grade code in object-oriented languages such as C++/Java
Experience in compiler design for CPU/GPU/Vector engines/ML-accelerators
Experience with OpenSource compiler toolset like LLVM/MLIR
Experience with the following technologies: PyTorch, OpenXLA, StableHLO, JAX, TVM, deep learning models, and algorithms
Experience with modern build systems like Bazel/CMake

Benefits

Equity
Sign-on payments
Full range of medical, financial, and/or other benefits

Company

Amazon Web Services (AWS)

company-logo
Launched in 2006, Amazon Web Services (AWS) began exposing key infrastructure services to businesses in the form of web services -- now widely known as cloud computing.

H1B Sponsorship

Amazon Web Services (AWS) has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)

Funding

Current Stage
Late Stage
Total Funding
unknown
Key Investors
BIRD Foundation
2025-01-22Grant

Leadership Team

leader-logo
Matt Garman
Chief Executive Officer
linkedin
leader-logo
Anand Desikan
CTO, CXO Advisor, and Enterprise Technologist
linkedin
Company data provided by crunchbase