Amazon Web Services (AWS) · 2 weeks ago
Machine Learning - Compiler Engineer II, AWS Neuron, Annapurna Labs
Amazon Web Services (AWS) is a leader in cloud computing, and they are seeking a Machine Learning - Compiler Engineer II for their AWS Neuron team. This role involves building the next generation Neuron compiler to optimize ML models for deployment on AWS Inferentia and Trainium servers, solving complex optimization problems to enhance performance and usability.
Agentic AIConsultingDevOpsInformation TechnologySoftwareWeb Development
Responsibilities
You will design, implement, test, deploy and maintain innovative software solutions to transform Neuron compiler’s performance, stability and user-interface
You will work side by side with chip architects, runtime/OS engineers, scientists and ML Apps teams to seamlessly deploy state of the art ML models from our customers on AWS accelerators with optimal cost/performance benefits
You will have opportunity to work with open-source software (e.g., StableHLO, OpenXLA, MLIR) to pioneer optimizing advanced ML workloads on AWS software and hardware
You will also work on building innovative features that will deliver best possible experiences for our customers – developers across the globe
As you design and code solutions to help our team drive efficiencies in compiler architecture, you’ll create compiler optimization and verification passes, build features surface features and peculiarities of AWS accelerators to developers, implement tools to analyze numerical errors, and resolve the root cause of compiler defects
You’ll also participate in design discussions, code review, and communicate with internal (other Neuron SDK and Amazon wide teams) and external stakeholders (open-source communities)
Lastly, work in a startup-like development environment, where you’re always working on the most important stuff
Qualification
Required
3+ years of non-internship professional software development experience
2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
Experience programming with at least one software programming language
Preferred
Master's degree or PhD in Computer Science, or a related technical field
3+ years of experience writing production grade code in object-oriented languages such as C++/Java
Experience in compiler design for CPU/GPU/Vector engines/ML-accelerators
Experience with OpenSource compiler toolset like LLVM/MLIR
Experience with the following technologies: PyTorch, OpenXLA, StableHLO, JAX, TVM, deep learning models, and algorithms
Experience with modern build systems like Bazel/CMake
Benefits
Equity
Sign-on payments
Full range of medical, financial, and/or other benefits
Company
Amazon Web Services (AWS)
Launched in 2006, Amazon Web Services (AWS) began exposing key infrastructure services to businesses in the form of web services -- now widely known as cloud computing.
H1B Sponsorship
Amazon Web Services (AWS) has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)
Funding
Current Stage
Late StageTotal Funding
unknownKey Investors
BIRD Foundation
2025-01-22Grant
Leadership Team
Recent News
2026-01-09
2026-01-09
2026-01-09
Company data provided by crunchbase