hackajob · 15 hours ago
Software Development Manager, Neuron Testing Service, AWS Neuron SDK at Amazon Development Center U.S., Inc.
hackajob is collaborating with Archer to connect them with exceptional tech professionals for this role. As the Software Development Manager for the Neuron Testing Infrastructure Team, you will lead a team to develop and maintain a critical testing service for AWS Neuron SDK, ensuring high availability and operational excellence.
Artificial Intelligence (AI)Generative AIHuman ResourcesRecruitingSoftware
Responsibilities
Lead a talented team of engineers to develop and maintain the critical testing service that enables continuous integration and validation across the entire Neuron SDK Development organization
Oversee the design, development, and operation of our large-scale EKS-based test execution platform that manages thousands of test runs daily across pre-release hardware, multiple EC2 instance types, and diverse software configurations
Manage the full lifecycle of a high-demand, business-critical service that directly impacts the velocity and quality of AWS Neuron releases
Ensure the platform maintains strict availability goals while scaling to meet growing demand from development teams
Integrate new EC2 instance types and pre-released hardware
Implement advanced queue management algorithms
Optimize resource utilization across large EKS clusters
Maintain operational excellence
Collaborate with cross-functional teams including compiler, runtime, and framework teams to ensure their testing needs are met efficiently and reliably
Qualification
Required
3+ years of engineering team management experience
7+ years of working directly within engineering teams experience
3+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
8+ years of leading the definition and development of multi tier web services experience
Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations
Experience partnering with product or program management teams
Hands-on experience managing large-scale EKS clusters (500+ nodes) in production environments
Experience with queue management systems and resource scheduling
Preferred
Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy
Experience in recruiting, hiring, mentoring/coaching and managing teams of Software Engineers to improve their skills, and make them more effective, product software engineers
Experience with logging and monitoring tools, such as: AWS CloudWatch, Datadog, New Relic and Splunk
Experience with Kubernetes at scale, including autoscaling, resource optimization, and multi-tenant architectures
Benefits
Equity
Sign-on payments
Medical
Financial
Other benefits
Company
hackajob
The AI-native tech hiring platform trusted by enterprises, scale-ups, and 1M+ tech professionals worldwide.
Funding
Current Stage
Growth StageTotal Funding
$33MKey Investors
Volition CapitalDowning VenturesTechstars
2023-05-03Series B· $25M
2018-10-25Series A· $6.7M
2017-03-31Seed· $0.58M
Recent News
2025-10-23
2025-09-26
2025-09-12
Company data provided by crunchbase