Software Development Manager, Neuron Testing Service, AWS Neuron SDK at Amazon Development Center U.S., Inc. jobs in United States
This job has closed.

hackajob · 15 hours ago

hackajob is collaborating with Archer to connect them with exceptional tech professionals for this role. As the Software Development Manager for the Neuron Testing Infrastructure Team, you will lead a team to develop and maintain a critical testing service for AWS Neuron SDK, ensuring high availability and operational excellence.

Artificial Intelligence (AI) · Generative AI · Human Resources · Recruiting · Software

Responsibilities

Lead a talented team of engineers to develop and maintain the critical testing service that enables continuous integration and validation across the entire Neuron SDK Development organization
Oversee the design, development, and operation of our large-scale EKS-based test execution platform that manages thousands of test runs daily across pre-release hardware, multiple EC2 instance types, and diverse software configurations
Manage the full lifecycle of a high-demand, business-critical service that directly impacts the velocity and quality of AWS Neuron releases
Ensure the platform maintains strict availability goals while scaling to meet growing demand from development teams
Integrate new EC2 instance types and pre-release hardware
Implement advanced queue management algorithms
Optimize resource utilization across large EKS clusters
Maintain operational excellence
Collaborate with cross-functional teams including compiler, runtime, and framework teams to ensure their testing needs are met efficiently and reliably

Qualifications

EKS management · Large-scale distributed systems · Service reliability · Queue management systems · Engineering team management · Multi-tier web services · Kubernetes at scale · Logging · Monitoring tools · Communication skills · Mentoring/coaching · Collaboration with teams

Required

3+ years of engineering team management experience
7+ years of experience working directly within engineering teams
3+ years of experience designing or architecting new and existing systems (design patterns, reliability, and scaling)
8+ years of experience leading the definition and development of multi-tier web services
Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations
Experience partnering with product or program management teams
Hands-on experience managing large-scale EKS clusters (500+ nodes) in production environments
Experience with queue management systems and resource scheduling

Preferred

Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy
Experience in recruiting, hiring, mentoring/coaching, and managing teams of Software Engineers to improve their skills and make them more effective product software engineers
Experience with logging and monitoring tools such as AWS CloudWatch, Datadog, New Relic, and Splunk
Experience with Kubernetes at scale, including autoscaling, resource optimization, and multi-tenant architectures

Benefits

Equity
Sign-on payments
Medical
Financial
Other benefits

Company

hackajob

The AI-native tech hiring platform trusted by enterprises, scale-ups, and 1M+ tech professionals worldwide.

Funding

Current Stage: Growth Stage
Total Funding: $33M
Key Investors: Volition Capital, Downing Ventures, Techstars
2023-05-03 · Series B · $25M
2018-10-25 · Series A · $6.7M
2017-03-31 · Seed · $0.58M

Leadership Team

Mark Chaffey, CEO
Phil Kell, Head of Talent
Company data provided by Crunchbase