hackajob · 10 hours ago
Senior Lead Software Engineer - AI Platform Engineer
hackajob is collaborating with J.P. Morgan to connect them with exceptional tech professionals for this role. As a Senior Lead Software Engineer at JPMorgan Chase, you will enhance, build, and deliver trusted, market-leading technology products while driving significant business impact through your technical expertise.
Artificial Intelligence (AI) · Generative AI · Human Resources · Recruiting · Software
Responsibilities
Provide technical guidance and direction to support business objectives, collaborating with technical teams, contractors, and vendors
Develop secure, high-quality production code, and review and debug code written by others
Influence product design, application functionality, and technical operations through informed decision-making
Advocate for firmwide frameworks, tools, and practices within the Software Development Life Cycle
Promote a culture of diversity, equity, inclusion, and respect within the team
Architect and deploy secure, scalable cloud infrastructure platforms optimized for AI and machine learning workloads
Collaborate with AI teams to translate computational needs into infrastructure requirements
Monitor, manage, and optimize cloud resources for performance and cost efficiency
Design and implement continuous integration and delivery pipelines for machine learning workloads
Develop automation scripts and infrastructure as code to streamline deployment and management tasks
Qualifications
Required
Formal training or certification in software engineering concepts with 5+ years of applied experience
Hands-on experience in system design, application development, testing, and operational stability
Proficiency in programming languages such as Python and/or Golang
Ability to independently tackle design and functionality problems with minimal oversight
Background in Computer Science, Computer Engineering, Mathematics, or a related technical field
Strong knowledge of cloud computing delivery models (IaaS, PaaS, SaaS) and deployment models (Public, Private, Hybrid Cloud)
Proficiency in Linux environments, including scripting and administration
Foundational understanding of machine learning concepts, including transformer architecture, ML training, and inference
Experience in solutions design and engineering, containerization (Docker, Kubernetes), and cloud service providers (AWS, Azure, GCP)
Experience with Infrastructure as Code (Terraform, CloudFormation) and automation tools (Ansible, Chef, Puppet)
Deep understanding of cloud component architecture: Microservices, Containers, IaaS, Storage, Security, and routing/switching technologies
Preferred
Foundational understanding of NVIDIA GPU infrastructure software (e.g., NVIDIA DCGM, BCM, Triton Inference Server)
Hands-on experience with ML frameworks and tooling such as PyTorch and TensorBoard
Experience with observability tools such as Prometheus and Grafana
Experience in ML Ops and associated tooling like MLflow
Experience with high-performance computing and machine learning frameworks such as vLLM, Ray, and Slurm
Strong background in network architecture, database programming (SQL/NoSQL), and data modeling
Familiarity with cloud data services and big data processing tools
Benefits
Comprehensive health care coverage
On-site health and wellness centers
A retirement savings plan
Backup childcare
Tuition reimbursement
Mental health support
Financial coaching
Company
hackajob
The AI-native tech hiring platform trusted by enterprises, scale-ups, and 1M+ tech professionals worldwide.
Funding
Current Stage: Growth Stage
Total Funding: $33M
Key Investors: Volition Capital, Downing Ventures, Techstars
2023-05-03 · Series B · $25M
2018-10-25 · Series A · $6.7M
2017-03-31 · Seed · $0.58M
Company data provided by Crunchbase