Lead Machine Learning Engineer, Performance and Scalability, Generative AI jobs in United States
info-icon
This job has closed.
company-logo

Adobe · 7 months ago

Lead Machine Learning Engineer, Performance and Scalability, Generative AI

Adobe is a software company that provides its users with digital marketing and media solutions. They are seeking a Lead Engineer to focus on Performance and Scalability for their Generative AI systems, responsible for optimizing high-performance, scalable AI pipelines that support millions of users worldwide.

Artificial Intelligence (AI)ConsultingEnterprise SoftwareGraphic DesignImage RecognitionPhoto EditingSaaSSoftwareUX DesignWeb Design
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Architect and optimize ML pipelines to support scalable inference and model deployment on cloud-based GPU infrastructure (e.g., AWS P5 instances)
Develop and maintain high-throughput serving pipelines for generative AI models, ensuring low-latency, high-performance execution
Enable model serving optimizations by designing systems that support tensor parallelism, quantization, distillation, and caching, in collaboration with ML research teams
Develop automated monitoring and profiling tools to track system efficiency, detect performance regressions, and optimize resource utilization
Optimize GPU resource allocation and orchestration across cloud-based ML workloads
Integrate scalable load testing frameworks to validate model inference performance under high-traffic conditions
Collaborate with infrastructure and applied ML teams to transition models from experimentation to production-ready, cloud-optimized deployments
Establish standard methodologies for scaling and cloud-native ML architectures, ensuring efficient deployment across multi-region cloud environments

Qualification

High-performance ML infrastructureScalable AI systemsPython programmingC++ programmingAWS GPU instancesKubernetesRayONNXTensorRTCUDAAOT compilation techniquesCloud-native architecturesAutoscaling strategiesFault-tolerant ML systemsGPU orchestrationNsightPyTorch ProfilerPerf

Required

8+ years of proven track record in building high-performance ML infrastructure and scalable AI systems
MS, or PHD in computer science or related field
Strong programming skills in Python and C++, with expertise in building ML pipelines and model deployment infrastructure
Experience deploying large-scale ML models in cloud environments, including AWS GPU instances, Kubernetes, Ray, or similar
Experience with model conversion and optimization frameworks like ONNX and TensorRT, as well as AOT compilation techniques
Experience with cloud-native architectures, autoscaling strategies, and fault-tolerant machine learning systems
Proficiency in GPU orchestration, CUDA, and accelerated inference techniques
Hands-on experience with profiling tools (e.g., Nsight, PyTorch Profiler, perf) for system performance analysis
Ability to work in a fast-paced, startup-like environment with multi-functional teams

Benefits

Long-term incentives in the form of a new hire equity award

Company

Adobe is a software company that provides its users with digital marketing and media solutions.

H1B Sponsorship

Adobe has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1160)
2024 (1217)
2023 (750)
2022 (878)
2021 (742)
2020 (477)

Funding

Current Stage
Public Company
Total Funding
$2.5M
Key Investors
Apple
1986-08-20IPO
1984-10-01Series Unknown· $2.5M

Leadership Team

leader-logo
Shantanu Narayen
CEO
leader-logo
Dan Durn
Chief Financial Officer
linkedin
Company data provided by crunchbase