Machine Learning Engineer, ML Runtime & Optimization jobs in United States
cer-icon
Apply on Employer Site
company-logo

Pony.ai · 5 months ago

Machine Learning Engineer, ML Runtime & Optimization

Pony.ai is a global leader in autonomous mobility, recognized for its innovative technologies and services in the autonomous driving sector. The Machine Learning Engineer in ML Runtime & Optimization will develop technologies to enhance the training and inference of AI models for autonomous driving, collaborating across teams to optimize algorithms and improve performance on advanced compute architectures.

Artificial Intelligence (AI)AutomotiveAutonomous VehiclesTransportation
check
H1B Sponsor Likelynote

Responsibilities

Identifying key applications for current and future autonomous driving problems and performing in-depth analysis and optimization to ensure the best possible performance on current and next-generation compute architectures
Collaborating closely with diverse groups in Pony.ai including both hardware and software to optimize and craft core parallel algorithms as well as to influence the next-generation compute platform architecture design and software infrastructure
Apply model optimization and efficient deep learning techniques to models and optimized ML operator libraries
Work across the entire ML framework/compiler stack (e.g.Torch, CUDA and TensorRT), and system-efficient deep learning models

Qualification

C/C++ programmingPython programmingModel optimizationDeep learning techniquesHardware performance understandingProfilingBenchmarkingParallel programmingSoftware design knowledgeCommon deep learning frameworksGPU optimization knowledgeCommunication skillsCross-functional collaboration

Required

BS/MS or Ph.D in computer science, electrical engineering or a related discipline
Strong programming skills in C/C++ or Python
Experience on model optimization, quantization or other efficient deep learning techniques
Good understanding of hardware performance, regarding CPU or GPU execution model, threads, registers, cache, cost/performance trade-off, etc
Experience with profiling, benchmarking and validating performance for complex computing architectures
Experience in optimizing the utilization of compute resources, identifying and resolving compute and data flow bottlenecks
Strong communication skills and ability to work cross-functionally between software and hardware teams

Preferred

Experience with parallel programming, ideally CUDA, OpenCL or OpenACC
Experience in computer vision, machine learning and deep learning
Strong knowledge of software design, programming techniques and algorithms
Good knowledge of common deep learning frameworks and libraries
Deep knowledge on system performance, GPU optimization or ML compiler

Benefits

Health Care Plan (Medical, Dental & Vision)
Retirement Plan (Traditional and Roth 401k)
Life Insurance (Basic, Voluntary & AD&D)
Paid Time Off (Vacation & Public Holidays)
Family Leave (Maternity, Paternity)
Short Term & Long Term Disability
Free Food & Snacks

Company

Pony.ai develops autonomous driving technology for vehicles that operates using artificial intelligence and machine learning.

H1B Sponsorship

Pony.ai has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (17)
2024 (13)
2023 (32)
2022 (44)
2021 (47)
2020 (24)

Funding

Current Stage
Public Company
Total Funding
$1.34B
Key Investors
NEOM Investment FundGAC Toyota MotorOntario Teachers' Pension Plan
2025-08-18Corporate Round· $12.9M
2024-11-27IPO
2023-10-24Series D· $100M

Leadership Team

leader-logo
James Peng
Founder and CEO
linkedin
Company data provided by crunchbase