Model Optimization - Lead Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Lenovo · 17 hours ago

Model Optimization - Lead Engineer

Lenovo is a global technology powerhouse focused on delivering Smarter Technology for All. The Model Optimization Lead Engineer will lead the optimization and deployment of large models for edge devices, collaborating with various teams to ensure efficient AI inference and drive innovative product features.

ComputerConsumer ElectronicsElectronicsHardwareMobileWearables
check
H1B Sponsor Likelynote

Responsibilities

Lead optimization and deployment of large models (LLMs, VLMs, diffusion) for edge devices using quantization (INT4/INT8), pruning, knowledge distillation, and LoRA
Partner with silicon teams to optimize model execution on heterogeneous hardware: NPUs (Qualcomm Hexagon, Google Edge TPU), GPUs, and CPUs
Implement and benchmark deployment frameworks: TensorRT-LLM, ONNX Runtime, ExecuTorch, llama.cpp, MLC-LLM
Drive hardware-software co-design, influencing sensor and silicon roadmaps to enable efficient AI inference
Build ML ops infrastructure: model serving, A/B testing, performance monitoring, continuous optimization
Lead a team of optimization engineers and collaborate with ML researchers, hardware teams, and product managers
Stay at the forefront of on-device AI: sub-10B parameter models, mixed precision, sparse attention, federated learning

Qualification

Model optimizationQuantization frameworksEdge AI runtimesModel compressionMobile/edge AI frameworksC++/PythonPerformance optimizationHardware architecturesTeam leadershipCollaboration

Required

7+ years in ML engineering or systems, with 3+ years focused on model optimization and deployment
Bachelor's Degree in Engineering or Computer Science
Deep expertise in model compression: quantization (QAT, PTQ), pruning, distillation, low-rank adaptation
Hands-on experience with mobile/edge AI frameworks (TensorRT, ONNX, TFLite, CoreML)

Preferred

Understanding of hardware architectures: NPU/GPU/CPU characteristics, SIMD operations, memory hierarchies
Proficiency in C++/Python and performance optimization (CUDA, OpenCL, or NPU programming)
Track record of shipping ML models to production on resource-constrained devices

Company

Lenovo Group is a computer technology company that manufactures personal computers, smartphones, televisions, and wearable devices.

H1B Sponsorship

Lenovo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (76)
2024 (52)
2023 (75)
2022 (82)
2021 (58)
2020 (67)

Funding

Current Stage
Public Company
Total Funding
$3.35B
Key Investors
Alat
2025-01-08Post Ipo Debt· $2B
2024-04-01Post Ipo Debt· $500M
2017-10-03Post Ipo Equity· $500M

Leadership Team

leader-logo
Yang Yuanqing
Chairman & CEO
linkedin
leader-logo
Greg Huff
CTO, CSO, and SVP of Development, Quality, and Customer Care, Infrastructure Solutions Group
linkedin
Company data provided by crunchbase