Model Optimization Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Lenovo · 3 hours ago

Model Optimization Engineer

Lenovo is a global technology powerhouse focused on delivering Smarter Technology for All. They are hiring a Model Optimization Engineer to optimize and deploy large models for edge devices, utilizing various technologies and frameworks to enhance AI performance.

ComputerConsumer ElectronicsElectronicsHardwareMobileWearables
check
H1B Sponsor Likelynote

Responsibilities

Optimization and deployment of large models (LLMs, VLMs, diffusion) for edge devices using quantization (INT4/INT8), pruning, knowledge distillation, and LoRA
Partner with silicon teams to optimize model execution on heterogeneous hardware: NPUs (Qualcomm Hexagon, Google Edge TPU), GPUs, and CPUs
Implement and benchmark deployment frameworks: TensorRT-LLM, ONNX Runtime, ExecuTorch, llama.cpp, MLC-LLM
Drive hardware-software co-design, influencing sensor and silicon roadmaps to enable efficient AI inference
Build ML ops infrastructure: model serving, A/B testing, performance monitoring, continuous optimization
Stay at the forefront of on-device AI: sub-10B parameter models, mixed precision, sparse attention, federated learning

Qualification

Model optimizationQuantization frameworksMobile/edge AI frameworksC++/PythonPerformance optimizationHardware architecturesML ops infrastructureBenchmarking frameworksSoft skills

Required

3+ years in ML engineering or systems, with 3+ years focused on model optimization and deployment
Bachelor's degree in an Engineering or Computer Science
Experience in model compression: quantization (QAT, PTQ), pruning, distillation, low-rank adaptation
Hands-on experience with mobile/edge AI frameworks (TensorRT, ONNX, TFLite, CoreML)
Experience in C++/Python and performance optimization (CUDA, OpenCL, or NPU programming)
Experience in shipping ML models to production on resource-constrained devices

Preferred

Understanding of hardware architectures: NPU/GPU/CPU characteristics, SIMD operations, memory hierarchies

Company

Lenovo Group is a computer technology company that manufactures personal computers, smartphones, televisions, and wearable devices.

H1B Sponsorship

Lenovo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (76)
2024 (52)
2023 (75)
2022 (82)
2021 (58)
2020 (67)

Funding

Current Stage
Public Company
Total Funding
$3.35B
Key Investors
Alat
2025-01-08Post Ipo Debt· $2B
2024-04-01Post Ipo Debt· $500M
2017-10-03Post Ipo Equity· $500M

Leadership Team

leader-logo
Yang Yuanqing
Chairman & CEO
linkedin
leader-logo
Greg Huff
CTO, CSO, and SVP of Development, Quality, and Customer Care, Infrastructure Solutions Group
linkedin
Company data provided by crunchbase