Lenovo · 3 hours ago
Model Optimization Engineer
Lenovo is a global technology powerhouse focused on delivering Smarter Technology for All. They are hiring a Model Optimization Engineer to optimize and deploy large models for edge devices, utilizing various technologies and frameworks to enhance AI performance.
ComputerConsumer ElectronicsElectronicsHardwareMobileWearables
Responsibilities
Optimization and deployment of large models (LLMs, VLMs, diffusion) for edge devices using quantization (INT4/INT8), pruning, knowledge distillation, and LoRA
Partner with silicon teams to optimize model execution on heterogeneous hardware: NPUs (Qualcomm Hexagon, Google Edge TPU), GPUs, and CPUs
Implement and benchmark deployment frameworks: TensorRT-LLM, ONNX Runtime, ExecuTorch, llama.cpp, MLC-LLM
Drive hardware-software co-design, influencing sensor and silicon roadmaps to enable efficient AI inference
Build ML ops infrastructure: model serving, A/B testing, performance monitoring, continuous optimization
Stay at the forefront of on-device AI: sub-10B parameter models, mixed precision, sparse attention, federated learning
Qualification
Required
3+ years in ML engineering or systems, with 3+ years focused on model optimization and deployment
Bachelor's degree in an Engineering or Computer Science
Experience in model compression: quantization (QAT, PTQ), pruning, distillation, low-rank adaptation
Hands-on experience with mobile/edge AI frameworks (TensorRT, ONNX, TFLite, CoreML)
Experience in C++/Python and performance optimization (CUDA, OpenCL, or NPU programming)
Experience in shipping ML models to production on resource-constrained devices
Preferred
Understanding of hardware architectures: NPU/GPU/CPU characteristics, SIMD operations, memory hierarchies
Company
Lenovo
Lenovo Group is a computer technology company that manufactures personal computers, smartphones, televisions, and wearable devices.
H1B Sponsorship
Lenovo has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (76)
2024 (52)
2023 (75)
2022 (82)
2021 (58)
2020 (67)
Funding
Current Stage
Public CompanyTotal Funding
$3.35BKey Investors
Alat
2025-01-08Post Ipo Debt· $2B
2024-04-01Post Ipo Debt· $500M
2017-10-03Post Ipo Equity· $500M
Leadership Team
Recent News
The Express Tribune
2026-01-12
2026-01-12
Techlicious Latest Articles
2026-01-11
Company data provided by crunchbase