WorkGenius Group · 2 days ago
Sr. Manager, AI Model Deployment & Optimization
WorkGenius Group is a technology company seeking a Sr. Manager for AI Model Deployment & Optimization. The role involves leading AI model deployment and optimization efforts, mentoring a high-performing engineering team, and ensuring alignment of AI workloads with hardware capabilities.
Computer Software
Responsibilities
Lead AI model deployment and optimization across devices, laptops, and cloud environments
Adapt, fine-tune, and optimize open-source and proprietary foundation models for production use
Drive initiatives in model compression, quantization, pruning, and distillation to improve performance on constrained hardware
Partner closely with hardware and systems teams to align AI workloads with accelerator capabilities
Build, mentor, and scale a high-performing applied AI engineering team
Qualification
Required
10+ years in production software or AI/ML engineering
3+ years in technical leadership or people management
Deep expertise in quantization, pruning, distillation, mixed precision, and graph optimization
Hands-on experience with frameworks such as ONNX Runtime, TensorRT, TVM, OpenVINO, RadeonML
Strong cross-functional collaboration, stakeholder management, and communication
Company
WorkGenius Group
WorkGenius Group is the global talent solution for today's fluid labour market.
Funding
Current Stage
Growth StageCompany data provided by crunchbase