Rackspace Technology · 1 month ago
AI Model Serving Specialist
Rackspace is a leading multicloud solutions provider that empowers customers through innovative technology. The AI Model Serving Specialist role focuses on deploying and optimizing AI workloads for enterprise clients, ensuring secure and efficient model-serving platforms within Rackspace's Private Cloud and Hybrid environments.
Web Hosting
Responsibilities
Package and deploy ML/LLM models on Triton, vLLM, or KServe within Kubernetes clusters
Tune performance (batching, KV-cache, TensorRT optimizations) for latency and throughput SLAs
Work with VMware VCF9, NSX-T, and vSAN ESA to ensure GPU resource allocation and multi-tenancy
Implement RBAC, encryption, and compliance controls for sovereign/private cloud customers
Integrate models with Rackspace’s Unified Inference API and API Gateway for multi-tenant routing
Support RAG and agentic workflows by connecting to vector databases and context stores
Configure telemetry for GPU utilization, request tracing, and error monitoring
Collaborate with FinOps to enable usage metering and chargeback reporting
Assist solution architects in onboarding customers, creating reference patterns for BFSI, Healthcare, and other verticals
Provide troubleshooting and performance benchmarking guidance
Stay current with emerging model-serving frameworks and GPU acceleration techniques
Contribute to reusable Helm charts, operators, and automation scripts
Qualification
Required
Hands-on experience with NVIDIA Triton, vLLM, or similar serving stacks
Strong knowledge of Kubernetes, GPU scheduling, and CUDA/MIG
Familiarity with VMware VCF9, NSX-T networking, and vSAN storage classes
Proficiency in Python and containerization (Docker)
Understanding of observability stacks (Prometheus, Grafana) and FinOps principles
Exposure to RAG architectures, vector DBs, and secure multi-tenant environments
Excellent problem-solving and customer-facing communication skills
Preferred
NVIDIA Certified Professional (AI/ML)
Kubernetes Administrator (CKA)
VMware VCF Specialist
Rackspace AI Foundations (internal)
Benefits
Incentive compensation opportunities in the form of annual bonus or incentives
Equity awards
Employee Stock Purchase Plan (ESPP)
Company
Rackspace Technology
Rackspace Technology is a leading end-to-end hybrid cloud and AI solutions company.
Funding
Current Stage
Late StageTotal Funding
unknown2016-08-08Acquired
Recent News
GlobeNewswire News Room
2024-01-15
Company data provided by crunchbase