AI Model Serving Specialist jobs in United States

Rackspace Technology · 1 month ago

AI Model Serving Specialist

Rackspace is a leading multicloud solutions provider that empowers customers through innovative technology. The AI Model Serving Specialist role focuses on deploying and optimizing AI workloads for enterprise clients, ensuring secure and efficient model-serving platforms within Rackspace's Private Cloud and Hybrid environments.

Web Hosting
No H1B

Responsibilities

Package and deploy ML/LLM models on Triton, vLLM, or KServe within Kubernetes clusters
Tune performance (batching, KV-cache, TensorRT optimizations) for latency and throughput SLAs
Work with VMware VCF9, NSX-T, and vSAN ESA to ensure GPU resource allocation and multi-tenancy
Implement RBAC, encryption, and compliance controls for sovereign/private cloud customers
Integrate models with Rackspace’s Unified Inference API and API Gateway for multi-tenant routing
Support RAG and agentic workflows by connecting to vector databases and context stores
Configure telemetry for GPU utilization, request tracing, and error monitoring
Collaborate with FinOps to enable usage metering and chargeback reporting
Assist solution architects in onboarding customers and creating reference patterns for BFSI, healthcare, and other verticals
Provide troubleshooting and performance benchmarking guidance
Stay current with emerging model-serving frameworks and GPU acceleration techniques
Contribute to reusable Helm charts, operators, and automation scripts

Qualifications

NVIDIA Triton
Kubernetes
GPU scheduling
Python
VMware VCF9
Observability stacks
RAG architectures
vLLM
CUDA/MIG
NSX-T networking
vSAN storage classes
FinOps principles
Customer-facing communication
NVIDIA Certified Professional
Kubernetes Administrator
VMware VCF Specialist
Rackspace AI Foundations
Problem-solving

Required

Hands-on experience with NVIDIA Triton, vLLM, or similar serving stacks
Strong knowledge of Kubernetes, GPU scheduling, and CUDA/MIG
Familiarity with VMware VCF9, NSX-T networking, and vSAN storage classes
Proficiency in Python and containerization (Docker)
Understanding of observability stacks (Prometheus, Grafana) and FinOps principles
Exposure to RAG architectures, vector DBs, and secure multi-tenant environments
Excellent problem-solving and customer-facing communication skills

Preferred

NVIDIA Certified Professional (AI/ML)
Kubernetes Administrator (CKA)
VMware VCF Specialist
Rackspace AI Foundations (internal)

Benefits

Incentive compensation in the form of an annual bonus or other incentives
Equity awards
Employee Stock Purchase Plan (ESPP)

Company

Rackspace Technology

Rackspace Technology is a leading end-to-end hybrid cloud and AI solutions company.

Funding

Current Stage: Late Stage
Total Funding: unknown
2016-08-08: Acquired

Leadership Team

Mark Marino
EVP, Chief Financial Officer
Company data provided by Crunchbase