Role Purpose
Enable enterprise customers to operationalize AI workloads by deploying and optimizing model-serving platforms (e.g., NVIDIA Triton, vLLM, KServe) within Rackspace’s Private Cloud and Hybrid environments. This role bridges AI engineering and platform operations, ensuring secure, scalable, and cost-efficient inference services.
Key Responsibilities : -
Model Deployment & Optimization
Package and deploy ML/LLM models on Triton, vLLM, or KServe within Kubernetes clusters.
Tune performance (batching, KV-cache, TensorRT optimizations) for latency and throughput SLAs.
Platform Integration
Work with VMware VCF9, NSX-T, and vSAN ESA to ensure GPU resource allocation and multi-tenancy.
Implement RBAC, encryption, and compliance controls for sovereign/private cloud customers.
API & Service Enablement
Integrate models with Rackspace’s Unified Inference API and API Gateway for multi-tenant routing.
Support RAG and agentic workflows by connecting to vector databases and context stores.
Observability & FinOps
Configure telemetry for GPU utilization, request tracing, and error monitoring.
Collaborate with FinOps to enable usage metering and chargeback reporting.
Customer Engineering Support
Assist solution architects in onboarding customers, creating reference patterns for BFSI, Healthcare, and other verticals.
Provide troubleshooting and performance benchmarking guidance.
Continuous Improvement
Stay current with emerging model-serving frameworks and GPU acceleration techniques.
Contribute to reusable Helm charts, operators, and automation scripts.
#LI-VM1
#LI-US
"Remote postings are limited to candidates residing within the country specified in the posting location"
About Rackspace Technology
We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.
More on Rackspace Technology
Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.
Loading similar jobs...
Discover fully remote job opportunities in the United States at USA Remote Jobs. Apply for roles like Software Developer, Customer Service Specialist, Project Manager, and more!