Generative AI Inference Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Stability AI · 4 hours ago

Generative AI Inference Engineer

Stability AI is seeking passionate Machine Learning Engineers to join their Inference team, focusing on the creative applications of generative AI models. The role involves leading the design and development of customer-facing multi-modal ML inference systems and collaborating with various teams to optimize and deploy cutting-edge models.

Artificial Intelligence (AI)Generative AIImage RecognitionInformation TechnologySoftware

Responsibilities

Lead efforts to drive the design, development of customer-facing multi modal ML inference systems
Work with the Platform and Inference teams on building inference systems for the next generation of models, where you will work on areas such as optimization, model tuning and deployment
Partner with leading cloud providers to deliver hosted Stability AI inference solutions
Be a strategic thought partner for leaders across the organization on driving business impact through machine learning
Be part of the team to bring new Stability models and pipelines into existence
Prototype and productionize inference platform improvements and new features

Qualification

Machine Learning SystemsPython ServicesDeep Learning FrameworksDiffusion ArchitectureNvidia GPU OptimizationCloud DeploymentDockerPrototyping SolutionsOpen-source ML EcosystemCommunication SkillsCollaboration SkillsDocumentation Skills

Required

7+ years working on productionizing machine learning systems, including inference pipeline development
Expert level knowledge on writing and running python services at scale
5+ years working on python scientific stack, pyTorch and at least one high-performance inference framework (e.g. Triton and TensorRT)
Deep understanding of Diffusion Architecture
Experience profiling and optimizing deep neural networks on Nvidia GPUs, using profiling tools such as NVIDIA Nsight
Experience with python-based image manipulation/encoding/decoding frameworks, such as OpenCV
Experience deploying to cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, and Azure
Experience with Docker
Ability to rapidly prototype solutions and iterate on them with tight product deadlines
Strong communication, collaboration, and documentation skills
Experience with the open-source ML ecosystem (HuggingFace, W&B, etc.)

Company

Stability AI

twittertwittertwitter
company-logo
Stability AI is an artificial intelligence company focused on developing open-source generative AI models.

Funding

Current Stage
Growth Stage
Total Funding
$256M
Key Investors
WPPIntel
2025-03-05Corporate Round
2024-06-25Series Unknown· $80M
2023-11-09Convertible Note· $50M

Leadership Team

leader-logo
Prem Akkaraju
CEO
linkedin
leader-logo
Hanno Basse
Chief Technology Officer
linkedin
Company data provided by crunchbase