OpenAI · 2 weeks ago
Software Engineer, Inference - Multi Modal
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are looking for a software engineer to help serve OpenAI’s multimodal models at scale, focusing on building reliable infrastructure for real-time audio and image processing.
Agentic AIArtificial Intelligence (AI)Foundational AIGenerative AIMachine LearningNatural Language ProcessingSaaS
Responsibilities
Design and implement inference infrastructure for large-scale multimodal models
Optimize systems for high-throughput, low-latency delivery of image and audio inputs and outputs
Enable experimental research workflows to transition into reliable production services
Collaborate closely with researchers, infra teams, and product engineers to deploy state-of-the-art capabilities
Contribute to system-level improvements including GPU utilization, tensor parallelism, and hardware abstraction layers
Qualification
Required
Experience building and scaling inference systems for LLMs or multimodal models
Worked with GPU-based ML workloads and understand the performance dynamics of large models, especially with complex data like images or audio
Enjoy experimental, fast-evolving work and collaborating closely with research
Comfortable dealing with systems that span networking, distributed compute, and high-throughput data handling
Familiarity with inference tooling like vLLM, TensorRT-LLM, or custom model parallel systems
Own problems end-to-end and are excited to operate in ambiguous, fast-moving spaces
Design and implement inference infrastructure for large-scale multimodal models
Optimize systems for high-throughput, low-latency delivery of image and audio inputs and outputs
Enable experimental research workflows to transition into reliable production services
Collaborate closely with researchers, infra teams, and product engineers to deploy state-of-the-art capabilities
Contribute to system-level improvements including GPU utilization, tensor parallelism, and hardware abstraction layers
Preferred
Experience working with image generation or audio synthesis models in production
Exposure to distributed ML training or system-efficient model design
Company
OpenAI
OpenAI is an AI research and deployment company that develops advanced AI models, including ChatGPT. It is a sub-organization of OpenAI Foundation.
H1B Sponsorship
OpenAI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (1)
2023 (1)
2022 (18)
2021 (10)
2020 (6)
Funding
Current Stage
Growth StageTotal Funding
$79BKey Investors
The Walt Disney CompanySoftBankThrive Capital
2025-12-11Corporate Round· $1B
2025-10-02Secondary Market· $6.6B
2025-03-31Series Unknown· $40B
Recent News
2026-01-09
The Motley Fool
2026-01-09
Company data provided by crunchbase