OpenAI · 2 weeks ago

Software Engineer, Inference - Multi Modal

United States

Full-time

Remote

Mid, Senior Level

$325K/yr - $490K/yr

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are looking for a software engineer to help serve OpenAI’s multimodal models at scale, focusing on building reliable infrastructure for real-time audio and image processing.

Agentic AI · Artificial Intelligence (AI) · Foundational AI · Generative AI · Machine Learning · Natural Language Processing · SaaS

Growth Opportunities

H1B Sponsor Likely

Responsibilities

Design and implement inference infrastructure for large-scale multimodal models

Optimize systems for high-throughput, low-latency delivery of image and audio inputs and outputs

Enable experimental research workflows to transition into reliable production services

Collaborate closely with researchers, infra teams, and product engineers to deploy state-of-the-art capabilities

Contribute to system-level improvements including GPU utilization, tensor parallelism, and hardware abstraction layers

Qualifications

Inference infrastructure design · GPU-based ML workloads · High-throughput systems · Multimodal models experience · Distributed compute systems · Problem ownership · Collaboration skills · Adaptability

Required

Experience building and scaling inference systems for LLMs or multimodal models

Experience with GPU-based ML workloads and an understanding of the performance dynamics of large models, especially with complex data like images or audio

Enjoy experimental, fast-evolving work and collaborating closely with research

Comfortable dealing with systems that span networking, distributed compute, and high-throughput data handling

Familiarity with inference tooling like vLLM, TensorRT-LLM, or custom model parallel systems

End-to-end ownership of problems and enthusiasm for operating in ambiguous, fast-moving spaces

Preferred

Experience working with image generation or audio synthesis models in production

Exposure to distributed ML training or system-efficient model design

Company

OpenAI

Glassdoor: 4.2

OpenAI is an AI research and deployment company that develops advanced AI models, including ChatGPT. It is a sub-organization of the OpenAI Foundation.

Founded in 2015

San Francisco, California, USA

201-500 employees

https://www.openai.com

H1B Sponsorship

OpenAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for reference. (Data from the US Department of Labor)

[Chart: Distribution of Different Job Fields Receiving Sponsorship; the highlighted segment represents job fields similar to this job]

Trends of Total Sponsorships

2025 (1)

2024 (1)

2023 (1)

2022 (18)

2021 (10)

2020 (6)

Funding

Current Stage

Growth Stage

Total Funding

$79B

Key Investors

The Walt Disney Company · SoftBank · Thrive Capital

2025-12-11 · Corporate Round · $1B

2025-10-02 · Secondary Market · $6.6B

2025-03-31 · Series Unknown · $40B

Leadership Team

Sam Altman

CEO & Co-Founder

Greg Brockman

President, Chairman, & Co-Founder

Recent News

Business Insider

This is the key breakthrough AI still requires to reach superintelligence, according to those building it

2026-01-09

Indian Express

AI showdown: ChatGPT web traffic slips as Gemini’s share rises 3.3%

2026-01-09

The Motley Fool

Why UiPath Stock Rocketed 29% Higher in 2025

2026-01-09

Company data provided by Crunchbase