Software Engineer, Model Inference jobs in United States
cer-icon
Apply on Employer Site
company-logo

OpenAI · 4 days ago

Software Engineer, Model Inference

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are seeking a Software Engineer to optimize AI models for high-volume, low-latency production environments, collaborating with researchers and engineers to enhance performance and efficiency.

Artificial Intelligence (AI)Generative AIMachine LearningNatural Language ProcessingSaaS
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Work alongside machine learning researchers, engineers, and product managers to bring our latest technologies into production
Work alongside researchers to enable advanced research through awesome engineering
Introduce new techniques, tools, and architecture that improve the performance, latency, throughput, and efficiency of our model inference stack
Build tools to give us visibility into our bottlenecks and sources of instability and then design and implement solutions to address the highest priority issues
Optimize our code and fleet of Azure VMs to utilize every FLOP and every GB of GPU RAM of our hardware

Qualification

PyTorchNVidia GPUsDistributed systemsHPC technologiesModern ML architecturesProblem ownershipSelf-directedTeam collaboration

Required

Have an understanding of modern ML architectures and an intuition for how to optimize their performance, particularly for inference
Own problems end-to-end, and are willing to pick up whatever knowledge you're missing to get the job done
Have at least 5 years of professional software engineering experience
Have or can quickly gain familiarity with PyTorch, NVidia GPUs and the software stacks that optimize them (e.g. NCCL, CUDA), as well as HPC technologies such as InfiniBand, MPI, NVLink, etc
Have experience architecting, building, observing, and debugging production distributed systems
Have needed to rebuild or substantially refactor production systems several times over due to rapidly increasing scale
Are self-directed and enjoy figuring out the most important problem to work on
Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to make the team succeed

Preferred

Bonus point if worked on performance-critical distributed systems

Company

OpenAI is an AI research and deployment company that develops advanced AI models, including ChatGPT. It is a sub-organization of OpenAI Foundation.

H1B Sponsorship

OpenAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (1)
2023 (1)
2022 (18)
2021 (10)
2020 (6)

Funding

Current Stage
Growth Stage
Total Funding
$79B
Key Investors
The Walt Disney CompanySoftBankThrive Capital
2025-12-11Corporate Round· $1B
2025-10-02Secondary Market· $6.6B
2025-03-31Series Unknown· $40B

Leadership Team

leader-logo
Sam Altman
CEO & Co-Founder
leader-logo
Greg Brockman
President, Chairman, & Co-Founder
linkedin
Company data provided by crunchbase