OpenAI · 4 days ago
Software Engineer, Model Inference
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are seeking a Software Engineer to optimize AI models for high-volume, low-latency production environments, collaborating with researchers and engineers to enhance performance and efficiency.
Artificial Intelligence (AI)Generative AIMachine LearningNatural Language ProcessingSaaS
Responsibilities
Work alongside machine learning researchers, engineers, and product managers to bring our latest technologies into production
Work alongside researchers to enable advanced research through awesome engineering
Introduce new techniques, tools, and architecture that improve the performance, latency, throughput, and efficiency of our model inference stack
Build tools to give us visibility into our bottlenecks and sources of instability and then design and implement solutions to address the highest priority issues
Optimize our code and fleet of Azure VMs to utilize every FLOP and every GB of GPU RAM of our hardware
Qualification
Required
Have an understanding of modern ML architectures and an intuition for how to optimize their performance, particularly for inference
Own problems end-to-end, and are willing to pick up whatever knowledge you're missing to get the job done
Have at least 5 years of professional software engineering experience
Have or can quickly gain familiarity with PyTorch, NVidia GPUs and the software stacks that optimize them (e.g. NCCL, CUDA), as well as HPC technologies such as InfiniBand, MPI, NVLink, etc
Have experience architecting, building, observing, and debugging production distributed systems
Have needed to rebuild or substantially refactor production systems several times over due to rapidly increasing scale
Are self-directed and enjoy figuring out the most important problem to work on
Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to make the team succeed
Preferred
Bonus point if worked on performance-critical distributed systems
Company
OpenAI
OpenAI is an AI research and deployment company that develops advanced AI models, including ChatGPT. It is a sub-organization of OpenAI Foundation.
H1B Sponsorship
OpenAI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (1)
2023 (1)
2022 (18)
2021 (10)
2020 (6)
Funding
Current Stage
Growth StageTotal Funding
$79BKey Investors
The Walt Disney CompanySoftBankThrive Capital
2025-12-11Corporate Round· $1B
2025-10-02Secondary Market· $6.6B
2025-03-31Series Unknown· $40B
Recent News
Analytics Insight: Latest AI, Crypto, Tech News & Analysis
2026-01-07
2026-01-07
2026-01-07
Company data provided by crunchbase