OpenAI · 1 day ago
Software Engineer, Frontier Systems - Power Management
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. As a Software Engineer on the Frontier Systems team, you will focus on critical infrastructure for power management in large-scale supercomputers, ensuring efficient operations and reliability during model training.
Artificial Intelligence (AI)Generative AIMachine LearningNatural Language ProcessingSaaS
Responsibilities
Develop and implement system-level and software-level solutions to optimize power usage in large-scale supercomputers, ensuring efficient and reliable operations
Build automation to monitor power consumption patterns during training workloads and design algorithms to stabilize these fluctuations, preventing issues with grid reliability
Work with researchers and engineers to design tools for real-time monitoring, detection, and remediation of power-related hardware and system faults
Collaborate cross-functionally to translate complex electrical system requirements into code, while driving continuous improvements in power management solutions
Drive the development of power throttling mechanisms at the IT system level to dynamically adjust power usage based on workload demands and infrastructure limitations
Collaborate with hardware design teams to integrate system-level power control requirements into IT hardware design, ensuring seamless coordination between software-driven power management and hardware capabilities
Qualification
Required
7+ years of software engineering experience with a focus on solving large-scale, system-level challenges
Strong proficiency in Python and familiarity with automation and scripting tools (e.g., shell scripting)
Experience with distributed systems to efficiently aggregate and analyze streaming data
Knowledge of electrical engineering concepts including digital signal processing, power systems, Fast Fourier Transforms, or related areas
Experience in system-level investigations and development of automated solutions to address power management, fault detection, and remediation
Strong analytical skills and the ability to dig into noisy data (experience with SQL, PromQL, Pandas, etc.)
Comfort working with both hardware and software teams to solve multidisciplinary problems
Preferred
Deep expertise with the power characteristics of synchronous workloads (as seen in supercomputing or model training environments)
Knowledge of power control requirements in IT hardware design, with the ability to drive cross-functional collaboration to integrate power management features into hardware systems effectively
Working knowledge of control system fundamentals and how physical systems respond to control strategies
Company
OpenAI
OpenAI is an AI research and deployment company that develops advanced AI models, including ChatGPT. It is a sub-organization of OpenAI Foundation.
H1B Sponsorship
OpenAI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (1)
2023 (1)
2022 (18)
2021 (10)
2020 (6)
Funding
Current Stage
Growth StageTotal Funding
$79BKey Investors
The Walt Disney CompanySoftBankThrive Capital
2025-12-11Corporate Round· $1B
2025-10-02Secondary Market· $6.6B
2025-03-31Series Unknown· $40B
Recent News
2026-01-06
2026-01-06
The Motley Fool
2026-01-06
Company data provided by crunchbase