Sr Software Engineer - AI Infrastructure jobs in United States

Oracle · 10 hours ago

Sr Software Engineer - AI Infrastructure

Oracle, a world leader in cloud solutions, is seeking a Senior Software Engineer - AI Infrastructure to lead the development of scalable and secure infrastructure systems for Oracle Cloud Infrastructure (OCI). The role involves designing and delivering automation pipelines for server provisioning, collaborating with cross-functional teams, and driving solutions for next-generation hardware integration.

Data Governance, Data Management, Enterprise Software, Information Technology, SaaS, Software
H1B Sponsor Likely

Responsibilities

Design, develop, and maintain highly available and scalable microservices for OCI's server provisioning and lifecycle management
Lead automation of the full server lifecycle including rack integration, hardware bring-up, provisioning, and firmware management
Build systems that interface directly with bare metal components such as BMCs, ILOMs, NICs, SmartNICs, and GPUs
Develop automation pipelines for provisioning, firmware validation, and observability across OCI's global fleet
Implement firmware pinning and update mechanisms to support deterministic and secure customer environments
Deliver telemetry-backed monitoring and alerting systems to ensure infrastructure health and visibility
Support onboarding of new hardware platforms, including custom silicon and next-gen server technologies (e.g., NVIDIA GB200, AMD, Intel)
Enable secure root-of-trust (RoT) integrations and SmartNIC/HostNIC convergence for next-generation platform reliability
Collaborate with cross-functional teams across Compute, Networking, Security, Datacenter Engineering, and Hardware Development
Contribute to the evolution of OCI infrastructure toward composable hardware and next-generation data center clusters
Drive design reviews, participate in on-call rotations, and contribute to operational excellence and incident prevention
Provide technical leadership in troubleshooting, root cause analysis, and continuous improvement of service reliability

Qualifications

AI Infrastructure, Cloud-scale automation, Microservices development, Server provisioning, Operating systems, Hardware-software integration, Distributed services, Telemetry monitoring, Root cause analysis, Technical leadership, Collaboration

Required

Deep understanding of operating systems, hardware-software integration, distributed services, and cloud-scale automation
Experience in designing, developing, and maintaining highly available and scalable microservices
Experience in leading automation of the full server lifecycle including rack integration, hardware bring-up, provisioning, and firmware management
Experience building systems that interface directly with bare metal components such as BMCs, ILOMs, NICs, SmartNICs, and GPUs
Experience developing automation pipelines for provisioning, firmware validation, and observability across global fleets
Experience implementing firmware pinning and update mechanisms to support deterministic and secure customer environments
Experience delivering telemetry-backed monitoring and alerting systems to ensure infrastructure health and visibility
Experience supporting onboarding of new hardware platforms, including custom silicon and next-gen server technologies
Experience enabling secure root-of-trust (RoT) integrations and SmartNIC/HostNIC convergence
Experience collaborating with cross-functional teams across Compute, Networking, Security, Datacenter Engineering, and Hardware Development
Experience contributing to the evolution of infrastructure toward composable hardware and next-generation data center clusters
Experience driving design reviews, participating in on-call rotations, and contributing to operational excellence and incident prevention
Experience providing technical leadership in troubleshooting, root cause analysis, and continuous improvement of service reliability

Benefits

Medical, dental, and vision insurance, including expert medical opinion
Short term disability and long term disability
Life insurance and AD&D
Supplemental life insurance (Employee/Spouse/Child)
Health care and dependent care Flexible Spending Accounts
Pre-tax commuter and parking benefits
401(k) Savings and Investment Plan with company match
Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
11 paid holidays
Paid sick leave: 72 hours of paid sick leave upon date of hire, refreshed each calendar year. Unused balance carries over each year up to a maximum cap of 112 hours.
Paid parental leave
Adoption assistance
Employee Stock Purchase Plan
Financial planning and group legal
Voluntary benefits including auto, homeowner and pet insurance

Company

Oracle is an integrated cloud application and platform services company that sells a range of enterprise information technology solutions.

H1B Sponsorship

Oracle has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2025 (1271)
2024 (846)
2023 (995)
2022 (1192)
2021 (985)
2020 (755)

Funding

Current Stage
Public Company
Total Funding
$25.75B
Key Investors
Sequoia Capital
2025-09-24 · Post-IPO Debt · $18B
2025-02-03 · Post-IPO Debt · $7.75B
1986-03-12 · IPO

Leadership Team

Esteban Rubens
Healthcare Field CTO
Gerard Warrens
Field CTO, Business Strategy and Transformative Technologies
Company data provided by Crunchbase