Data Center Incident Program Manager, JoinOCI jobs in United States
cer-icon
Apply on Employer Site
company-logo

Oracle · 3 weeks ago

Data Center Incident Program Manager, JoinOCI

Oracle is a world leader in cloud solutions, and they are seeking a Data Center Incident Program Manager to oversee and support all incidents related to critical operations. The role involves managing incident response processes, driving root cause analysis, and collaborating across teams to ensure the reliability and uptime of data center operations.

Data GovernanceData ManagementEnterprise SoftwareInformation TechnologySaaSSoftware
check
H1B Sponsor Likelynote

Responsibilities

Provide comprehensive support for all incidents in data center critical operations, including monitoring, triage, and coordination of responses to issues involving power, cooling, HVAC, network, and other infrastructure systems to ensure uninterrupted service
Support the development and implementation of new training programs for data center teams, focusing on incident response best practices, system familiarity, and proactive risk mitigation to build team capabilities and preparedness
Identify opportunities to automate incident detection, alerting, and resolution tools, integrating with existing systems like BMS and EPMS to streamline workflows, reduce manual intervention, and improve overall response times
Manage and drive root cause analysis for incidents, collaborating with cross-functional teams to develop and implement corrective actions, track progress, and ensure lessons learned are applied to prevent recurrence
Work closely with Data Center Operations, engineering, security, and vendor teams to align on incident protocols, share insights, and foster a culture of continuous improvement in critical operations
Maintain detailed records of all incidents, RCAs, corrective actions, and outcomes. Generate reports for stakeholders to provide visibility into incident trends, resolution effectiveness, and areas for enhancement
Analyze incident data to recommend enhancements in processes, tools, and training, aiming to boost the efficiency, reliability, and resilience of data center operations

Qualification

Incident managementRoot cause analysis (RCA)Data center operationsIncident tooling automationBMSEPMS systemsHigh-pressure situation handlingProblem-solving skillsAttention to detailCommunication skillsCollaboration skills

Required

Minimum of 3 years of experience in incident management, program management, or related roles within data center operations or mission-critical environments
Proven experience supporting incidents in data center infrastructure, including familiarity with systems like BMS, EPMS, power, cooling, and HVAC
Hands-on experience with root cause analysis (RCA), corrective action planning, and incident tooling automation in high-stakes settings
Knowledge of data center architecture, including network, servers, power distribution, environmental controls, and security protocols
Strong problem-solving and analytical skills to lead RCAs and drive solutions under pressure
Attention to detail in documenting incidents, tracking actions, and monitoring compliance
Excellent communication and collaboration skills for training support, cross-team coordination, and reporting
Ability to handle high-pressure situations with composure, prioritizing actions to minimize downtime

Benefits

Medical, dental, and vision insurance, including expert medical opinion
Short term disability and long term disability
Life insurance and AD&D
Supplemental life insurance (Employee/Spouse/Child)
Health care and dependent care Flexible Spending Accounts
Pre-tax commuter and parking benefits
401(k) Savings and Investment Plan with company match
Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
11 paid holidays
Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
Paid parental leave
Adoption assistance
Employee Stock Purchase Plan
Financial planning and group legal
Voluntary benefits including auto, homeowner and pet insurance

Company

Oracle is an integrated cloud application and platform services that sells a range of enterprise information technology solutions.

H1B Sponsorship

Oracle has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1271)
2024 (846)
2023 (995)
2022 (1192)
2021 (985)
2020 (755)

Funding

Current Stage
Public Company
Total Funding
$25.75B
Key Investors
Sequoia Capital
2025-09-24Post Ipo Debt· $18B
2025-02-03Post Ipo Debt· $7.75B
1986-03-12IPO

Leadership Team

leader-logo
Esteban Rubens
Healthcare Field CTO
linkedin
G
Gerard Warrens
Field CTO, Business Strategy and Transformative Technologies
linkedin
Company data provided by crunchbase