AI Data Center Operations Contractor jobs in United States
cer-icon
Apply on Employer Site
company-logo

SK hynix America · 1 month ago

AI Data Center Operations Contractor

SK hynix America is a global leader in semiconductor innovation, developing advanced memory solutions. They are seeking an AI Data Center Operations Contractor to manage high-performance computing environments that support AI/ML workloads, focusing on operational performance, regulatory compliance, and technical optimization.

Semiconductors
check
H1B Sponsor Likelynote

Responsibilities

Optimize AI Infrastructure Performance: Apply your operational expertise to enhance the performance, reliability, and efficiency of our AI computing systems. Ensure continuous availability of critical infrastructure while managing the unique demands of high-density AI workloads — including extreme power consumption, elevated thermal output, and advanced cooling requirements
Monitor & Manage Critical Systems: Lead daily operations by monitoring real-time power consumption across GPU clusters, proactively managing cooling systems to maintain optimal thermal conditions, and coordinating water resource use for cooling infrastructure. Develop and refine AI-specific Standard Operating Procedures (SOPs) to empower teams to respond swiftly and effectively to routine maintenance and emergency situations
Ensure Regulatory Compliance: Maintain strict adherence to operational permits, environmental regulations, and industry certifications. Conduct regular facility audits, serve as the primary liaison with regulatory inspectors and utility providers, and ensure all operational activities remain fully aligned with permit conditions and certification standards
Lead Technical Optimization Initiatives: Serve as the technical lead in implementing power and thermal management strategies that balance peak performance with energy efficiency. Optimize cooling distribution to eliminate thermal hotspots and strategically schedule maintenance windows to minimize disruptions to AI training and inference workloads
Identify efficiency improvements and develop enhanced monitoring capabilities to support AI workload management
Establish best practices for operational workflows and create comprehensive documentation for critical procedures
Implement automation where appropriate to reduce manual intervention and improve operational consistency

Qualification

AI data center operationsGPU cluster managementPower managementThermal managementPreventive maintenanceIncident responseCooling systemsRegulatory complianceTeam collaborationStrategic thinking

Required

Demonstrate concrete operational capabilities through hands-on experience managing AI or high-performance computing data centers
Minimum 5 years of data center operations experience, including at least 2 years focused specifically on AI/ML infrastructure operations
Direct experience managing facilities supporting GPU clusters with power densities exceeding 30 kW per rack
Deep operational knowledge of cooling systems in AI/ML environments — including liquid cooling, chilled water systems, and advanced thermal management technologies
Proven ability to maintain target operating temperatures in high-density compute environments and troubleshoot cooling system failures under pressure
Comprehensive understanding of power management — including UPS systems, generator operations, PDU management, and electrical load balancing
Skilled in reading and interpreting electrical diagrams, coordinating with utility providers during planned/unplanned outages, and implementing emergency power protocols
Expertise in permit compliance and certification maintenance — including environmental permits, water discharge permits, air quality regulations, and ongoing certifications such as ISO or Uptime Institute operational sustainability standards

Preferred

Bachelor's degree or equivalent practical experience
Technical depth paired with operational pragmatism — combining hands-on facility management with strategic thinking
Experience working in 24/7 operations teams, responding to critical incidents, and maintaining uptime targets exceeding 99.9% in demanding environments
Ability to balance performance requirements with cost efficiency — with a strong commitment to safety and compliance

Company

SK hynix America

twitter
company-logo
Semiconductors are essential to all IT products, and its performance often determines the performance of the final products.

H1B Sponsorship

SK hynix America has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2024 (16)
2023 (3)
2022 (3)
2021 (2)
2020 (2)

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Jennifer Lee
Director of Technology / Evangelist : Pathfinding & Partnerships
linkedin
Company data provided by crunchbase