Lambda · 2 months ago
Data Center Operations Engineer - Chicago ORD
Lambda is a company that builds Gigawatt-scale AI Factories for Training and Inference. They are seeking a Data Center Operations Engineer to ensure the smooth operation of their AI-IaaS infrastructure, managing everything from hardware sourcing to deployment and ongoing efficiency.
AI InfrastructureArtificial Intelligence (AI)Cloud ComputingData CenterGPUMachine Learning
Responsibilities
Make sure new servers, storage, and networking gear are racked, labeled, cabled, and configured the right way
Keep data center layouts and network topologies up to date in our DCIM software
Coordinate with supply chain and manufacturing teams so systems are deployed on time, especially for large-scale projects
Evaluate current and future data center needs based on growth and technology trends
Manage parts depot inventory and track equipment as it moves from delivery → storage → staging → deployment → handoff
Work closely with hardware support teams to get tickets resolved quickly
Create and manage RMA tickets when needed, making sure faulty parts are replaced and reinstalled without delay
Develop and maintain installation standards (placement, labeling, cabling) to ensure consistency across all data centers
Act as a subject matter expert on data center deployments, supporting sales engagements for major deployments in our facilities or at customer sites
Researching, evaluating, and securing the right hardware and infrastructure components
Building relationships with peers and supply chain to ensure cost-effective and timely supply
Monitoring day-to-day performance of data centers to maintain uptime and efficiency
Troubleshooting and resolving hardware or infrastructure issues quickly
Performing regular maintenance and upgrades to keep systems running at peak performance
Overseeing the full lifecycle of infrastructure, from initial setup to ongoing optimization
Coordinating deployments of new hardware and ensuring seamless integration with existing systems
Managing capacity planning to make sure infrastructure can scale with business growth
Working with product management, support, and other teams to align operational capabilities with company goals
Translating business priorities into technical and operational requirements
Supporting cross-functional projects where infrastructure plays a critical role
Ensuring infrastructure remains stable, secure, and scalable as demand increases
Continuously improving processes to boost efficiency and reduce downtime risks
Qualification
Required
Presence in Chicago/Elk Grove Village Data Center location 5 days per week
Make sure new servers, storage, and networking gear are racked, labeled, cabled, and configured the right way
Keep data center layouts and network topologies up to date in our DCIM software
Coordinate with supply chain and manufacturing teams so systems are deployed on time, especially for large-scale projects
Evaluate current and future data center needs based on growth and technology trends
Manage parts depot inventory and track equipment as it moves from delivery → storage → staging → deployment → handoff
Work closely with hardware support teams to get tickets resolved quickly
Create and manage RMA tickets when needed, making sure faulty parts are replaced and reinstalled without delay
Develop and maintain installation standards (placement, labeling, cabling) to ensure consistency across all data centers
Act as a subject matter expert on data center deployments, supporting sales engagements for major deployments in our facilities or at customer sites
Researching, evaluating, and securing the right hardware and infrastructure components
Building relationships with peers and supply chain to ensure cost-effective and timely supply
Monitoring day-to-day performance of data centers to maintain uptime and efficiency
Troubleshooting and resolving hardware or infrastructure issues quickly
Performing regular maintenance and upgrades to keep systems running at peak performance
Overseeing the full lifecycle of infrastructure, from initial setup to ongoing optimization
Coordinating deployments of new hardware and ensuring seamless integration with existing systems
Managing capacity planning to make sure infrastructure can scale with business growth
Working with product management, support, and other teams to align operational capabilities with company goals
Translating business priorities into technical and operational requirements
Supporting cross-functional projects where infrastructure plays a critical role
Ensuring infrastructure remains stable, secure, and scalable as demand increases
Continuously improving processes to boost efficiency and reduce downtime risks
Preferred
Certifications: Any Linux or project management
Military background
Experience in the machine learning or computer hardware industry
Benefits
Health, dental, and vision coverage for you and your dependents
Wellness and Commuter stipends for select roles
401k Plan with 2% company match (USA employees)
Flexible Paid Time Off Plan that we all actually use
Company
Lambda
Lambda is a cloud-based platform that provides high-performance GPU hardware and cloud infrastructure for AI model training and inference.
H1B Sponsorship
Lambda has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (16)
2024 (1)
2023 (3)
2022 (2)
2021 (2)
2020 (3)
Funding
Current Stage
Late StageTotal Funding
$3.19BKey Investors
TWG GlobalJP MorganMacquarie Group
2025-11-18Series E· $1.5B
2025-08-19Debt Financing· $275M
2025-02-19Series D· $480M
Recent News
2026-01-11
2026-01-09
Company data provided by crunchbase