Senior Cloud Operations Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

NVIDIA · 1 month ago

Senior Cloud Operations Engineer

NVIDIA is a leading technology company seeking a highly skilled Senior Cloud Operations Engineer to join their NGC Cloud team. The role involves driving the efficiency, reliability, and scalability of systems that support global business operations, focusing on automation and operational workflows.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Driving day-to-day interactions with NVIDIA wide IT subsystems, ensuring smooth operational workflows across infrastructure and applications
Crafting and maintaining GitLab CI/CD pipelines to automate build, test, and deployment workflows
Monitoring system health, building/maintaining dashboards, creating alerts, and producing operational reports
Performing user offboarding, access reviews, and compliance-related tasks across multiple systems
Drive interactions with various IT subsystems, ensuring API performance and integration stability meet defined SLAs and SLOs
Coordinating changes and releases between engineering, operations, and security teams
Enforcing security guidelines, managing vulnerability remediation, and collaborating with security teams on audits and assessments
Maintaining documentation, SOPs, and process improvements to enhance operational maturity

Qualification

PythonGitLab CI/CDMonitoring toolsIT operationsJavaRDBMSNoSQLCommunicationProblem-solvingDocumentation skills

Required

8+ years of hands-on experience building/supporting complex services and BS/MS in Computer Science (or equivalent experience)
Knowledge in Python for automation, data handling, and tool development
Experience with monitoring tools (such as Prometheus, Grafana, Datadog, CloudWatch, Splunk) and reporting
Familiarity with ITSM practices, including incident, problem, and modification processes
Ability to perform secure and compliant offboarding and access-related tasks
Strong understanding of IT operations and system workflows
Knowledge in core Java - Collections API, Streams API, Concurrency, I/O
Knowledge in RDBMS and NoSQL (Cassandra, DynamoDb, Redis) databases
Excellent communication skills with the ability to collaborate across multiple teams
Excellent documentation, problem-solving, and communication skills for cross-team alignment

Preferred

Experience designing or implementing automation pipelines or internal operational tools
Background in customer support, technical support, or customer-facing engineering roles
Prior work in a security-conscious or compliance-heavy environment
Ability to build end-to-end monitoring solutions, dashboards, and automated reporting
Strong documentation habits and a continuous-improvement approach

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase