NVIDIA · 2 days ago
Senior Software Engineer, DevOps and Infrastructure Automation
Artificial Intelligence (AI) · GPU
Responsibilities
Build, deploy, and maintain GPU-based servers for use in Metropolis platforms and machine learning applications in test, development, and production environments.
Lead the design of, and take responsibility for, infrastructure components covering network topologies, streaming servers, and security.
Collaborate with software, IT, security, and hardware teams across geographies to solve critical problems and performance issues.
Establish the configuration environment for servers by creating processes and tools for software development, debugging, testing, benchmarking, and documentation.
Automate provisioning and management of bare-metal servers, internal cloud, Microsoft Azure, and Amazon Web Services (AWS).
Automate performance measurement of GPU-based AI applications.
Implement automated monitoring and operating procedures for a range of domains across on-premises and cloud environments.
Build and maintain infrastructure for delivering the software artifacts produced by Metropolis application development teams.
Build detailed documentation that lets customers, partners, and system integrators replicate the prototyped deployment architecture.
Qualifications
Required
BS or MS in Computer Science, Computer Engineering, Electrical Engineering, or a related field (or equivalent experience)
5+ years of proven experience in configuration management and Linux server administration in an engineering hardware lab environment
Excellent programming skills in Python, shell scripting, Ansible, Terraform, and Helm templates
Application performance analysis, measurement, and reporting
Solid understanding of configuring and operating the Elasticsearch, Logstash, Kibana, and Kafka ecosystem
Software build, packaging, and delivery skills with Jenkins, pipeline scripting, Dockerfiles, Artifactory integration, container registries, and Helm package repositories
Good understanding of the Kubernetes ecosystem and Helm-based application deployment patterns
Cloud infrastructure provisioning automation on AWS, GCP, Azure, and OCI using Terraform, CloudFormation, etc.
Preferred
Experience building configuration management, monitoring, and automation tools
Familiarity with managing large numbers of edge servers deployed in indoor and outdoor environments
Strong interpersonal skills
Benefits
Equity
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Additional information is provided below for
reference. (Data provided by the US Department of Labor)
Trends of Total Sponsorships
2023 (735)
2022 (892)
2021 (696)
2020 (534)
Funding
Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-E, ARK Investment Management, SoftBank Vision Fund
2023-05-09 · Grant · $5M
2022-08-09 · Post-IPO Equity · $65M
2017-05-24 · Post-IPO Equity · $4B
Company data provided by Crunchbase