NVIDIA · 11 hours ago
Software Engineering Manager - Cloud Infrastructure Services, DGX Cloud
Maximize your interview chances
Insider Connection @NVIDIA
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Manage a team of Site Reliability engineers, including task planning and code reviews.
Define team strategy and roadmap, and drive adoption of test infrastructure across several product areas in DGX Cloud Computing environment.
Drive technical projects and provide leadership in an innovative and fast-paced environment.
Be responsible for the overall planning, actioning and success of technical projects.
Work closely with product management teams to ensure best-in-class product development.
Contribute technically to the technical projects for DGX Cloud Computing Services.
Interact with key internal stakeholders to provide operational and financial clarity on technical spend
Drive Decision making, visibility and operational rigor across business analytic initiatives such as budget and project & portfolio reporting. Lead efforts related to executive reporting, dashboards, and operational CTO metrics focusing on continuous improvement and evolution to maximize decision making and executive visibility.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
10+ overall years of Experience in engineering.
3+ years of leadership.
Bachelor / Master degree in Computer Science, or equivalent experience.
Experience in Containers / Virtualization environments/ Cluster solutions.
Experience in managing Technical Support / DevOps teams.
Comfortable to Commit to Excellence and deliver projects in tight deadlines.
Strong Knowledge in Unix/Linux.
Experience in a minimum of two of the following programming languages: Perl, Python, GoLang.
Experience implementing tools, process, internal instrumentation, methodologies and resolving blockages.
Experience in designing and implementing large-scale distributed systems.
Demonstrated people management and leadership skills, the proven track record of mentoring and coaching team members.
Ability to quickly learn and evaluate new technologies.
Ability to influence and establish relationships with other software and IT functional groups such as development, server, storage and security teams.
Preferred
Experience in using or running large private and public cloud systems based on Kubernetes, OpenStack and Docker.
Experience running Grafana, OpenTelemetry, Prometheus, and similar observability focused tools.
Interest in crafting, analyzing and fixing large-scale distributed systems.
Benefits
Equity and benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (735)
2022 (892)
2021 (696)
2020 (534)
Funding
Current Stage
Public CompanyTotal Funding
$4.09BKey Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity
Recent News
Crunchbase News
2024-12-23
2024-12-23
Company data provided by crunchbase