Cisco · 17 hours ago
AI Infrastructure Engineer - HPC (Remote/Hybrid)
Cisco is revolutionizing how data and infrastructure connect and protect organizations in the AI era. They are seeking an experienced Lead Engineer to build and manage AI platforms, focusing on GPU compute clusters and advanced technical leadership.
Communications InfrastructureEnterprise SoftwareHardwareSoftware
Responsibilities
Senior Engineer who can lead and motivate teams, present, and communicate complex topics
Technical hands-on role in building and supporting NVIDIA & Cisco UCS based artificial intelligence platforms
Plan, build, and install/upgrade new systems that support NVIDIA DGX and Cisco UCS hardware and software
Automate configuration management, software updates, and maintenance and monitoring of GPU system availability using modern DevOps tools (Ansible, GitLab, etc.)
Lead the advancement of artificial intelligence platforms and practices
Evaluate system performance based on industry-relevant benchmarks
Identify and optimize performance bottlenecks to drive system and workflow efficiency
Administer Linux systems, ranging from powerful GPU-enabled servers to general-purpose compute systems
Collaborate closely with internal Cisco Business Units, application teams, and cross-functional technical domains
Create written technical designs, documents, and presentations
Stay up to date with AI industry advancements and cutting-edge technologies
Accelerate the delivery of AI capabilities across our portfolio
Design new tools to monitor alerts that will help discover failures or issues before our customers
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
Qualification
Required
7+ years of previous experience deploying and administrating HPC clusters
Proficient in general-purpose programming languages (Python, GoLang, Bash and/or C/C++) and development platforms and technologies
Familiar with GPU resource scheduling managers (Slurm (preferred), Kubernetes, and/or RunAI, etc.)
Preferred
Master's degree or equivalent work experience
Proficient in Hybrid Cloud, Virtualization, and Container technologies
Experience with provisioning tools like Base Command Manager, Warewulf, Satellite, and/or Ironic
Experience with Agile and DevOps operating models, including project tracking tools (e.g., Jira), Git (any Version Control systems), and CI/CD systems (e.g., GitLab, GitHub Actions, Jenkins)
Experience with automation tools like Ansible, SaltStack, Puppet and/or Chef
Deep understanding of operating systems, computer networks, and high-performance applications
Established record of leading technical initiatives, delivering results, and a commitment to fostering a supportive work environment
Hard-working, dedicated to providing quality support for your customers
Benefits
Medical, dental and vision insurance
A 401(k) plan with a Cisco matching contribution
Paid parental leave
Short and long-term disability coverage
Basic life insurance
10 paid holidays per full calendar year
1 floating holiday for non-exempt employees
1 paid day off for employee’s birthday
Paid year-end holiday shutdown
4 paid days off for personal wellness determined by Cisco
16 days of paid vacation time per full calendar year
Flexible vacation time off program
80 hours of sick time off provided on hire date
Up to 80 hours of unused sick time carried forward
Optional 10 paid days per full calendar year to volunteer
Annual bonuses subject to Cisco’s policies
Company
Cisco
Cisco develops, manufactures, and sells networking hardware, telecommunications equipment, and other technology services and products. It is a sub-organization of Cisco Press.
H1B Sponsorship
Cisco has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1238)
2024 (1231)
2023 (1273)
2022 (2127)
2021 (1991)
2020 (1173)
Funding
Current Stage
Public CompanyTotal Funding
unknown1990-02-13IPO
Leadership Team
Recent News
2026-01-14
2026-01-14
2026-01-14
Company data provided by crunchbase