Senior HPC Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Texas A&M University · 3 months ago

Senior HPC Engineer

Texas A&M University is making significant advancements in artificial intelligence and supercomputing. They are seeking a Senior High Performance Computing Engineer (HPC) to provide technical expertise in the design and deployment of HPC systems, manage large-scale operations, and lead enterprise-wide projects.

Higher Education
check
Growth Opportunities
badNo H1BnoteSecurity Clearance RequirednoteU.S. Citizen Onlynote

Responsibilities

Manage large-scale HPC cluster operations, including OS upgrades, firmware patching, and performance tuning
Oversee networking, security, and infrastructure for HPC systems
Lead the development of specialized HPC computing clouds and scalable storage systems
Collaborate with stakeholders to develop service-based solutions
Serve as a strategic technical resource across departments
Lead enterprise-wide HPC projects using established project management protocols
Mentor junior system administrators and enforce performance standards

Qualification

High Performance ComputingLinux system administrationContainer orchestrationNetworking conceptsSlurm workload managerNVIDIA DGX systemsVirtualization technologiesIaaS platformsDDN storage solutionsNetwork-attached storage

Required

Bachelor's degree in applicable field or equivalent combination of education and experience
12 years of related experience
Must be a United States citizen, permanent resident, or a person granted asylum or refugee status in accordance with 15 CFR, Part 762; 22 CFR §§122.5, 123.22 and 123.26; and 31 CFR § 501.601

Preferred

Experience with High Performance Computing (HPC) environments
Advanced Linux system administration skills
Familiarity with computer networking concepts and protocols
Experience with container orchestration tools such as Kubernetes
Knowledge of Run:ai for AI workload management
Proficiency with Slurm workload manager
Experience working with NVIDIA DGX systems
Understanding of virtualization technologies
Familiarity with Infrastructure as a Service (IaaS) platforms
Experience with DDN storage solutions
Knowledge of network-attached storage systems

Benefits

Health, dental, vision, life and long-term disability insurance with Texas A&M contributing to employee health and basic life premiums
12-15 days of annual paid holidays
Up to eight hours of paid sick leave and at least eight hours of paid vacation each month
Automatically enrollment in the Teacher Retirement System of Texas
Health and Wellness: Free exercise programs and release time
Professional Development: All employees have access to free LinkedIn Learning training, webinars, and limited financial support to attend conferences, workshops, and more
Educational release time and tuition assistance for completing a degree while a Texas A&M employee

Company

Texas A&M University

company-logo
Texas A&M University has a proud history that stretches back to 1876 when The Agricultural and Mechanical College of Texas became the first public institution of higher learning in the state of Texas.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Jen Sommers, M.Ed.
Spectrum Living Learning Community Co-Founder
linkedin
leader-logo
Greg Hartman
Chief Operating Officer and Senior Vice President for Strategic Partnerships, Texas A&M University,
linkedin
Company data provided by crunchbase