
Caterpillar Inc. · 1 day ago

High Performance Computing System Administrator

Caterpillar Inc. is a global team dedicated to creating sustainable communities through innovation and progress. They are seeking a High Performance Computing System Administrator to manage HPC systems and provide technical support, ensuring optimal performance and compliance with IT security standards.

Construction, Machinery Manufacturing, Manufacturing, Mechanical Engineering
No H1B

Responsibilities

Configure, deploy, and maintain the Linux cluster hardware and HPC software application suite, associated storage, and network infrastructure
Administer the team’s hosting and management systems that enable the HPC
Provide technical support and troubleshooting for end users’ issues related to HPC hardware and solver software applications; evaluate job performance and perform application testing
Work on HPC operational and strategic project efforts; participate in user group forums
Ensure compliance with enterprise IT security and technology controls
Evaluate and implement new tools and methods for improved operations and service delivery

Qualifications

Linux operating systems, HPC deployment technologies, Cloud computing (Azure), Cloud computing (AWS), Scripting (Python), Scripting (PowerShell), System and Technology Integration, TCP/IP fundamentals, Application Design, System Testing, Problem Solving

Required

Problem Solving: Knowledge of approaches, tools, techniques for recognizing, anticipating, and resolving organizational, operational or process problems; ability to apply knowledge of problem solving appropriately to diverse situations
Application Design, Architecture: Knowledge of basic activities and deliverables of application design; ability to utilize application design methodologies, tools and techniques to convert business requirements and logical models into a technical application design
System and Technology Integration: Knowledge of the features and facilities of systems; ability to integrate and communicate among applications, databases and technology platforms
System Testing: Knowledge of system and software testing; ability to design, plan and execute system testing strategies and tactics to ensure the quality of software at all stages of the system life cycle

Preferred

Typically 2+ years' experience in administration of heterogeneous IT compute and storage infrastructure
Extensive knowledge of Linux operating systems
Strong scripting capability in one or more languages (Python, PowerShell, shell/Bash, etc.) and Azure/GitLab DevOps CI/CD pipelines
Knowledge of TCP/IP fundamentals
Demonstrated experience and relevant certifications with cloud-based computing resource deployment (Azure, AWS)
Working knowledge of distributed/parallel file systems and storage appliances (Isilon, NetApp, Qumulo, etc.)
Experience with HPC deployment and middleware technologies (Bright Cluster Manager, Altair PBS Pro, Slurm, Torque/Moab)

Benefits

Medical, dental, and vision benefits
Paid time off plan (Vacation, Holidays, Volunteer, etc.)
401(k) savings plans
Health Savings Account (HSA)
Flexible Spending Accounts (FSAs)
Health Lifestyle Programs
Employee Assistance Program
Voluntary Benefits and Employee Discounts
Career Development
Incentive bonus
Disability benefits
Life Insurance
Parental leave
Adoption benefits
Tuition Reimbursement
These benefits also apply to part-time employees

Company

Caterpillar Inc.

For 100 years, we’ve been helping customers build a better, more sustainable world.

Funding

Current Stage: Public Company
Total Funding: $3.51B
Key Investors: US Department of Energy, Advanced Propulsion Centre UK
2025-08-28 · Post-IPO Debt · $3.5B
2024-10-31 · Grant · $5.04M
2019-06-23 · Grant

Leadership Team

George Moubayed
Chief Sustainability and Strategy Officer / Senior Vice President Enterprise Strategy Division
Eric Sporre
Vice President & Global Chief Information Security Officer (CISO)
Company data provided by Crunchbase