HPC Resident Engineer jobs in United States

PGTEK · 3 days ago

HPC Resident Engineer

PGTEK is a consulting organization dedicated to helping clients achieve their business and technology objectives. They are seeking an experienced HPC Resident Engineer to provide on-site technical expertise and operational support for a large-scale HPC environment running Omega software, ensuring optimal performance and reliability of the infrastructure.

Information Technology & Services
Growth Opportunities

Responsibilities

Provide hands-on, on-site operational support for large-scale HPC clusters running Omega software
Ensure high availability, performance, and reliability of CPU- and GPU-based compute environments
Monitor system health, analyze performance metrics, and proactively identify and mitigate potential issues
Support infrastructure refresh initiatives, including compute node upgrades and replacements, storage migrations, and platform transitions
Lead and support the transition from existing storage platforms to PixStor on Dell
Perform break/fix troubleshooting on current hardware and software components
Coordinate with SLB and other vendors for support of out-of-warranty systems
Provide recommendations to improve cluster management, operational workflows, and overall efficiency
Document system configurations, procedures, and best practices
Act as a trusted technical advisor to the customer's engineering and operations teams

Qualifications

High-Performance Computing (HPC)
Linux-based systems
Omega software
CPU and GPU nodes
High-performance storage solutions
Troubleshooting skills
Cluster scheduling
Performance tuning
Capacity planning
Vendor management

Required

Proven experience supporting High-Performance Computing (HPC) environments in production
Strong knowledge of Linux-based systems in large, clustered environments
Experience supporting Omega or similar seismic processing applications
Hands-on experience with CPU and GPU-accelerated compute nodes
Hands-on experience with large-scale cluster architectures
Hands-on experience with high-performance storage solutions
Strong troubleshooting skills across hardware, operating systems, and cluster components
Experience working with vendors and managing escalations for hardware and software issues
Ability to work independently in an on-site, customer-facing role

Preferred

Experience with PixStor and/or Dell HPC storage platforms
Background supporting seismic processing or energy-sector HPC workloads
Familiarity with cluster scheduling, performance tuning, and capacity planning
Experience supporting infrastructure refresh or data center modernization projects

Benefits

Comprehensive PPO medical coverage with access to a Health Savings Account (HSA) option
Vision plan
Dental insurance with the base dental plan option paid for by PGTEK
Life Insurance
Short- and long-term disability
Critical illness insurance with premiums covered
Matching 401(k) plan
Discount on pet insurance through ASPCA Pet Insurance
Employee Assistance Program is available at no cost to all employees
Generous PTO and holidays
Education Assistance Program is available after 12 months of employment

Company

PGTEK

Our firm provides global IT infrastructure professional services to industry-leading OEMs, infrastructure software companies, and major private and public sector organizations.

Funding

Current Stage
Growth Stage

Leadership Team

Scott Podmilsak
CEO and Owner
Company data provided by Crunchbase