High Performance Computing Engineer - Mid-level jobs in United States
cer-icon
Apply on Employer Site
company-logo

General Dynamics Information Technology · 2 weeks ago

High Performance Computing Engineer - Mid-level

General Dynamics Information Technology is a global technology and professional services company that delivers consulting, technology, and mission services to the U.S. government and defense community. They are seeking a High Performance Computing Engineer who will be responsible for the operations and maintenance of HPC systems, assisting users in deploying jobs, and ensuring optimal performance of complex computing platforms.

Artificial Intelligence (AI)Cloud ComputingConsultingCyber SecurityInformation Technology
badNo H1BnoteSecurity Clearance RequirednoteU.S. Citizen Onlynote

Responsibilities

Responsible for the normal day-to-day HPC operations and maintenance of the HPC systems
Provide day to day systems administration duties for Nvidia GPUs, Commodity Cluster Systems and Cray HPC environments
Perform system monitoring, software installations, debug, upgrades, health checks, and identification/implementation of automated business processes
Provide assessments, on-going performance analysis and recommendations for future architectures
Responsible for operating all the host systems for the analysis
Works in a liaison role, linking the analysts and their specialty codes and applications, to the computing systems that are focused on yielding in-depth technically sound results
Oversees analytic applications running on a clustered HPC fabric including CPU and GPU systems
Managing job submission to clients applications and codes using MPI/OpenMPI
Provide in-depth analytic results, to achieve a best-tool-for-the-job approach
Partners with data scientists, engineers, and analysts conducting specialized scientific and engineering analysis
Escalate issues and problems to hardware support and/or engineering management as necessary
Responsible for continuous performance analysis and tuning the HPC environment
Assist with the identification, troubleshooting, and repair of software problems impacting performance of implemented HPC solutions
Perform installation of software patches including upgrades to operating systems and firmware
Assist with the resolution of trouble tickets and software problems identified by system’s users
Identify and expand services and functionalities offered in HPC environment
Be a primary point of contact to resolve any hardware or software malfunctions, including working with service personnel as necessary
Review system logs to identify and resolve software and systems related issues
Prepare reports related to the operational efficiency of the hardware and execution of users jobs
Experience with MPI/OpenMPI, SLURM, and Linux Operating Systems essential
Prior experience as a Systems Administrator essential, with a preference for experience working with clustered systems including GPUs in the hardware stack
Experience with high speed networking, and CUDA preferred
Software integration experience a plus
Other duties could be required to support the customer’s mission

Qualification

High Performance ComputingMPI/OpenMPISLURMLinux Operating SystemsAutomationScriptingToolingSystems AdministrationNvidia GPUsPerformance AnalysisSoftware Integration

Required

Top Secret SCI + Polygraph clearance level must currently possess
Top Secret SCI + Polygraph clearance level must be able to obtain
6 + years of related experience
US Citizenship Required
Experience with MPI/OpenMPI, SLURM, and Linux Operating Systems essential
Prior experience as a Systems Administrator essential, with a preference for experience working with clustered systems including GPUs in the hardware stack
Demonstrated on-the-job experience with integrating functionality from disparate systems via scripting/tooling/automation
Demonstrated on-the-job experience with the Sponsor's system security environment and requirements
Demonstrated experience leading systems architecture, operations, maintenance and administration

Preferred

Experience with high speed networking, and CUDA preferred
Software integration experience a plus

Benefits

Variety of medical plan options, some with Health Savings Accounts
Dental plan options
Vision plan
401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match
Full flex work weeks
Paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave
Short and long-term disability benefits
Life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance

Company

General Dynamics Information Technology

company-logo
General Dynamics Information Technology is an IT consulting company that specializes in cyber security, AI, and quantum computing. It is a sub-organization of General Dynamics.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Paul Nedzbala
Senior Vice President
linkedin
leader-logo
Ben Buckley
Vice President and General Manager
linkedin
Company data provided by crunchbase