General Dynamics Information Technology ยท 1 month ago
SYSTEMS ENGINEER PRINCIPAL (HPC/AI System Administrator, Storage Engineer, Monitoring Expert, Solution Architect, Security/Provisioning Engineer, or Multi-discipline Expert)
General Dynamics Information Technology is seeking a Systems Engineer Principal to advance customer operations. The role involves supporting the lifecycle sustainment and operational availability of High Performance Computing clusters for the National Weather Service.
Artificial Intelligence (AI)Cloud ComputingConsultingCyber SecurityInformation Technology
Responsibilities
Lead/Manage/Support the day-day operations, sustainment, HPC services delivery, and incremental enhancements of two, geographically separated HPC clusters that are GDIT contractor owned and contractor operated (COCO) and used exclusively for WCOSS
Collaborate with the GDIT WCOSS team as a senior-level HPC functional expert addressing intricate and multifaceted HPC challenges by providing innovative ideas, solutions, and resolution for customer requests, issues, and improvement efficiencies on a continuous basis
Drive and prioritize resource utilization towards continuously improving customer satisfaction with GDIT's HPC service delivery and exceeding the contract service level metrics of uptime, availability, performance, stability, and on-time product delivery
Utilize past experience, team collaboration, system management and troubleshooting applications, and ingenuity to support customer operations while working on systems that range in capacity from 1000-3000+ nodes and 100's of PB storage per system
Qualification
Required
Education: Bachelor of Arts/Bachelor of Science
Experience: 8+ years of related experience
Technical skills: Highly proficient with Linux (RockyOS, SLES, etc), scripting in Python, Perl, or Bash, networking concepts and technology such as Ethernet, InfiniBand and Slingshot, TCP/IP networking, basic routing, and network services, programming in Python, C/C++, or Fortran, administrating PBSpro, SLURM or other batch systems in an HPC cluster, and system performance monitoring and tuning in an HPC cluster environment (e.g., Opensearch, Grafana, Prometheus)
Security clearance level: must complete a satisfactory background investigation
US citizenship required
Role requirements: expected to perform as individual SME contributor, functional lead, or project/task leader responsible for work product delivery. Extensive experience in troubleshooting, diagnosing and repairing hardware failures to component level on servers; coordinating with vendors to resolve hardware and software problems. Minimal travel required for onsite work, team collaboration, training, and customer interaction
Benefits
Comprehensive benefits and wellness packages
401K with company match
Variety of medical plan options
Some with Health Savings Accounts
Dental plan options
Vision plan
Paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave
GDIT Paid Family Leave program provides a total of up to 160 hours of paid leave in a rolling 12 month period for eligible employees
Short and long-term disability benefits
Life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance
Company
General Dynamics Information Technology
General Dynamics Information Technology is an IT consulting company that specializes in cyber security, AI, and quantum computing. It is a sub-organization of General Dynamics.
Funding
Current Stage
Late StageRecent News
2026-01-03
2025-12-16
Business Wire
2025-11-20
Company data provided by crunchbase