Sabre Systems, LLC · 7 hours ago
HPC Engineer
Sabre Systems, LLC is seeking an HPC Data Storage Engineer to support a mission-critical Department of Defense program dedicated to high-performance computing operations. The role involves designing, optimizing, and maintaining advanced high-performance computing environments to enable data-intensive research efforts essential to national defense.
Information Technology
Responsibilities
Utilize a wide variety of skills in system and network monitoring; large-scale systems administration; scripting and automation; security compliance; network distributed services; storage and backups; and hardware and software problem diagnosis and resolution
Diagnose and troubleshoot technical problems, often of a complex nature, associated with computer hardware and software interrelationships and dependencies
Conduct needs analysis, planning, and scheduling the installation of a wide variety of new or modified hardware/software
Develop functional and technical IT system requirements and specifications. Configure and optimize system tools and applications, to include job schedulers (Slurm and PBSPro) and system resources (GitLab, LUA/TCL modules, and system support applications)
Create and brief technical presentations to technical and non-technical stakeholders. Maintain detailed documentation of system configurations, procedures, and troubleshooting guides. Develop user facing documentation
Qualification
Required
Bachelor's in Computer Engineering, Computer Science, or related field and ten or more years of job related experience
Thorough knowledge of complex concepts, practices, and troubleshooting associated with HPC cluster systems design, installation, and maintenance
Advanced knowledge in distributed computing theory, parallel processing, applications, and associated infrastructure is required
Extensive experience with Linux/Unix systems including installation, configuration, networking, backups, updates and patching, data archiving, and system security
Functional knowledge of HPC middleware, and platform managers such as Bright Cluster Manager; employing job schedulers such as PBS, Slurm, Torque, etc.; and, optimizing job queues
Experience with HPC or large-scale distributed computing environments and technologies such as high-speed low-latency interconnects (e.g. InifiniBand), parallel file systems (e.g. Lustre), and virtualization environments and tools (e.g. VMWare)
Experience developing Python/bash/Perl scripts and employing automation frameworks such as Ansible
General knowledge employing Docker containers and Kubernetes ecosystems
Working knowledge in one or more programming languages (e.g. C/C++, Fortran, etc.)
Must be able and willing to travel to northern Virginia approximately 25% of the time
This position requires an active Top Secret DoD security clearance (U.S. Citizenship Required)
Benefits
Employee Referral Bonus Eligibility
Comprehensive, evolving benefits designed to meet their diverse needs
Company
Sabre Systems, LLC
Sabre Systems LLC is a quality driven, customer focused business providing innovative and sustainable solutions in the areas of Cyber, Systems and Software Engineering, Advanced Communication Technologies and Digital Transformation.
Funding
Current Stage
Growth StageTotal Funding
unknown2024-09-27Acquired
Recent News
Washington Technology
2025-10-09
2024-04-27
Travel And Tour World
2024-04-27
Company data provided by crunchbase