Omega Enterprise Solutions ยท 2 months ago
HPC Technical Expert/Functional Expert
Omega Enterprise Solutions is a Maryland-based, Service-Disabled Veteran-Owned Small Business (SDVOSB) focused on the U.S. Department of Defense and Intelligence Community. They are seeking an HPC Technical Expert to manage and support complex IT environments, ensuring the efficient operation of high-performance computing systems.
Business DevelopmentConsultingCyber SecurityInformation Technology
Responsibilities
Installation, configuration, tuning, troubleshooting and administration of:
Multi-vendor servers running numerous COTS, opensource, and in-house applications to accommodate HPC Division IT support requirements
Multi-vendor servers running Red Hat of SuSe with direct attached, FC SAN storage or SSDs
Distributing computing tools such as ReS, LSF, and SLURM
HPC farm systems, HPC MPP clustered systems, Front End servers of Special Purpose devices (SPDs) IBM of HP Blade servers with FC/SAS/Network back end
Multi-vendor filesystems such as XFS, GPFS and Lustre
Pre-Factory testing, Factory testing, System integration and Acceptance testing during the purchase process of the HPS systems
Configuration, tuning, testing, and advanced level troubleshooting and support of high performance filesystems such as XFS, GPFS and Lustre
Advanced level troubleshooting and support of HPC farm systems and associated applications such as Nagios, xcat, failover software, and compilers Working knowledge of HPC MPP systems
Configuration, tuning, testing, and advanced level troubleshooting and support of distributed computing tools such as RES, LSF and SLURM
Configuration, tuning, testing, and advanced level troubleshooting of RedHat and SuSe operating systems
Qualification
Required
Bachelor's Degree in Computer Science or related field
At least eight (8) years in a large and complex IT environment providing industry and government recognized functional expertise
Five (5) years of full-time computer science work that can substitute for the Bachelor's degree
Active Security Clearance and Polygraph
IAT Level 2 Certification Required
Installation, configuration, tuning, troubleshooting and administration of multi-vendor servers running numerous COTS, opensource, and in-house applications
Installation, configuration, tuning, troubleshooting and administration of multi-vendor servers running Red Hat or SuSe with direct attached, FC SAN storage or SSDs
Distributing computing tools such as ReS, LSF, and SLURM
HPC farm systems, HPC MPP clustered systems, Front End servers of Special Purpose devices (SPDs) IBM of HP Blade servers with FC/SAS/Network back end
Multi-vendor filesystems such as XFS, GPFS and Lustre
Pre-Factory testing, Factory testing, System integration and Acceptance testing during the purchase process of the HPS systems
Configuration, tuning, testing, and advanced level troubleshooting and support of high performance filesystems such as XFS, GPFS and Lustre
Advanced level troubleshooting and support of HPC farm systems and associated applications such as Nagios, xcat, failover software, and compilers
Working knowledge of HPC MPP systems
Configuration, tuning, testing, and advanced level troubleshooting and support of distributed computing tools such as RES, LSF and SLURM
Configuration, tuning, testing, and advanced level troubleshooting of RedHat and SuSe operating systems
Valid RHCSA or higher Red Hat certification
Valid VMWare certification
Preferred
Master's Degree in Computer Science or related field may substitute for two (2) years experience
An industry recognized professional certification may substitute as one year experience