HPC Systems Engineer jobs in United States
info-icon
This job has closed.
company-logo

Corvid Technologies ยท 1 hour ago

HPC Systems Engineer

Corvid Technologies is an engineering firm specializing in high-fidelity computational modeling and simulation. They are seeking HPC Systems Engineers with a strong background in Linux to support their large-scale High Performance Computer and optimize its performance and efficiency.

Information TechnologyProduct DesignWeb Design
badNo H1BnoteSecurity Clearance RequirednoteU.S. Citizen Onlynote
Hiring Manager
Rick Wilbourn
linkedin

Responsibilities

Supporting software installation and configuration of license management servers (e.g., FlexLM or RLM)
Implement site-to-site VPNs (e.g., IPSEC tunnels) to customers on customer HPC clusters
Troubleshoot slow, hanging, or failing HPC jobs on internal or customer HPC clusters
Automate repetitive tasks and implement custom solutions using scripting/programming languages such as Bash or Python
Provide guidance and support on HPC best practices and solutions for internal and external customers
Troubleshoot hardware and software issues on Linux servers
Installation of new hardware into existing compute clusters
Design, test, and implement an HPC environment consisting of a provisioner (e.g. xcat, warewulf), scheduler (e.g. Slurm, SGE, PBS), RDMA connections (e.g InfiniBand), a subnet manager, and 5+ compute nodes within the first 180 days of employment
Obtaining a CompTIA Security+ certification within the first year of employment

Qualification

LinuxHPC systemsScriptingCompTIA Security+InfiniBandJob management toolsConfiguration managementCommunication skillsProblem-solving skills

Required

Bachelor's degree in Engineering or related STEM field (master's preferred)
Scripting experience
Professional/personal experience using command-line Linux (RHEL derivatives preferred)
Experience in one or more engineering computational code OR 2+ years of IT-related experience (e.g., user support, basic networking, Linux server administration, a home Linux environment)
Obtain and maintain a U.S. security clearance
Supporting software installation and configuration of license management servers (e.g., FlexLM or RLM)
Implement site-to-site VPNs (e.g., IPSEC tunnels) to customers on customer HPC clusters
Troubleshoot slow, hanging, or failing HPC jobs on internal or customer HPC clusters
Automate repetitive tasks and implement custom solutions using scripting/programming languages such as Bash or Python
Provide guidance and support on HPC best practices and solutions for internal and external customers
Troubleshoot hardware and software issues on Linux servers
Installation of new hardware into existing compute clusters
Design, test, and implement an HPC environment consisting of a provisioner (e.g. xcat, warewulf), scheduler (e.g. Slurm, SGE, PBS), RDMA connections (e.g InfiniBand), a subnet manager, and 5+ compute nodes within the first 180 days of employment
Obtaining a CompTIA Security+ certification within the first year of employment

Preferred

Past experience as an HPC user on a large-scale cluster
Past experience managing information systems within a classified environment
Experience installing, configuring, and maintaining job management tools (such as SLURM, Moab, TORQUE, PBS, etc.)
Experience configuring, installing, and troubleshooting MPI and OpenMP applications
Experience with operating system deployment tools (e.g. XCAT, ROCKS)
Hands-on experience of at least one distributed file system (Spectrum Scale-GPFS, Lustre, BeeGFS, Gluster, IMRIX, PVFS, etc.)
Direct experience working with InfiniBand
Experience configuring, installing, tuning, and maintaining scientific software on large-scale systems
Experience supporting HPC compilers and libraries
Experience with configuration management tools such as Ansible or Puppet
Familiarity with authentication and access control systems (ADFS, LDAP, Kerberos)
Active U.S. security clearance
Current and active CompTIA Security+ certification

Benefits

Employee ownership through our generous 401(k) match in Corvid Stock
Medical insurance via Blue Cross - PPO and High-Deductible plans (with company HSA contribution)
Paid Time Off (PTO) starting at 3 weeks - based on years of industry experience not tenure
Career development and continuing education opportunities
Company provided life, long-term, and short-term disability insurance
Incentive opportunities to reward strong performance and corporate growth
Attractive campus facilities including Lake Norman access, kayaks, paddle boards, basketball and pickleball courts, grills, and more
Paid gym membership

Company

Corvid Technologies

twittertwitter
company-logo
Corvid Technologies is a defense & space company offering Fluid Dynamics, Structural Mechanics, and Warhead Design services.

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
David Robinson
President/CEO
linkedin
leader-logo
Ted Berna
Chief Financial Officer
linkedin
Company data provided by crunchbase