IMCS Group ยท 3 weeks ago
Consultant - Infrastructure Management | DevOps | Continuous delivery - Environment management and provisioning
IMCS Group is one of the fastest growing MWBE staffing firms in the U.S. They are seeking a skilled HPC Slurm Administrator to manage and support high-performance computing environments. The ideal candidate will have hands-on experience with Slurm workload manager and Linux system administration, playing a key role in maintaining, optimizing, and scaling HPC infrastructure.
Staffing & Recruiting
Responsibilities
Administer and maintain HPC clusters using Slurm
Monitor system performance and ensure high availability and reliability
Troubleshoot and resolve issues related to job scheduling, compute nodes, and storage
Manage user accounts, permissions, and security policies
Automate administrative tasks using scripting languages (e.g., Bash, Python)
Collaborate with engineering and research teams to support compute-intensive workloads
Document system configurations, procedures, and operational changes
Participate in upgrades, patching, and scaling of HPC infrastructure
Qualification
Required
Experience in Linux system administration, preferably in HPC environments
Strong expertise with Slurm workload manager
Proficiency in Bash, Python, or other scripting languages
Familiarity with parallel file systems and high-speed networking (e.g., InfiniBand)
Experience with configuration management tools (e.g., Ansible, Puppet)
Minimum years of experience needed- 3+ years of experience
Company
IMCS Group
IMCS Group is an IT, Healthcare, and Professional Staffing Company that helps Enterprises optimize the business value of their Staffing investments and enables them to achieve world-class business performance.