Berkeley Lab · 20 hours ago
HPC Storage Systems Engineer
Berkeley Lab is a leading scientific research institution seeking an HPC Storage Systems Engineer to enhance its high-performance computing and data analysis capabilities. The role involves managing and maintaining NERSC's mass storage systems, contributing to the storage strategy, and collaborating with a team of engineers to ensure efficient data management for scientific research.
Responsibilities
Participate in projects to architect, deploy and manage NERSC’s mass storage hierarchy
Contribute to the effort to manage and maintain the HPSS systems
Day-to-day administration of complex tape-based storage systems
Analyze storage usage and system monitoring data
Administration of storage servers and block storage arrays
Participate in the management of the storage area network
Troubleshoot and debug problems in our production storage systems
Help define storage requirements for NERSC, ensuring that NERSC users’ needs are represented
Engage with NERSC users to identify projects which will improve data management and movement at the center
Identify and evaluate new storage hardware and software technologies and features
Participate in 24x7 on-call rotation
Work on and resolve complex issues where analysis of situations or data requires an in-depth evaluation of variable factors
Exercise judgment in selecting methods, techniques and evaluation criteria for obtaining results
Determine methods and procedures on new assignments and may coordinate activities of other personnel
Network with key contacts outside of their own area of expertise
Lead the mass storage system administrators team within the Storage Systems Group, leading the effort to manage and maintain the HPSS systems
Lead projects to architect, deploy and manage NERSC’s mass storage hierarchy
Work on and resolve significant and unique issues where analysis of situations or data requires an evaluation of intangibles
Exercise independent judgment in methods, techniques and evaluation criteria for obtaining results
Present technical information at conferences and meetings
Qualifications
Required
Bachelor's degree or equivalent experience and a minimum of 8 years of computing or storage experience; or 6 years and a Master's degree; or equivalent experience
Wide-ranging expertise in the areas of mass storage solutions (such as HPSS) and storage networking technologies (such as RDMA, RoCE, InfiniBand and Fibre Channel)
Experience managing storage systems
Excellent technical troubleshooting skills with the ability to resolve complex issues in creative and effective ways
Knowledge of trends in storage system hardware and software
Strong communication skills, and the ability to work independently and collaboratively as part of a creative and diverse team
Ability to script in Python, Perl, Shell or another interpreted language
Knowledge of block storage arrays, storage networks, parallel file systems, hierarchical storage systems and object stores
Ability to resolve complex issues in creative and effective ways
Ability to network and collaborate with key contacts outside of their own area of expertise
Excellent oral and written communication skills
Demonstrated ability to work effectively as part of a cross-disciplinary team
Bachelor's degree or equivalent experience and a minimum of 12 years of computing or storage experience; or 8 years and a Master's degree; or equivalent experience
Broad expertise and/or unique knowledge in the areas of mass storage solutions (such as HPSS), storage networking technologies (such as RDMA, RoCE, InfiniBand and Fibre Channel), NFS, storage tiering, and storage performance tuning
Experience providing direction to a project team, or leading a team of systems or storage administrators
Experience architecting storage system solutions to meet user requirements
Experience administering or developing HPSS, Versity or other hierarchical storage management systems
Experience troubleshooting high performance data transfer applications
Experience with an automated software provisioning and configuration management system
Understanding of file system internals, or prior work developing storage systems
Good understanding of data transfer protocols, for example TCP/IP, IB verbs or RoCE
Knowledge of typical Unix file system structure
Ability to work on and resolve significant and unique issues where analysis of situations or data requires an evaluation of intangibles
Ability to exercise independent judgment in methods, techniques and evaluation criteria for obtaining results
Company
Berkeley Lab
Berkeley Lab is a national laboratory that creates advanced new tools for scientific discovery.
H1B Sponsorship
Berkeley Lab has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (154)
2024 (159)
2023 (163)
2022 (154)
2021 (165)
2020 (107)
Funding
Current Stage
Late Stage
Company data provided by crunchbase