Principal/Sr Principal HPC Systems Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Northrop Grumman Australia · 2 months ago

Principal/Sr Principal HPC Systems Engineer

Northrop Grumman is a trusted provider of mission-enabling solutions for global security, seeking a High-Performance Computing (HPC) Principal or Sr. Principal HPC Systems Engineer to support advanced technology development programs. The role involves overseeing the design, deployment, and operation of high-performance compute clusters while leading a team of HPC Systems Administrators to ensure system performance meets customer requirements.

Defense & Space
badNo H1BnoteSecurity Clearance RequirednoteU.S. Citizen Onlynote

Responsibilities

Oversee design, deployment, and lifecycle operation of a high-performance compute cluster
Lead team of HPC Systems Administrators
Assess and respond to customer requests for cluster modifications, including oversight of requirement gathering and analysis, planning, implementation, verification/validation, and production deployment maintenance
Investigate, diagnose, and resolve acute system faults
Ensure system performance aligns with customer requirements and remain within technical, schedule, and cost constraints
Maintain software deployments
Maintain security compliance
Monitor and maintain hardware
Contribute to design of new high-performance compute clusters
Interface with user support staff
Assess new technology for benefits and risks by performing trade studies of technological function, value proposition, and deployment timeline
Assess and report on cluster operational risks and propose, plan, and deploy mitigation strategies

Qualification

High-performance computingLinux systems administrationCluster management (Ansible)Job scheduling (SLURM)Security compliance (STIGs)Compiling softwareMonitoring hardwareIAT Level II certificationMPI-based implementationsHigh-speed network fabricsParallel file systemsGPUsTeam leadershipWritten communicationVerbal communication

Required

A degree in a STEM area (Science, Technology, Engineering or Math) with a minimum of 5 years of experience with a bachelor's degree, 3 years of experience with a master's degree, or 0 years of experience with a PhD
Demonstrated experience maintaining computational hardware through its lifecycle
Demonstrated experience analyzing and responding to customer requirements
Strong Linux systems administration proficiency (RHEL nice to have)
Strong knowledge and experience with concepts of high-performance computing system operations, including cluster management (Ansible), multi-user login environments, job scheduling (SLURM), and networked file systems
Strong knowledge and experience maintaining compliance with Security Technical Implementation Guides (STIGs)
Strong knowledge and experience with compiling software
Strong knowledge and experience monitoring and maintaining high-performance compute cluster hardware
Experience directing technical work of a small team of Linux Systems Administrators
Strong written and verbal communication skills
Candidate Must be a U.S. Citizen
Active US Government security clearance per customers requirements
Bachelor's degree in a STEM discipline with 8 years' relevant experience; 6 years' experience with a Master's degree in a STEM discipline; 4 years with PhD in a STEM discipline

Preferred

IAT Level II certification
Experience with MPI-based implementations
Experience with high-speed, low-latency network fabrics
Experience with parallel file systems
Experience with GPUs

Benefits

Health insurance coverage
Life and disability insurance
Savings plan
Company paid holidays
Paid time off (PTO) for vacation and/or personal business

Company

Northrop Grumman Australia

twitter
company-logo
Northrop Grumman Australia is the Australia-based arm of Northrop Grumman Corporation and committed to generating long-term prosperity, investing in advanced Research & Development, sovereign and exportable Intellectual Property, high-quality jobs and long-term technology leadership across the Commonwealth.

Funding

Current Stage
Late Stage
Company data provided by crunchbase