Lead/Senior HPC Engineer jobs in United States

Autonomai Recruitment · 1 day ago

Lead/Senior HPC Engineer

Autonomai Recruitment is seeking a Lead/Senior HPC Engineer to build and evolve large-scale HPC and GPU compute platforms for a leading financial firm. The role involves designing, scaling, and optimizing compute, storage, and networking stacks that support AI, ML, and advanced quantitative workloads.

Staffing & Recruiting
Hiring Manager
Keatan Forrest

Responsibilities

Design, build, and operate large-scale HPC and GPU clusters used for research, simulation, and AI workloads
Own the full lifecycle from hardware selection and build-out through production operations
Support and scale GPU-accelerated environments using NVIDIA platforms
Work closely with researchers and engineers running ML and data-intensive workloads
Architect and operate high-throughput, low-latency storage platforms in partnership with storage specialists
Optimise parallel file systems to support GPU-heavy workloads
Continuously profile, tune, and improve compute, network, and storage performance
Diagnose and resolve complex system-level issues across hardware, OS, drivers, fabrics, and filesystems
Build automation for provisioning, configuration, monitoring, and lifecycle management
Reduce operational friction through clean tooling and well-designed systems
Act as a technical partner to researchers, data scientists, and software engineers
Translate demanding workloads into robust, scalable infrastructure solutions
Contribute documentation, reusable code, and internal best practices
Provide subject-matter guidance on HPC, GPU compute, and performance engineering

Qualifications

HPC architecture, GPU compute, Linux systems, Parallel storage systems, Job schedulers, Cluster orchestration, Scripting, Automation, Collaboration, Technical leadership

Required

7+ years designing, building, and running complex Linux-based HPC environments
Strong hands-on experience with GPU compute, ideally NVIDIA and CUDA ecosystems
Deep knowledge of parallel storage systems such as Lustre or Spectrum Scale
Advanced Linux systems skills with strong scripting and automation experience
Experience with job schedulers and cluster orchestration, with a strong preference for Slurm
Familiarity with high-speed interconnects such as InfiniBand or RoCE
Comfort operating at scale with performance and uptime as first-class concerns
Naturally curious, detail-oriented, and ownership-driven
Enjoys solving hard systems problems rather than working around them
Comfortable operating in an environment with high expectations and minimal hand-holding

Preferred

Advanced degree in a technical field
Exposure to buy-side finance, trading, or research-driven environments
Experience with cloud-based HPC, particularly GCP
Container technologies such as Docker or Singularity
Interest in applying AI techniques to infrastructure optimisation and monitoring

Company

Autonomai Recruitment

Autonomai Recruitment is a boutique search agency specializing in tailored recruitment solutions for FinTech, Crypto, and AI.

Funding

Current Stage
Early Stage
Company data provided by Crunchbase