Autonomai Recruitment ยท 1 day ago
Lead/Senior HPC Engineer
Autonomai Recruitment is seeking a Lead/Senior HPC Engineer to build and evolve large-scale HPC and GPU compute platforms for a leading financial firm. The role involves designing, scaling, and optimizing compute, storage, and networking stacks that support AI, ML, and advanced quantitative workloads.
Responsibilities
Design, build, and operate large-scale HPC and GPU clusters used for research, simulation, and AI workloads
Own the full lifecycle from hardware selection and build-out through production operations
Support and scale GPU-accelerated environments using NVIDIA platforms
Work closely with researchers and engineers running ML and data-intensive workloads
Architect and operate high-throughput, low-latency storage platforms in partnership with storage specialists
Optimise parallel file systems to support GPU-heavy workloads
Continuously profile, tune, and improve compute, network, and storage performance
Diagnose and resolve complex system-level issues across hardware, OS, drivers, fabrics, and filesystems
Build automation for provisioning, configuration, monitoring, and lifecycle management
Reduce operational friction through clean tooling and well-designed systems
Act as a technical partner to researchers, data scientists, and software engineers
Translate demanding workloads into robust, scalable infrastructure solutions
Contribute documentation, reusable code, and internal best practices
Provide subject-matter guidance on HPC, GPU compute, and performance engineering
Qualification
Required
7+ years designing, building, and running complex Linux-based HPC environments
Strong hands-on experience with GPU compute, ideally NVIDIA and CUDA ecosystems
Deep knowledge of parallel storage systems such as Lustre or Spectrum Scale
Advanced Linux systems skills with strong scripting and automation experience
Experience with job schedulers and cluster orchestration, strong preference for Slurm
Familiarity with high-speed interconnects such as InfiniBand or RoCE
Comfort operating at scale with performance and uptime as first-class concerns
Naturally curious, detail-oriented, and ownership-driven
Enjoys solving hard systems problems rather than working around them
Comfortable operating in an environment with high expectations and minimal hand-holding
Preferred
Advanced degree in a technical field
Exposure to buy-side finance, trading, or research-driven environments
Experience with cloud-based HPC, particularly GCP
Container technologies such as Docker or Singularity
Interest in applying AI techniques to infrastructure optimisation and monitoring
Company
Autonomai Recruitment
Autonomai Recruitment is a boutique search agency specializing in tailored recruitment solutions for FinTech, Crypto, and Ai.
Funding
Current Stage
Early StageCompany data provided by crunchbase