Selby Jennings · 6 days ago
Machine Learning Performance Engineer
Selby Jennings is a company focused on optimizing performance in machine learning applications. They are seeking a Machine Learning Performance Engineer to design, build, and optimize high-performance training and inference pipelines for deep learning workloads while collaborating closely with researchers and engineers.
Responsibilities
Design, build, and optimize high‑performance training and inference pipelines for deep learning workloads
Work deep in the internals of open‑source ML frameworks, improving performance, efficiency, and scalability
Systematically profile, analyze, and remove performance bottlenecks across the full stack-software, hardware, and infrastructure
Collaborate closely with researchers, traders, and engineers to productionize research with strict latency and throughput constraints
Develop a strong understanding of low‑latency trading systems and the performance tradeoffs that underpin them
Qualification
Required
Deep knowledge of the internals of modern deep-learning frameworks such as PyTorch, JAX, TensorFlow, or similar
Strong understanding of computer architecture, memory hierarchies, parallelism, and hardware-aware optimization
Significant experience writing high-performance C++ and Python
A rigorous, metrics-driven approach to performance optimization and debugging
Preferred
Experience with the JAX ecosystem, including XLA, Flax, and related tooling
Hands-on experience optimizing for GPUs or accelerators using CUDA, Triton, Pallas, or similar
Linux systems programming and low-level debugging experience
Exposure to large-scale distributed training or inference systems
Contributions to open-source projects in machine learning, systems, or performance engineering
Company
Selby Jennings
Global recruitment firm specialising in Banking
Funding
Current Stage
Late StageRecent News
Business Insider
2025-09-30
2025-07-10
Seattle TechFlash
2025-05-03
Company data provided by crunchbase