Pinterest · 7 hours ago
Staff Software Engineer, Ads ML Inference Infrastructure
Pinterest is a platform that inspires creativity and planning for lasting memories. They are seeking a Staff Software Engineer to lead the development of large-scale ML inference systems that enhance their advertising capabilities.
InternetSocial BookmarkingSocial MediaSocial NetworkSoftware
Responsibilities
Lead and drive efforts to build next-generation model inference and feature serving systems that power up to 100x larger models and directly uplevel Pinterest’s monetization business
Design and optimize low-latency, high-throughput inference pipelines to meet strict SLOs while improving performance, efficiency, and cost
Partner with Ads ML and product teams to productionize new model architectures (including LLMs and multi-stage ranking models) and scale them reliably to global traffic
Evolve the online feature platform (feature computation, caching, and retrieval) to improve coverage, freshness, and consistency for Ads models
Evaluate and integrate new technologies (e.g., GPU acceleration, model compression, Triton, vLLM, Dynamo) to advance our inference stack
Build strong partnerships with other infra and ML teams to improve end-to-end reliability, observability, and developer velocity for Ads ML
Mentor and coach other engineers, guiding them through technical decisions, system design, and career development
Qualification
Required
BS (or higher) degree in Computer Science or a related field
~8+ years of relevant industry experience designing and operating large-scale, production ML or distributed infra systems
Deep knowledge of at least one programming language (Java, C++, Python)
Deep experience with distributed systems or recommendation / ads serving infrastructure (e.g., request routing, online storage, caching, feature serving, APIs)
Hands-on experience with at least one deep learning framework (PyTorch or TensorFlow) and bringing models from offline experimentation to production
Proven track record of leading complex projects, setting technical direction, and collaborating across functions and orgs; experience mentoring and coaching other engineers
Preferred
Experience with model / hardware accelerator libraries (e.g., CUDA, quantization, distillation, low-precision inference)
Experience with inference optimization and serving frameworks such as Triton, vLLM, or Dynamo
Benefits
Equity
Company
Pinterest is a visual bookmarking tool for saving and discovering creative ideas.
Funding
Current Stage
Public CompanyTotal Funding
$1.49BKey Investors
Elliott Management Corp.Brandtech VenturesGoldman Sachs Investment Partners,SV Angel,Wellington Management
2022-07-14Post Ipo Equity
2020-01-01Post Ipo Equity
2019-04-18IPO
Recent News
2026-02-07
2026-02-06
2026-02-06
Company data provided by crunchbase