
PictorLabs Inc · 4 hours ago

Senior Machine Learning Inference Engineer

Pictor Labs is the leading virtual staining company revolutionizing digital pathology adoption worldwide through cutting-edge AI-powered technology. We are seeking an experienced Senior ML Inference Engineer to join our team, focusing on optimizing and deploying our production virtual staining models at scale.

Artificial Intelligence (AI), Biopharma, Biotechnology, Health Care, Health Diagnostics, Software
H1B Sponsor Likely

Responsibilities

Design, develop, and optimize production ML inference systems for virtual staining models (Deepstain, Restain, ClearStain) serving clinical and pharmaceutical customers
Architect and implement high-performance inference pipelines capable of processing gigapixel pathology images with sub-2-minute latency requirements
Work with ML Research and Engineering teams to optimize model architectures and deployment strategies for both cloud-based APIs and edge devices (NVIDIA DGX Spark, Grace Blackwell superchips)
Evaluate, implement, and maintain state-of-the-art inference frameworks (TensorRT, Triton Inference Server, ONNX Runtime) to maximize GPU utilization and throughput
Profile and optimize deep neural networks on NVIDIA GPUs using tools such as NVIDIA Nsight, PyTorch Profiler, and custom instrumentation
Design and implement efficient model serving architectures that support both synchronous REST APIs and asynchronous batch processing workflows
Collaborate with Platform and Edge Device teams to containerize inference systems (Docker, Kubernetes) for deployment across cloud and on-premises environments
Partner with cloud providers (AWS, GCP, Azure) to optimize hosted inference solutions and leverage the latest hardware accelerators
Ensure inference systems meet regulatory requirements (FDA 510(k), SOC2) with comprehensive monitoring, logging, and audit capabilities
Prototype and productionize new inference optimization techniques, including quantization, pruning, distillation, and dynamic batching strategies

Qualifications

ML inference optimization, GPU programming, Python, PyTorch, TensorRT, Triton Inference Server, ONNX Runtime, Kubernetes, Docker, AWS, GCP, Azure, Image processing, Communication skills, Collaboration skills, Technical documentation

Required

7+ years of experience building and optimizing production ML inference systems at scale
Expert-level proficiency in Python and experience writing high-performance inference services
5+ years of hands-on experience with PyTorch and at least one production inference tool (TensorRT, Triton Inference Server, ONNX Runtime, TorchServe)
Deep understanding of computer vision model architectures, particularly generative models (GANs, diffusion models) and vision transformers
Extensive experience profiling and optimizing deep neural networks on NVIDIA GPUs, including memory optimization, kernel fusion, and mixed-precision inference
Strong background in image processing pipelines and libraries (OpenCV, Pillow, scikit-image) for handling large-scale medical imaging data
Proven experience deploying ML systems on Kubernetes and major cloud providers (AWS, GCP, Azure)
Experience with Docker containerization and orchestration for ML workloads
Strong software engineering practices including version control (Git), CI/CD, unit testing, and production debugging
Excellent communication, collaboration, and technical documentation skills

Preferred

Experience with medical imaging, digital pathology, or whole slide imaging (WSI) processing
Knowledge of edge device deployment and embedded systems for AI inference
Experience with MLOps tools (MLflow, Kubeflow, Apache Airflow) and model versioning
Understanding of FDA regulatory requirements for AI/ML in medical devices
Background in distributed inference systems and model parallelism techniques
Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK Stack)

Company

PictorLabs Inc

Pictor Labs is The Virtual Staining Company enabling digital pathology adoption worldwide through AI-powered technology that delivers quality results in minutes while preserving tissue for comprehensive analysis.

H1B Sponsorship

PictorLabs Inc has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor.)
Trends of Total Sponsorships
2025 (5)
2024 (1)
2022 (2)

Funding

Current Stage: Growth Stage
Total Funding: $48.81M
Key Investors: Insight Partners
2024-09-18: Series B · $30M
2022-05-26: Series A · $15.21M
2020-10-07: Seed · $3.6M

Leadership Team

Yair Rivenson
Chief Executive Officer
Company data provided by Crunchbase