Serve Robotics · 2 months ago
Sr. Software Engineer, ML Edge Inference
Serve Robotics is a company focused on transforming urban delivery through innovative robotic solutions. They are seeking a highly skilled Sr. Software Engineer, ML Edge Inference to bridge the gap between machine learning research and real-time deployment, ensuring advanced models run efficiently on edge hardware. The role involves close collaboration with various engineering teams to optimize and integrate ML models for robotic platforms.
Artificial Intelligence (AI)Food DeliveryLogisticsRobotics
Responsibilities
Own the full lifecycle of ML model deployment on robots—from handoff by the ML team to full system integration
Convert, optimize, and integrate trained models (e.g., PyTorch/ONNX/TensorRT) for Jetson platforms using NVIDIA tools
Develop and optimize CUDA kernels and pipelines for low-latency, high-throughput model inference
Profile and benchmark existing ML workloads using tools like Nsight, nvprof, and TensorRT profiler
Identify and remove compute and memory bottlenecks for real-time inference
Design and implement strategies for quantization, pruning, and other model compression techniques suited for edge inference
Ensure models are robust to the resource constraints of real-time, low-power robotic systems
Manage memory layout, concurrency, and scheduling for optimized GPU and CPU usage on Jetson devices
Build benchmarking pipelines for continuous performance evaluation on hardware-in-the-loop systems
Collaborate with QA and systems teams to validate model behavior in field scenarios
Work closely with ML researchers to influence model architectures for edge deployability and provide technical guidance on the feasibility of real-time ML models in the robotics stack
Qualification
Required
Bachelor's degree in Computer Science, Robotics, Electrical Engineering, or equivalent field
5+ years experience in deploying ML models on embedded or edge platforms (preferably robotics)
3+ years of experience with CUDA, TensorRT, and other NVIDIA acceleration tools
Proficient in Python and C++, especially for performance-sensitive systems
Experience with NVIDIA Jetson (e.g., Xavier, Orin) and edge inference tools
Familiarity with model conversion workflows (e.g., PyTorch → ONNX → TensorRT)
Preferred
Master's degree in Computer Science, Robotics, Electrical Engineering, or equivalent field
Experience with real-time robotics systems (e.g., ROS2, middleware, safety-critical constraints and linux embedded systems)
Knowledge of performance tuning under thermal, power, and memory constraints on embedded devices
Experience with model quantization (e.g., INT8), sparsity, and latency-aware model design
Contributions to open-source ML or CUDA projects is a plus
Company
Serve Robotics
Serve Robotics is an autonomous robotic delivery company that develops AI-powered sidewalk delivery robots.
H1B Sponsorship
Serve Robotics has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (4)
2022 (1)
2021 (5)
Funding
Current Stage
Public CompanyTotal Funding
$394MKey Investors
PostmatesNVIDIANeo
2025-10-10Post Ipo Equity· $100M
2025-01-07Post Ipo Equity· $80M
2024-12-01Post Ipo Equity· $86M
Recent News
2026-01-09
The Motley Fool
2026-01-09
Company data provided by crunchbase