Staff AI Software Engineer - Edge Model Optimization & Deployment jobs in United States
cer-icon
Apply on Employer Site
company-logo

FieldAI · 3 hours ago

Staff AI Software Engineer - Edge Model Optimization & Deployment

FieldAI builds field-proven embodied AI that enables robots to operate autonomously in complex, unstructured real-world environments. They are seeking a Staff AI Software Engineer to optimize and deploy deep learning models for edge hardware, ensuring their performance in real-time robotic applications.

Enterprise SoftwareRobotic Process Automation (RPA)Robotics
check
H1B Sponsor Likelynote

Responsibilities

Convert and optimize 2D/3D CNNs and Transformer-based models (PyTorch/TensorFlow → ONNX → TensorRT/Triton) for real-time inference on Jetson/Orin platforms
Apply model compression techniques—quantization, pruning, distillation, weight sharing—to meet strict constraints on latency, memory, bandwidth, and power
Develop custom TensorRT plugins and CUDA kernels for performance-critical components
Integrate optimized models into the broader robotic system using ROS nodes and interfaces
Build benchmarks, profile and debug end-to-end inference pipelines, and validate performance in real-world robotic scenarios
Collaborate closely with AI researchers, robotics engineers, and hardware teams to translate cutting-edge research into robust, deployable edge solutions
Ensure the reliability, robustness, and stability of deployed models operating continuously in challenging, resource-constrained environments

Qualification

Deep learning model deploymentPyTorchTensorRTCUDAC++Model optimization techniquesROS integrationEmbedded systemsCollaboration skillsIndependent work

Required

5+ years of professional experience developing and deploying deep learning models for edge, embedded, or real-time systems
BS, MS, PhD, or equivalent experience in Computer Science, Robotics, Electrical/Computer Engineering, or a related field
Strong proficiency in PyTorch, C++, Python, and CUDA for AI/ML development and model optimization
Hands-on experience with TensorRT, ONNX, and Triton, including authoring custom plugins for TensorRT
Proven experience applying model optimization techniques such as quantization, pruning, and distillation in production systems
Deep understanding of hardware constraints and performance tuning on Jetson / ARM platforms, GPUs, and embedded Linux systems
Experience integrating AI models into ROS-based robotic systems
Ability to work independently while collaborating effectively in a fast-paced, cross-functional engineering environment

Preferred

Experience with ROS2
Experience writing and optimizing custom CUDA kernels and low-level GPU performance tuning
Familiarity with Triton, ML compilers, or compiler-level optimizations for GPU inference
Experience with JAX or additional ML frameworks beyond PyTorch
Background deploying AI systems on real robots operating in the field, not just offline or in simulation
Familiarity with NVIDIA's edge and robotics ecosystem (e.g., Isaac ROS, DeepStream, JetPack)

Company

FieldAI

twittertwitter
company-logo
FieldAI is pioneering the development of a field-proven, hardware agnostic brain technology that enables many different types of robots to operate autonomously in hazardous, offroad, and potentially harsh industrial settings – all without GPS, maps, or any pre-programmed routes.

H1B Sponsorship

FieldAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9)

Funding

Current Stage
Early Stage
Total Funding
$405M
2025-08-20Series Unknown· $91M
2025-08-20Series A· $314M

Leadership Team

leader-logo
Ali Agha
Founder and CEO
linkedin
Company data provided by crunchbase