Senior/Staff Software Engineer - ML Infrastructure jobs in United States

Voxel · 4 months ago

Senior/Staff Software Engineer - ML Infrastructure

Voxel is a company dedicated to enhancing workplace safety through innovative AI and computer vision technology. They are seeking a Staff Machine-Learning Infrastructure Engineer to lead the development of their computer-vision platform, focusing on managing ML lifecycle components and building scalable infrastructure.

Artificial Intelligence (AI), Computer Vision, Industrial, Industrial Automation, Risk Management
H1B Sponsor Likely

Responsibilities

Own data & labeling pipelines – architect scalable labeling services (storage, query, retrieval), design ontologies, automate annotation workflows, and build quality-tiered datasets that stay within cost constraints
Build and operate training infrastructure – create multi-GPU / multi-node training frameworks (Ray, Spark, Kubernetes), optimize distributed jobs, and integrate accelerators (TensorRT, CUDA-graph, FP8, etc.)
Manage the full model lifecycle – stand up model registries, version control, evaluation suites, and continuous-learning loops that push updates from dev → staging → prod with zero-downtime rollbacks
Provide technical leadership, mentorship, and lightweight project management to a small infra + research squad
Establish DevOps-for-ML best practices (IaC, CI/CD, observability, cost monitoring) so researchers can iterate quickly and safely
Partner with ML engineers on architecture decisions, from data schemas to inference optimizations, ensuring infra and research road-maps stay tightly aligned

Qualifications

ML infrastructure design, Kubernetes expertise, Distributed systems, Data labeling automation, DevOps practices, Python programming, Technical leadership, Mentorship, Project management

Required

Bachelor's (or higher) in Computer Science, EE, or related field
5+ years building and operating large-scale infrastructure, with at least 3 years focused on ML or data-intensive systems
Proven record designing highly available, distributed systems on Kubernetes (EKS, GKE, or on-prem)
Deep expertise with orchestration (K8s operators, Argo, Kubeflow), and cluster-scale storage / compute (S3, GCS, Ray, Spark, Dask)
Hands-on experience automating data-labeling or ground-truth workflows and maintaining dataset versioning
Strong software-engineering fundamentals; familiar with best practices for testing, observability, and secure coding
Demonstrated DevOps mindset — IaC (Terraform/CDK), CI/CD (GitHub Actions, ArgoCD), metrics & alerting (Prometheus/Grafana)

Preferred

Experience running multi-instance / multi-GPU training jobs, mixed-precision optimizations, or TensorRT / Triton inference
Familiarity with active-learning, continuous-training, or online distillation pipelines
Background in model registry tooling (MLflow, BentoML, SageMaker Registry) and evaluation dashboards
Prior work with computer-vision models (YOLO, DETR, Faster R-CNN) or video understanding at scale
Contributions to open-source ML infra projects or published talks/blogs on MLOps
Exposure to edge-deployment or real-time inference systems
Experience shipping high-quality production code in Python

Benefits

Generous health, dental, and vision insurance.
Highly competitive paid parental leave and support system.
Ownership in the business through an Equity Incentive Plan.
Generous paid time off and/or flexible work arrangements.
Daily meals in-office, vibrant company events, team-building.
401K retirement plan, HSA options, pre-tax Commuter Card.

Company

Voxel

Voxel enhances workplace safety and operational efficiency by transforming existing security cameras into intelligent monitoring systems.

H1B Sponsorship

Voxel has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data from the US Department of Labor)
Trends of Total Sponsorships
2025 (5)
2024 (4)
2023 (4)
2021 (2)

Funding

Current Stage: Growth Stage
Total Funding: $77M
Key Investors: Ericsson Ventures, NewRoad Capital Partners, Rite-Hite
2025-10-29: Series Unknown
2025-06-03: Series B · $47M
2023-08-30: Series Unknown · $12M

Leadership Team

Troy Carlson, Co-Founder
Kayvon Deldar, VP of Partnerships
Company data provided by Crunchbase