1.65 Senior Machine Learning Platform Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

FieldAI · 10 hours ago

1.65 Senior Machine Learning Platform Engineer

FieldAI is transforming how robots interact with the real world, building risk-aware, reliable AI systems to tackle complex challenges in robotics. As a Senior Machine Learning Platform Engineer, you will own the infrastructure that powers the Field-insight Foundation Model, designing and operating large-scale ML platforms and mentoring junior engineers.

Enterprise SoftwareRobotic Process Automation (RPA)Robotics
check
H1B Sponsor Likelynote

Responsibilities

Design and manage scalable ML infrastructure with IaC tools (Terraform, CloudFormation)
Develop and optimize cloud-based pipelines for training, evaluation, and inference on multimodal datasets
Build and operate data systems for large-scale video ingestion, indexing, and storage
Maintain MLOps workflows for versioning, experiment tracking, reproducibility, and CI/CD
Ensure reliability and observability with monitoring, logging, and alerting
Collaborate with AI/ML Engineers to productionize workflows
Optimize infrastructure for performance and cost across cloud and edge
Enforce best practices in security, compliance, and maintainability
Mentor and manage junior engineers, providing technical guidance and career development

Qualification

ML infrastructurePython/TypeScriptDistributed systemsCloud platformsMLOps workflowsCI/CD pipelinesInfrastructure-as-codeData managementSecurity complianceMentoring

Required

Bachelor's/Master's in Computer Science, Engineering, or related field (or equivalent experience)
4+ years of industry experience in ML infrastructure or platform engineering
Strong coding skills in Python/TypeScript and a strong foundation in software engineering best practices
Proven experience with distributed systems, cloud platforms (AWS preferred), containerization and orchestration (Docker, Kubernetes/EKS, Ray), and serverless
Hands-on experience building ML pipelines for distributed training and large-scale inference
Strong knowledge of data management at scale, including preprocessing and retrieval of video/image datasets
Proficiency with CI/CD pipelines, infrastructure-as-code (Terraform, CloudFormation), and automation
Familiarity with MLOps tools (MLflow, Kubeflow, Airflow)
Experience with system monitoring and observability in production
Design and manage scalable ML infrastructure with IaC tools (Terraform, CloudFormation)
Develop and optimize cloud-based pipelines for training, evaluation, and inference on multimodal datasets
Build and operate data systems for large-scale video ingestion, indexing, and storage
Maintain MLOps workflows for versioning, experiment tracking, reproducibility, and CI/CD
Ensure reliability and observability with monitoring, logging, and alerting
Collaborate with AI/ML Engineers to productionize workflows
Optimize infrastructure for performance and cost across cloud and edge
Enforce best practices in security, compliance, and maintainability
Mentor and manage junior engineers, providing technical guidance and career development

Preferred

Experience with vector databases (OpenSearch, Pinecone, Weaviate) for indexing and retrieval
Familiarity with distributed training frameworks (Horovod, DDP/FSDP, DeepSpeed, Ray)
Hands-on experience with GPU orchestration and auto-scaling (Karpenter, SageMaker, EKS)
Experience with agentic AI deployment workflows, orchestration frameworks, and retrieval-augmented generation
Strong knowledge of security and compliance in ML and cloud environments

Company

FieldAI

twittertwitter
company-logo
FieldAI is pioneering the development of a field-proven, hardware agnostic brain technology that enables many different types of robots to operate autonomously in hazardous, offroad, and potentially harsh industrial settings – all without GPS, maps, or any pre-programmed routes.

H1B Sponsorship

FieldAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9)

Funding

Current Stage
Early Stage
Total Funding
$405M
2025-08-20Series Unknown· $91M
2025-08-20Series A· $314M

Leadership Team

leader-logo
Ali Agha
Founder and CEO
linkedin
Company data provided by crunchbase