AI Infrastructure Engineer - Fury Team jobs in United States
cer-icon
Apply on Employer Site
company-logo

Scout AI · 1 month ago

AI Infrastructure Engineer - Fury Team

ScoutAI is developing Fury, the first robotic foundation model for defense, aiming to empower U.S. forces with intelligent machines. The AI Infrastructure Engineer will build and scale the infrastructure that supports model training and deployment, ensuring efficient operations across edge and cloud environments.

Artificial Intelligence (AI)DatabaseText Analytics
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Design and implement data pipelines for ingesting, transforming, and storing petabytes of multimodal data from Fury’s robotic and operator systems
Develop internal tooling for dataset exploration, curation, versioning, and quality monitoring over time
Build and maintain distributed training infrastructure (cloud and on-prem) for large-scale multimodal and foundation model training
Implement job orchestration workflows for launching, tracking, and debugging large-scale model runs
Identify and remediate bottlenecks in compute, memory, storage, and network performance to optimize throughput and cost efficiency
Collaborate with AI, autonomy, and systems teams to ensure data and training infrastructure supports real-time and mission-critical use cases
Maintain observability and reliability tooling for training and inference pipelines
Stay current on best practices in MLOps, distributed training frameworks, and AI infrastructure at scale

Qualification

ML infrastructureDistributed trainingPythonCloud-native infrastructureData engineeringContainerizationWorkflow orchestrationMonitoring systemsGPU optimizationEdge-deployed ML systems

Required

3+ years of experience in ML infrastructure, MLOps, or large-scale data systems
Proven experience with distributed training (PyTorch DDP, DeepSpeed, Ray, or similar) and workflow orchestration (Kubernetes, Airflow, or equivalent)
Strong proficiency in Python and cloud-native infrastructure (AWS, GCP, or Azure)
Deep understanding of data engineering (ETL pipelines, object storage, data versioning, metadata management)
Familiarity with containerization and deployment (Docker, Kubernetes) and monitoring systems (Prometheus, Grafana)
Experience optimizing GPU cluster utilization, scaling training jobs, and profiling model performance
Bachelor's degree or higher in Computer Science, Electrical Engineering, or related technical field
Must be a U.S. Person due to required access to U.S. export controlled information or facilities

Preferred

Experience with edge-deployed ML systems
Experience with federated training
Experience with robotic data collection pipelines

Benefits

Premium medical, dental, and vision plans with $0 paycheck contribution
Competitive PTO and company holiday calendar
Catered lunch daily and fully stocked kitchen
EV charging
Relocation assistance (depending on role eligibility)

Company

Scout AI

twittertwitter
company-logo
Scout AI uses data and technology to serve as a scout agency that handles operations and recruiting using data analysis.

Funding

Current Stage
Early Stage
Company data provided by crunchbase