Scout AI · 1 month ago
AI Infrastructure Engineer - Fury Team
ScoutAI is developing Fury, the first robotic foundation model for defense, aiming to empower U.S. forces with intelligent machines. The AI Infrastructure Engineer will build and scale the infrastructure that supports model training and deployment, ensuring efficient operations across edge and cloud environments.
Artificial Intelligence (AI)DatabaseText Analytics
Responsibilities
Design and implement data pipelines for ingesting, transforming, and storing petabytes of multimodal data from Fury’s robotic and operator systems
Develop internal tooling for dataset exploration, curation, versioning, and quality monitoring over time
Build and maintain distributed training infrastructure (cloud and on-prem) for large-scale multimodal and foundation model training
Implement job orchestration workflows for launching, tracking, and debugging large-scale model runs
Identify and remediate bottlenecks in compute, memory, storage, and network performance to optimize throughput and cost efficiency
Collaborate with AI, autonomy, and systems teams to ensure data and training infrastructure supports real-time and mission-critical use cases
Maintain observability and reliability tooling for training and inference pipelines
Stay current on best practices in MLOps, distributed training frameworks, and AI infrastructure at scale
Qualification
Required
3+ years of experience in ML infrastructure, MLOps, or large-scale data systems
Proven experience with distributed training (PyTorch DDP, DeepSpeed, Ray, or similar) and workflow orchestration (Kubernetes, Airflow, or equivalent)
Strong proficiency in Python and cloud-native infrastructure (AWS, GCP, or Azure)
Deep understanding of data engineering (ETL pipelines, object storage, data versioning, metadata management)
Familiarity with containerization and deployment (Docker, Kubernetes) and monitoring systems (Prometheus, Grafana)
Experience optimizing GPU cluster utilization, scaling training jobs, and profiling model performance
Bachelor's degree or higher in Computer Science, Electrical Engineering, or related technical field
Must be a U.S. Person due to required access to U.S. export controlled information or facilities
Preferred
Experience with edge-deployed ML systems
Experience with federated training
Experience with robotic data collection pipelines
Benefits
Premium medical, dental, and vision plans with $0 paycheck contribution
Competitive PTO and company holiday calendar
Catered lunch daily and fully stocked kitchen
EV charging
Relocation assistance (depending on role eligibility)
Company
Scout AI
Scout AI uses data and technology to serve as a scout agency that handles operations and recruiting using data analysis.
Funding
Current Stage
Early StageRecent News
Company data provided by crunchbase