Member of Technical Staff, AI Training Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

Fireworks AI · 1 day ago

Member of Technical Staff, AI Training Infrastructure

Fireworks AI is building the future of generative AI infrastructure, delivering high-quality models with scalable inference. The role involves designing and optimizing infrastructure for large-scale model training, collaborating with researchers to create robust training pipelines and ensure reliable model development.

Artificial Intelligence (AI)Data ManagementSaaSSoftware
check
H1B Sponsor Likelynote

Responsibilities

Design and implement scalable infrastructure for large-scale model training workloads
Develop and maintain distributed training pipelines for LLMs and multimodal models
Optimize training performance across multiple GPUs, nodes, and data centers
Implement monitoring, logging, and debugging tools for training operations
Architect and maintain data storage solutions for large-scale training datasets
Automate infrastructure provisioning, scaling, and orchestration for model training
Collaborate with researchers to implement and optimize training methodologies
Analyze and improve efficiency, scalability, and cost-effectiveness of training systems
Troubleshoot complex performance issues in distributed training environments

Qualification

Distributed systemsML infrastructurePyTorchCloud platformsContainerizationOrchestrationDistributed training techniquesML workflow orchestrationML DevOps practicesOpen-source contributions

Required

Bachelor's degree in Computer Science, Computer Engineering, or related field, or equivalent practical experience
3+ years of experience with distributed systems and ML infrastructure
Experience with PyTorch
Proficiency in cloud platforms (AWS, GCP, Azure)
Experience with containerization, orchestration (Kubernetes, Docker)
Knowledge of distributed training techniques (data parallelism, model parallelism, FSDP)

Preferred

Master's or PhD in Computer Science or related field
Experience training large language models or multimodal AI systems
Experience with ML workflow orchestration tools
Background in optimizing high-performance distributed computing systems
Familiarity with ML DevOps practices
Contributions to open-source ML infrastructure or related projects

Benefits

Meaningful equity in a fast-growing startup
Competitive salary
Comprehensive benefits package

Company

Fireworks AI

twittertwittertwitter
company-logo
Fireworks AI is an advanced platform that enables users to build, tune, and scale AI applications using open-source models

H1B Sponsorship

Fireworks AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (11)
2024 (3)
2023 (1)

Funding

Current Stage
Growth Stage
Total Funding
$327M
Key Investors
Sequoia CapitalBenchmark
2025-10-28Series C· $230M
2025-10-28Secondary Market· $20M
2024-07-07Series B· $52M

Leadership Team

leader-logo
Lin Qiao
CEO and cofounder
linkedin
leader-logo
Aishwarya Srinivasan
Head of AI Developer Relations
linkedin

Recent News

Company data provided by crunchbase