Member of Technical Staff, Cloud Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

Fireworks AI · 1 day ago

Member of Technical Staff, Cloud Infrastructure

Fireworks AI is building the future of generative AI infrastructure, delivering high-quality models with scalable inference. The role involves architecting and building foundational systems for their generative AI platform, focusing on reliability, efficiency, and scalability across various cloud providers.

Artificial Intelligence (AI)Data ManagementSaaSSoftware
check
H1B Sponsor Likelynote

Responsibilities

Architect and build scalable, resilient, and high-performance backend infrastructure to support distributed training, inference, and data processing pipelines
Lead technical design discussions, mentor other engineers, and establish best practices for building and operating large-scale ML infrastructure
Design and implement core backend services (e.g., job schedulers, resource managers, autoscalers, model serving layers) with a focus on efficiency and low latency
Drive infrastructure optimization initiatives, including compute cost reduction, storage lifecycle management, and network performance tuning
Collaborate cross-functionally with ML, DevOps, and product teams to translate research and product needs into robust infrastructure solutions
Continuously evaluate and integrate cloud-native and open-source technologies (e.g., Kubernetes, Ray, Kubeflow, MLFlow) to enhance our platform’s capabilities and reliability
Own end-to-end systems from design to deployment and observability, with a strong emphasis on reliability, fault tolerance, and operational excellence

Qualification

Cloud infrastructureDistributed systemsMachine learning platformsBackend services designKubernetesPythonCI/CD toolingTeam collaborationMentoringProblem-solving

Required

Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
5+ years of experience designing and building backend infrastructure in cloud environments (e.g., AWS, GCP, Azure)
Proven experience in ML infrastructure and tooling (e.g., PyTorch, TensorFlow, Vertex AI, SageMaker, Kubernetes, etc.)
Strong software development skills in languages like Python, or C++
Deep understanding of distributed systems fundamentals: scheduling, orchestration, storage, networking, and compute optimization

Preferred

Master's or PhD in Computer Science or related field
Experience leading infrastructure projects supporting large-scale ML/AI workloads or high-throughput systems
Familiarity with infrastructure-as-code and CI/CD tooling (e.g., Terraform, ArgoCD, GitOps)
Track record of driving system performance, reliability, and cost-efficiency improvements
Contributions to open-source cloud or ML infrastructure projects a plus

Benefits

Meaningful equity in a fast-growing startup
Competitive salary
Comprehensive benefits package

Company

Fireworks AI

twittertwittertwitter
company-logo
Fireworks AI is an advanced platform that enables users to build, tune, and scale AI applications using open-source models

H1B Sponsorship

Fireworks AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (11)
2024 (3)
2023 (1)

Funding

Current Stage
Growth Stage
Total Funding
$327M
Key Investors
Sequoia CapitalBenchmark
2025-10-28Series C· $230M
2025-10-28Secondary Market· $20M
2024-07-07Series B· $52M

Leadership Team

leader-logo
Lin Qiao
CEO and cofounder
linkedin
leader-logo
Aishwarya Srinivasan
Head of AI Developer Relations
linkedin

Recent News

Company data provided by crunchbase