Fireworks AI · 1 day ago
Member of Technical Staff, Cloud Infrastructure
Fireworks AI is building the future of generative AI infrastructure, delivering high-quality models with scalable inference. The role involves architecting and building foundational systems for their generative AI platform, focusing on reliability, efficiency, and scalability across various cloud providers.
Artificial Intelligence (AI)Data ManagementSaaSSoftware
Responsibilities
Architect and build scalable, resilient, and high-performance backend infrastructure to support distributed training, inference, and data processing pipelines
Lead technical design discussions, mentor other engineers, and establish best practices for building and operating large-scale ML infrastructure
Design and implement core backend services (e.g., job schedulers, resource managers, autoscalers, model serving layers) with a focus on efficiency and low latency
Drive infrastructure optimization initiatives, including compute cost reduction, storage lifecycle management, and network performance tuning
Collaborate cross-functionally with ML, DevOps, and product teams to translate research and product needs into robust infrastructure solutions
Continuously evaluate and integrate cloud-native and open-source technologies (e.g., Kubernetes, Ray, Kubeflow, MLFlow) to enhance our platform’s capabilities and reliability
Own end-to-end systems from design to deployment and observability, with a strong emphasis on reliability, fault tolerance, and operational excellence
Qualification
Required
Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
5+ years of experience designing and building backend infrastructure in cloud environments (e.g., AWS, GCP, Azure)
Proven experience in ML infrastructure and tooling (e.g., PyTorch, TensorFlow, Vertex AI, SageMaker, Kubernetes, etc.)
Strong software development skills in languages like Python, or C++
Deep understanding of distributed systems fundamentals: scheduling, orchestration, storage, networking, and compute optimization
Preferred
Master's or PhD in Computer Science or related field
Experience leading infrastructure projects supporting large-scale ML/AI workloads or high-throughput systems
Familiarity with infrastructure-as-code and CI/CD tooling (e.g., Terraform, ArgoCD, GitOps)
Track record of driving system performance, reliability, and cost-efficiency improvements
Contributions to open-source cloud or ML infrastructure projects a plus
Benefits
Meaningful equity in a fast-growing startup
Competitive salary
Comprehensive benefits package
Company
Fireworks AI
Fireworks AI is an advanced platform that enables users to build, tune, and scale AI applications using open-source models
H1B Sponsorship
Fireworks AI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (11)
2024 (3)
2023 (1)
Funding
Current Stage
Growth StageTotal Funding
$327MKey Investors
Sequoia CapitalBenchmark
2025-10-28Series C· $230M
2025-10-28Secondary Market· $20M
2024-07-07Series B· $52M
Recent News
2025-11-19
Tech Startups - Tech News, Tech Trends & Startup Funding
2025-11-15
Tech Startups - Tech News, Tech Trends & Startup Funding
2025-11-15
Company data provided by crunchbase