fal · 3 months ago
Staff Software Engineer, Compute
Fal is a company focused on building large-scale computation platforms. They are seeking an experienced Staff Software Engineer to develop and maintain their core Python platform and infrastructure layer, ensuring efficient workload orchestration and resource management.
AI Infrastructure, Artificial Intelligence (AI), Developer Platform, Information Technology, Machine Learning
Responsibilities
Develop and maintain our core Python platform, which handles request routing, AI workload orchestration, GPU server capacity management, observability, authentication, rate limiting, and more
Develop and maintain our infrastructure layer where we use Terraform, Ansible, and provider APIs to manage our fleet of GPU workers
Own K8s, FluxCD, Nomad, Prometheus, Thanos, Grafana, Loki, distributed networking storage, and other technologies that underpin our platform
Create the vision and lay the foundation for where our infrastructure should go in the next 1/2/5 years
Qualifications
Required
Deep experience building distributed compute platforms, preferably with Python
Strong foundation in managing both cloud and bare metal infrastructure
Solid understanding of K8s and of running CI/CD on it
Excellent communication
Self-starter who executes quickly, takes ownership, and constantly seeks improvement
Benefits
Employee-friendly equity terms (early exercise, extended exercise)
Health, dental, and vision insurance (US)
Regular team events and offsites
Company
fal
Fal is a generative media platform that helps developers create applications using AI models.
Funding
Current Stage
Late Stage
Total Funding
$337M
Key Investors
Sequoia Capital, Meritech Capital Partners, Kindred Ventures
2025-12-09 · Series D · $140M
2025-07-31 · Series C · $125M
2025-02-12 · Series B · $49M
Company data provided by Crunchbase