Software Engineer - Cloud FinOps & Reliability jobs in United States
cer-icon
Apply on Employer Site
company-logo

Luma AI · 2 weeks ago

Software Engineer - Cloud FinOps & Reliability

Luma AI is dedicated to building multimodal AI to enhance human capabilities, and they are seeking a Software Engineer specialized in Cloud FinOps and Reliability. This role focuses on optimizing financial efficiency within a large multi-cloud GPU infrastructure, requiring expertise in cloud architecture and cost management.

Artificial Intelligence (AI)Foundational AIGenerative AIVideoVideo Editing
check
H1B Sponsor Likelynote
Hiring Manager
Lorenzo Casanova
linkedin

Responsibilities

Analyze & Optimize: Actively monitor and analyze costs across our entire technical ecosystem—including multi-cloud infrastructure (AWS, GCP, OCI), on-premise clusters, and third-party services—to identify and execute on opportunities for cost optimization. Develop forecasting models to predict future spend and inform our capacity planning
Manage & Commit: Develop and actively manage a multi-million dollar portfolio of Reserved Instances (RIs) and Savings Plans to maximize commitment-based discounts across our global GPU and CPU fleets
Automate & Build: Apply a software engineering approach to design, build, and maintain custom tools and automation in Python and SQL. Your systems will track, analyze, and report on costs across our entire fleet of providers and services, with a focus on detecting anomalies immediately
Partner & Advise: Working closely as an embedded member of the SRE team, you will partner with fellow SREs and research teams to model the cost implications of new models and infrastructure designs, providing expert guidance on cost-performance trade-offs
Visualize & Report: Create and manage a centralized observability stack for cloud costs, building dashboards in tools like Grafana to give a real-time, granular view of our financial posture to all stakeholders

Qualification

Cloud cost managementPythonCloud architectureData analysisSQLAWSGCPDockerKubernetesTroubleshootingData-driven decision-makingFinOps Certified Practitioner

Required

5+ years of experience in a technical role such as Site Reliability Engineer, DevOps Engineer, Infrastructure Engineer, or a dedicated Cloud Cost Engineer
Deep, hands-on expertise with the cost models and optimization levers of at least one major cloud provider (AWS, GCP), and a willingness to learn others
Proficient in Python for the purpose of scripting, data analysis, and building automation tooling
Strong, foundational understanding of cloud infrastructure, including containerization (Docker, Kubernetes), networking, and storage
Not an accountant; a systems thinker who is passionate about applying engineering principles to solve financial challenges at scale
Tenacious troubleshooter and a data-driven decision-maker who thrives on finding the 'why' behind the numbers

Preferred

Experience managing a monthly cloud spend in excess of $1 million
Relevant certifications, such as the FinOps Certified Practitioner (FOCP)
Experience building custom cost allocation, showback, or chargeback systems from scratch
A background working with large-scale GPU clusters for AI/ML workloads

Company

Luma AI

twittertwittertwitter
company-logo
Luma AI develops tools that let users generate photorealistic images and videos from text, image, or video prompts.

H1B Sponsorship

Luma AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (10)
2024 (3)

Funding

Current Stage
Growth Stage
Total Funding
$1.06B
Key Investors
HUMAINAndreessen HorowitzAmplify Partners
2025-11-19Series C· $900M
2024-12-06Series B· $90M
2024-01-09Series B· $43M

Leadership Team

leader-logo
Amit Jain
Co-Founder
linkedin
Company data provided by crunchbase