Fractal · 2 days ago
Databricks SRE Platform Engineer
Fractal Analytics is a strategic AI partner to Fortune 500 companies with a vision to power every human decision in the enterprise. The Databricks SRE and Support Engineer will work on operations related to the AI/ML upskilling program on Databricks, ensuring stability, reliability, scalability, and performance.
Responsibilities
Provide continuous SRE support to thousands of geographically distributed users on the AI Dojo Databricks platform: respond to tickets, triage support requests, and liaise with customers
Improve existing Infrastructure as Code (IaC) in line with DevOps best practices
Develop and maintain monitoring frameworks to respond promptly to outages and other service interruptions
Collaborate with internal cybersecurity teams to ensure all systems and operations comply with industry standards and are secure against evolving threats
Forecast and manage capacity requirements for the AI/ML training environment, while identifying opportunities to reduce costs without compromising performance
Qualifications
Required
Bachelor's degree in computer science, information technology, or a related field
6+ years of infrastructure experience: proven experience working on large-scale, cloud-based, enterprise-level software platforms and a deep understanding of the Databricks environment
Experience building GitHub Actions pipelines, including composite actions, OIDC federation for cloud provider identity acquisition, and use of environments and deployment controls
Experience building Databricks Asset Bundles and Terraform pipelines to manage and deploy Databricks Platform and Workspace resources
Fluency in Python, experience with the Databricks Python SDK to perform Workspace operations, and familiarity with PySpark and Delta Lake
Deep familiarity with Databricks APIs and with the Databricks CLI for provisioning Workspace identities and filesystem resources and for querying account-level and workspace-level Users, Groups, and Service Principals
Strong understanding of security best practices and experience ensuring compliance with relevant regulatory frameworks
3+ years of practical experience with Infrastructure-as-Code and CI/CD tools such as Terraform, GitHub Actions, Databricks Asset Bundles, and the like
3+ years of experience working with AWS IaC deployments performing account provisioning, implementing cross-account automation, and building resource and identity policies supporting least-privilege access roles
3+ years of experience working in support teams that are geographically distributed
Benefits
Health, dental, vision, life insurance, and disability plans
401(k) Plan
11 paid holidays
12 weeks of Parental Leave
“free time” PTO policy
Company
Fractal
Fractal is an AI firm with the aspiration to power every human decision in the enterprise.
Funding
Current Stage: Late Stage
Total Funding: $862.67M
Key Investors: Srikanth Velamakanni, TPG Capital Asia, Apax Partners
2025-07-15 · Secondary Market · $172M
2025-07-12 · Undisclosed · $5.67M
2022-01-05 · Private Equity · $360M
Company data provided by Crunchbase