Databricks · 3 days ago
Staff Software Engineer, Model Serving
Databricks is a data and AI company that enables data teams to solve complex problems through its platform. The Staff Engineer will design and implement core systems for the Model Serving product, ensuring scalability and reliability while collaborating with various teams to enhance product performance.
AnalyticsArtificial Intelligence (AI)Data StorageInformation TechnologyMachine Learning
Responsibilities
Design and implement core systems and APIs that power Databricks Model Serving, ensuring scalability, reliability, and operational excellence
Partner with product and engineering leadership to define the technical roadmap and long-term architecture for serving workloads
Drive architectural decisions and trade-offs to optimize performance, throughput, autoscaling, and operational efficiency for CPU and GPU serving workloads
Contribute directly to key components across the serving infrastructure — from model container builds and deployment workflows to runtime systems like routing, caching, observability, and intelligent autoscaling — ensuring smooth and efficient operations at scale
Collaborate cross-functionally with product, platform, and research teams to translate customer needs into reliable and performant systems
Lead technical initiatives that improve latency, availability, and cost-effectiveness across both customer-facing and foundational serving layers
Establish best practices for code quality, testing, and operational readiness, and mentor other engineers through design reviews and technical guidance
Represent the team in cross-organizational technical discussions and influence Databricks’ broader AI platform strategy
Qualification
Required
10+ years of experience building and operating large-scale distributed systems
Deep expertise in model serving, inference systems, and related infrastructure (e.g., routing, scheduling, autoscaling, and observability)
Strong foundation in algorithms, data structures, and system design as applied to large-scale, low-latency serving systems
Proven ability to deliver technically complex, high-impact initiatives that create measurable customer or business value
Experience leading architecture for large-scale, performance-sensitive CPU/GPU inference systems
Strong communication skills and ability to collaborate across teams in fast-moving environments
Strategic and product-oriented mindset with the ability to align technical execution with long-term vision
Passion for mentoring, growing engineers, and fostering technical excellence
Benefits
Eligibility for annual performance bonus
Equity
Company
Databricks
Databricks is a data and AI platform that unifies data engineering, analytics, and machine learning on a lakehouse architecture.
H1B Sponsorship
Databricks has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (385)
2024 (319)
2023 (227)
2022 (222)
2021 (166)
2020 (64)
Funding
Current Stage
Late StageTotal Funding
$25.81BKey Investors
Counterpoint GlobalFranklin TempletonAndreessen Horowitz
2025-12-16Series Unknown· $4B
2025-09-08Series Unknown· $1B
2025-01-13Debt Financing· $5.25B
Recent News
Crunchbase News
2026-01-09
2026-01-09
Company data provided by crunchbase