Staff Software Engineer, Model Serving jobs in United States
cer-icon
Apply on Employer Site
company-logo

Databricks · 3 days ago

Staff Software Engineer, Model Serving

Databricks is a data and AI company that enables data teams to solve complex problems through its platform. The Staff Engineer will design and implement core systems for the Model Serving product, ensuring scalability and reliability while collaborating with various teams to enhance product performance.

AnalyticsArtificial Intelligence (AI)Data StorageInformation TechnologyMachine Learning
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design and implement core systems and APIs that power Databricks Model Serving, ensuring scalability, reliability, and operational excellence
Partner with product and engineering leadership to define the technical roadmap and long-term architecture for serving workloads
Drive architectural decisions and trade-offs to optimize performance, throughput, autoscaling, and operational efficiency for CPU and GPU serving workloads
Contribute directly to key components across the serving infrastructure — from model container builds and deployment workflows to runtime systems like routing, caching, observability, and intelligent autoscaling — ensuring smooth and efficient operations at scale
Collaborate cross-functionally with product, platform, and research teams to translate customer needs into reliable and performant systems
Lead technical initiatives that improve latency, availability, and cost-effectiveness across both customer-facing and foundational serving layers
Establish best practices for code quality, testing, and operational readiness, and mentor other engineers through design reviews and technical guidance
Represent the team in cross-organizational technical discussions and influence Databricks’ broader AI platform strategy

Qualification

Large-scale distributed systemsModel serving expertiseCPU/GPU inference systemsAlgorithmsData structuresSystem designMentoring engineersCommunication skillsCollaboration across teamsStrategic mindset

Required

10+ years of experience building and operating large-scale distributed systems
Deep expertise in model serving, inference systems, and related infrastructure (e.g., routing, scheduling, autoscaling, and observability)
Strong foundation in algorithms, data structures, and system design as applied to large-scale, low-latency serving systems
Proven ability to deliver technically complex, high-impact initiatives that create measurable customer or business value
Experience leading architecture for large-scale, performance-sensitive CPU/GPU inference systems
Strong communication skills and ability to collaborate across teams in fast-moving environments
Strategic and product-oriented mindset with the ability to align technical execution with long-term vision
Passion for mentoring, growing engineers, and fostering technical excellence

Benefits

Eligibility for annual performance bonus
Equity

Company

Databricks

company-logo
Databricks is a data and AI platform that unifies data engineering, analytics, and machine learning on a lakehouse architecture.

H1B Sponsorship

Databricks has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (385)
2024 (319)
2023 (227)
2022 (222)
2021 (166)
2020 (64)

Funding

Current Stage
Late Stage
Total Funding
$25.81B
Key Investors
Counterpoint GlobalFranklin TempletonAndreessen Horowitz
2025-12-16Series Unknown· $4B
2025-09-08Series Unknown· $1B
2025-01-13Debt Financing· $5.25B

Leadership Team

leader-logo
Ali Ghodsi
CEO and Co-founder
linkedin
leader-logo
David Conte
Chief Financial Officer
linkedin
Company data provided by crunchbase