Staff Software Engineer - ML Observability jobs in United States
cer-icon
Apply on Employer Site
company-logo

Datadog · 2 weeks ago

Staff Software Engineer - ML Observability

Datadog is a global SaaS business focused on enabling digital transformation and infrastructure monitoring. The Staff Engineer will lead the development of observability tools for AI systems, particularly those using Large Language Models, and will influence product direction while collaborating with cross-functional teams.

AnalyticsCloud ComputingCloud Data ServicesCloud InfrastructureData ManagementDevOpsProductivity ToolsSaaS
check
H1B Sponsor Likelynote

Responsibilities

Drive design and implementation of LLM observability features
Ideate, prototype, and scale new product features to provide insights and drive improvements for generative AI systems
Work cross-functionally with other eng teams, product, UX, and applied science to iterate fast and find product-market fit
Develop and extend tools for tracing, evaluating, and debugging LLMs
Influence architecture decisions and mentor engineers to build resilient, high-performance systems
Stay close to customer pain points and use those insights to guide product and engineering priorities
Stay current with industry trends and advancements in machine learning and observability, driving innovation within the team

Qualification

LLM-powered applicationsDistributed systemsScalable backend architecturesObservability toolsModel evaluation techniquesClean codeProduct-oriented mindsetCommunication skills

Required

You have a BS/MS/PhD in a Computer Science, Engineering or related scientific field or equivalent experience
Deep understanding of distributed systems and scalable backend architectures
Hands-on experience building and shipping LLM-powered or GenAI applications
Understanding of model internals, inference pipelines, evaluation techniques, and prompt engineering
Ability to thrive in ambiguous, fast-changing spaces and have a product-oriented mindset
You're excited to shape the next generation of AI observability tools from the ground up
Communicate clearly, think rigorously, and take pride in clean, maintainable code
Experience with observability tools/platforms

Benefits

Healthcare
Dental
Parental planning
Mental health benefits
A 401(k) plan and match
Paid time off
Fitness reimbursements
A discounted employee stock purchase plan

Company

Datadog is an observability and security platform that offers infrastructure, applications, software development, and monitoring services.

H1B Sponsorship

Datadog has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (123)
2024 (66)
2023 (45)
2022 (53)
2021 (31)
2020 (29)

Funding

Current Stage
Public Company
Total Funding
$1.02B
Key Investors
ICONIQ GrowthIndex VenturesOpenView
2024-12-09Post Ipo Debt· $870M
2020-05-28Post Ipo Debt
2019-09-19IPO

Leadership Team

leader-logo
Olivier Pomel
Co-founder, CEO
linkedin
leader-logo
Alexis Le-Quoc
Co-founder & CTO
linkedin
Company data provided by crunchbase