Nebius · 12 hours ago

Senior Software Engineer, Observability

Nebius is leading a new era in cloud computing to serve the global AI economy. They are seeking a Senior Software Engineer to design, build, and own backend systems that power metrics and monitor large-scale infrastructure.

AI Infrastructure, Cloud Infrastructure, GPU, IaaS, PaaS
Growth Opportunities

Responsibilities

Design and build services and agents that provide deep visibility into large-scale server fleets and data center engineering systems
Evolve metrics, aggregation, and alerting pipelines, with a focus on signal quality and reliability
Design and operate maintenance and remediation systems that enable safe, predictable fleet-wide changes and keep infrastructure healthy
Investigate production incidents hands-on, including on-host Linux debugging, and drive root-cause fixes
Collaborate closely with hardware, networking, and data center operations teams to improve reliability

Qualifications

Python, Go, Linux, Production systems, Ubuntu, CCNA

Required

5+ years of professional software engineering experience
Strong production experience with Python and Go, or the ability to ramp up quickly
Solid Linux fundamentals and comfort debugging live systems
Ability to write reliable, maintainable code and dig into complex, ambiguous problems
Experience building and operating production systems at scale

Preferred

Ubuntu experience, including internal tooling and packaging workflows (e.g., building Debian packages)
CCNA (Cisco Certified Network Associate) or equivalent networking experience

Benefits

Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
401(k) plan: up to 4% company match with immediate vesting.
Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
Remote work reimbursement: up to $85/month for mobile and internet.
Disability & life insurance: company-paid short-term, long-term and life insurance coverage.

Company

Nebius

The Nebius AI Cloud brings powerful full-stack infrastructure for AI developers and practitioners across startups, enterprises and science institutes to build and deploy generative AI applications and rapidly deliver scientific breakthroughs by training and running ML models within a secure, high-performance, and cost-optimized cloud environment.

Funding

Current Stage: Late Stage
Total Funding: $1.04B
2025-06-04: Debt Financing · $1B
2025-05-15: Grant · $45M
2024-12-02: Seed

Leadership Team

Evan Helda
Head of Physical AI
Vinita Ananth
Sr. Director of Product
Company data provided by Crunchbase