SIGN IN
Lead Software Engineer, Observability - AdTech Leader jobs in United States
cer-icon
Apply on Employer Site
company-logo

Andiamo · 15 hours ago

Lead Software Engineer, Observability - AdTech Leader

Andiamo is a globally recognized staffing and consulting firm specializing in placing top technology professionals. They are seeking a Lead Software Engineer in Observability and Reliability to ensure large scale services remain resilient and predictable, while shaping how engineering teams operate critical systems.
ConsultingHuman ResourcesInformation TechnologyStaffing Agency
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

You will design, build, and operate foundational services that enable highly available and scalable systems
You will identify systemic bottlenecks and lead efforts to remove them, achieving meaningful gains in throughput, latency, and resilience
You will develop tooling, automation, and processes that prevent incidents before they happen, working with partners to address root causes rather than symptoms
You will define and own the technical roadmap for your domain, collaborating with stakeholders to prioritize the highest impact work
You will write and maintain production software that improves service availability, operational efficiency, and performance
You will work closely with product engineers and other reliability engineers to ship changes that matter
You will participate in an on call rotation with a strong emphasis on learning, prevention, and alert quality
When issues arise, you will help drive clear diagnosis, resolution, and long term fixes
You will use data and quantitative analysis to understand system behavior, guide scaling decisions, and measure improvement
You will actively promote reliability best practices through design reviews, documentation, and hands on collaboration

Qualification

Site Reliability EngineeringPlatform EngineeringPythonLinux SystemsGoContainer OrchestrationInfrastructure as CodeData AnalysisAutomationObservabilitySoft Skills

Required

A decade or more of experience in site reliability engineering, platform engineering, or DevOps focused roles
Significant experience operating production systems and understanding how software behaves under real world conditions
Comfortable leading through incidents and guiding teams from failure through root cause analysis to durable prevention
Strong understanding of Linux systems and networking fundamentals, from the operating system up through application level behavior
Experience building software as part of an engineering team and writing high quality code in languages such as Python, Go, or similar
Ability to apply sound engineering practices to both product code and operational tooling
Curious and growth oriented, always looking to improve skills and raise the bar for those around
Experience experimenting with artificial intelligence in professional or personal projects and eagerness to explore how new tools can responsibly improve reliability and efficiency

Company

The Talent Partners for the AI Revolution.

H1B Sponsorship

Andiamo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2022 (2)
2021 (1)

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Patrick McAdams
CEO & Co-Founder
linkedin
leader-logo
Steven Kottler
CFO
linkedin
Company data provided by crunchbase