Andiamo · 15 hours ago
Lead Software Engineer, Observability - AdTech Leader
Andiamo is a globally recognized staffing and consulting firm specializing in placing top technology professionals. They are seeking a Lead Software Engineer in Observability and Reliability to ensure large scale services remain resilient and predictable, while shaping how engineering teams operate critical systems.
ConsultingHuman ResourcesInformation TechnologyStaffing Agency
Responsibilities
You will design, build, and operate foundational services that enable highly available and scalable systems
You will identify systemic bottlenecks and lead efforts to remove them, achieving meaningful gains in throughput, latency, and resilience
You will develop tooling, automation, and processes that prevent incidents before they happen, working with partners to address root causes rather than symptoms
You will define and own the technical roadmap for your domain, collaborating with stakeholders to prioritize the highest impact work
You will write and maintain production software that improves service availability, operational efficiency, and performance
You will work closely with product engineers and other reliability engineers to ship changes that matter
You will participate in an on call rotation with a strong emphasis on learning, prevention, and alert quality
When issues arise, you will help drive clear diagnosis, resolution, and long term fixes
You will use data and quantitative analysis to understand system behavior, guide scaling decisions, and measure improvement
You will actively promote reliability best practices through design reviews, documentation, and hands on collaboration
Qualification
Required
A decade or more of experience in site reliability engineering, platform engineering, or DevOps focused roles
Significant experience operating production systems and understanding how software behaves under real world conditions
Comfortable leading through incidents and guiding teams from failure through root cause analysis to durable prevention
Strong understanding of Linux systems and networking fundamentals, from the operating system up through application level behavior
Experience building software as part of an engineering team and writing high quality code in languages such as Python, Go, or similar
Ability to apply sound engineering practices to both product code and operational tooling
Curious and growth oriented, always looking to improve skills and raise the bar for those around
Experience experimenting with artificial intelligence in professional or personal projects and eagerness to explore how new tools can responsibly improve reliability and efficiency
Company
Andiamo
The Talent Partners for the AI Revolution.
H1B Sponsorship
Andiamo has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2022 (2)
2021 (1)
Funding
Current Stage
Growth StageCompany data provided by crunchbase