
Andiamo · 4 days ago

Lead Engineer/SRE, KMS - AdTech Leader

Andiamo is a globally recognized staffing and consulting firm specializing in placing top technology professionals. The Lead Site Reliability Engineer will guide the reliability and performance of large-scale, customer-facing systems while fostering strong engineering practices and improving platform performance.

Consulting, Human Resources, Information Technology, Staffing Agency
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

Deliver foundational services that support rapid and predictable software delivery across the engineering organization
Create systems and operational processes that support reliable and scalable applications
Identify upstream solutions that prevent recurring issues and promote long-term stability
Develop the technical roadmap for your area, collaborating with stakeholders to solve meaningful engineering challenges
Improve throughput and system performance by analyzing and eliminating architectural bottlenecks
Work with tools and technologies such as Python, AWS, Django, Kubernetes, Bash, Terraform, MySQL, Redis, and Postgres
Help foster a culture of strong engineering practices through thoughtful design discussions and collaborative whiteboarding sessions
Support and mentor engineers across the company, helping raise the standard of engineering quality and operational excellence
Write and maintain software that improves the reliability, performance, and efficiency of platform services
Participate in on-call rotations with a focus on resolving issues at the source and reducing alert fatigue
Introduce architectural changes that significantly improve the scalability and resilience of critical systems
Work closely with product-oriented engineers and other SREs to deliver improvements that have real customer impact
Use data-driven analysis to understand system behavior, predict scaling needs, and guide strategic improvements
Promote site reliability principles across the engineering organization

Qualifications

Site Reliability Engineering, Performance Engineering, Distributed Systems, Python, AWS, Kubernetes, Terraform, MySQL, Redis, Postgres, Linux Systems, Networking Stack, Data Analysis, AI Tools, Collaboration, Mentoring

Required

Ten or more years of experience in site reliability engineering, DevOps, or related fields
Degree in computer science or a related field, or equivalent hands-on experience
Calm and focused during outages, with the ability to drive investigations to a clear root cause and long-term corrective measures
Strong understanding of Linux systems and the full networking stack
Experience collaborating with engineering teams to build and operate production software
Proficiency writing code using best practices in languages such as Python, Ruby, or Go
Genuine interest in exploring emerging AI tools and responsibly experimenting with techniques that improve engineering workflows

Company

The Talent Partners for the AI Revolution.

H1B Sponsorship

Andiamo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2022 (2)
2021 (1)

Funding

Current Stage
Growth Stage

Leadership Team

Patrick McAdams
CEO & Co-Founder
Steven Kottler
CFO
Company data provided by Crunchbase