Senior Engineering Manager, Engineering Operations jobs in United States
cer-icon
Apply on Employer Site
company-logo

Ridgeline · 8 hours ago

Senior Engineering Manager, Engineering Operations

Ridgeline is the industry cloud platform for investment management, and they are seeking a Senior Engineering Manager for Engineering Operations. This role involves leading a team to ensure platform reliability and cost-effectiveness while executing strategies for incident response and telemetry.

Artificial Intelligence (AI)Information TechnologySoftware
badNo H1Bnote

Responsibilities

Lead and evolve Ridgeline’s observability and telemetry ecosystem to ensure critical metrics are trustworthy, actionable, and widely adopted
Define and execute the company-wide incident management strategy, enabling rapid response and continuous learning
Drive cost optimization and forecasting by scaling our FinOps practice with integrated usage and financial telemetry
Collaborate with Site Reliability Engineering (SRE) to create cross-system observability standards and ensure consistency in logs, metrics, tracing, and cost data
Build a unified metrics platform that combines operational, financial, and organizational performance data for real-time executive decision-making
Identify, automate, and eliminate high-frequency operational tasks using AI, reducing toil and increasing focus on continuous improvement
Define, track, and communicate KPIs for system reliability, operational efficiency, and infrastructure cost-effectiveness
Mentor and grow a diverse team of engineers, fostering a culture of ownership, learning, and transparency

Qualification

SRE experienceObservability platformsIncident management frameworksSQL proficiencyData modelingBI toolsContinuous improvementResilienceCollaboration skillsEffective communicationMentorshipCalm under pressure

Required

10+ years of experience in SRE, infrastructure, or technical operations, including 3–6 years in a leadership role
Expertise in observability platforms like Datadog, Prometheus, ELK, or OpenTelemetry
Experience integrating technical telemetry with business metrics and cost models (e.g., cost-per-customer, MTTR, unit metrics)
Proven success scaling incident management frameworks and post-mortem processes
Proficiency with SQL, data modeling, or BI tools like Looker or Tableau
Strong collaboration skills and the ability to communicate technical insights to executive audiences
Calm, effective communicator who performs well under pressure and in incident response environments
Passion for continuous improvement, resilience, and mentorship

Preferred

Prior experience in the FinTech or SaaS industry
Familiarity with AI/ML solutions in observability and operations
Experience managing infrastructure in a cloud-native environment (e.g., AWS, Kubernetes)

Benefits

100% of Ridgeline employees can participate in our Company Stock Plan subject to the applicable Stock Option Agreement.
Unlimited vacation
Educational and wellness reimbursements
$0 cost employee insurance plans

Company

Ridgeline

company-logo
Ridgeline is a computer software company that specializes in software development to deliver for the investment management industry.

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Dave Blair
Chief Executive Officer
linkedin
leader-logo
Lisa Faucher
SVP, Enterprise Operations
linkedin
Company data provided by crunchbase