Senior Site Reliability Engineer @ Hydrolix | Jobright.ai
Senior Site Reliability Engineer jobs in Oregon, United States
53 applicants

Hydrolix · 6 hours ago

Senior Site Reliability Engineer

Cloud Data Services · Database

Responsibilities

Deploy, maintain, and ensure a highly reliable fleet of Kubernetes clusters and Hydrolix deployments across multiple cloud platforms.
Design, implement, and maintain systems and processes to enhance the reliability, availability, and performance of our services.
Build and optimize CI/CD tools and processes to ensure efficient and reliable deployments.
Develop and manage monitoring, alerting, and incident response strategies to minimize downtime and enable rapid recovery.
Conduct comprehensive root cause analyses for system failures, implementing long-term preventive measures.
Automate repetitive tasks and optimize system performance to improve operational efficiency.
Participate in a 24/7 on-call rotation, covering weekday business hours and once-monthly weekend shifts.
Work closely with software engineering, infrastructure, and product teams to integrate reliability practices into every stage of the development lifecycle.
Champion SRE best practices and foster a culture of operational excellence across the organization.
Collaborate with a distributed team of engineers worldwide to provide round-the-clock support.
Interface with customers to address and resolve reported incidents, ensuring a seamless user experience.

Qualifications

Site Reliability Engineering, Observability Tools, Cloud Platforms, Programming Languages, Linux Systems, SQL Databases

Required

Proven experience as a Site Reliability Engineer or similar role, with a history of supporting complex distributed systems.
Experience with monitoring and debugging tools like Prometheus, Vector, Grafana, Superset, or Kibana.
Proficiency in at least one major cloud platform (AWS, GCP, Azure, or Linode).
Experience with SQL databases.
Proficiency in programming languages such as Python, Go, or Rust.
Strong experience with Linux systems, including performance tuning and system-level troubleshooting.
Excellent written and verbal communication skills, with the ability to convey technical concepts clearly to diverse audiences, including customers and cross-functional teams.

Preferred

Familiarity with PostgreSQL.

Company

Hydrolix

Hydrolix is a data management company that specializes in providing solutions for storing, managing, and analyzing large-scale data.

Funding

Current Stage
Growth Stage
Total Funding
$65M
Key Investors
S3 Ventures, Nava Ventures, Wing Venture Capital
2024-05-22 · Series B · $35M
2023-06-05 · Series A · $20M
2021-02-24 · Seed · $10M

Leadership Team

Marty Kagan
Co-founder and CEO
Hasan Alayli
Co-founder and Chief Scientist
Company data provided by Crunchbase