Site Reliability Engineer @ Voltage Park | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
External
0
Site Reliability Engineer jobs in San Francisco, CA
Be an early applicantLess than 25 applicants
company-logo

Voltage Park · 7 hours ago

Site Reliability Engineer

ftfMaximize your interview chances
Cloud ComputingMachine Learning
badNo H1Bnote

Insider Connection @Voltage Park

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

At the direction of the Manager of Site Reliability Engineering, design, build, and roll out new platforms and patterns to minimize incidents and enable customer facing and internal features.
Deploy updates and improvements to support both Voltage Park’s internal and end customer use cases.
Collaborate with colleagues in network engineering, software development, and customer support in a flat organization.
Participate in the SRE on-call rotation (1 week on, 5+ weeks off).

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

LinuxAWSKubernetesTerraformAnsibleNetwork Attached StoragePrometheusELK StackPythonNetworking FundamentalsComplex Systems ArchitectureBare Metal ProvisioningGPU ServersNetwork SwitchesGitOpsGoBashRoutersFirewallsDocumentation

Required

8+ years working with Linux as a server / hosting platform, extra points for Ubuntu experience.
5+ years experience with AWS.
2+ years experience with Kubernetes and strong container fundamentals.
2+ years experience with Terraform and Ansible.
2+ years with network attached storage management (via NFS, ceph, or other protocols). Extra points for experience with VAST storage systems.
Experience working in a Slack-first, asynchronous remote work environment.
Experience with monitoring systems (Prometheus, ELK stack).
Familiarity with the gitops workflow.
Software development experience using Python, Go, bash, or other languages for the purposes of automation & connecting systems & APIs together.
Deep networking fundamentals, extra points for experience with datacenter level networks, 400Gb ethernet, and Infiniband.
Experience architecting, building, and delivering complex systems from 0 to 1.
Adept at balancing pragmatic development and ideal architectures. Effective at navigating tradeoffs between design, risk, cost, and outcomes.
Comfortable with navigating ambiguity.
Strong written and oral communication.

Preferred

Experience with bare metal hardware troubleshooting and provisioning, extra points for working with Dell hardware.
Experience with GPU servers, both in bare metal form or under virtualization.
Deep experience with network switches, routers, and firewalls, particularly SONiC switches, Palo Alto firewalls.
Experience with VAST storage systems.

Company

Voltage Park

twittertwitter
company-logo
Voltage Park provides infrastructure for machine learning.

Funding

Current Stage
Early Stage
Total Funding
$500M
2023-10-30Undisclosed· $500M

Leadership Team

leader-logo
Eric Park
Chief Executive Officer
linkedin
leader-logo
Mike Xia
Chief Product Officer
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot