WebstaurantStore · 1 day ago
Site Reliability Engineer, K8s (SRE) - Remote (Select States)
Maximize your interview chances
Consumer GoodsIndustrial
Comp. & BenefitsNo H1B
Insider Connection @WebstaurantStore
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Managing on-premise clusters.
Deploying resources with a CI/CD platform (Argo-CD, Gitlab-CD, Flux, etc).
Helm and Kustomize to manage deployments.
Troubleshooting pods, nodes, deployments, etc.
Secrets management platforms such as Sealed Secrets and HashiCorp Vault.
Managing persistent storage using Rook / Ceph, NFS, etc.
Configuring ingress controllers such as HAProxy, Nginx, Traefik, etc.
Using a Service Mesh such as Istio or Consul is a plus.
Configuration management (Ansible, Terraform, etc.)
Use of observability tools (OpenTelemetry/OTEL preferred).
Programming/scripting languages, preferably Python and/or Golang.
Handling and responding to production incidents.
Operating within a Linux environment.
Maintaining version control. (We use Git, but if you’ve never used it, we can train you).
Participating in on-call rotation. (The effort we put into reliability keeps the on-call volume low).
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
3+ years in a professional systems engineering, related development, or SRE role with experience in managing on-premise clusters.
Deploying resources with a CI/CD platform (Argo-CD, Gitlab-CD, Flux, etc).
Helm and Kustomize to manage deployments.
Troubleshooting pods, nodes, deployments, etc.
Secrets management platforms such as Sealed Secrets and HashiCorp Vault.
Managing persistent storage using Rook / Ceph, NFS, etc.
Configuring ingress controllers such as HAProxy, Nginx, Traefik, etc.
Configuration management (Ansible, Terraform, etc.)
Use of observability tools (OpenTelemetry/OTEL preferred).
Programming/scripting languages, preferably Python and/or Golang.
Handling and responding to production incidents.
Operating within a Linux environment.
Maintaining version control. (We use Git, but if you’ve never used it, we can train you).
Participating in on-call rotation.
Access to a reliable and secure high-speed internet connection. Cable or fiber internet connections (at least 75mbps download/10mbps upload) are preferred.
Access to a home router and modem.
A dedicated home office space that is noise- and distraction-free.
A valid, physical address (apartment, suite, etc.). PO Boxes are not supported.
The desire and ability to work and communicate with other team members via chat, webcam, etc.
Legal residents of one of the following states: (AK, AL, AR, AZ, CT, DE, FL, GA, IA, ID, IN, KS, KY, LA, MD, ME, MI, MN, MO, MS, NC, ND, NH, NM, NV, OH, OK, PA, SC, SD, TN, TX, UT, VA, VT, WI, WV, and WY).
Preferred
Using a Service Mesh such as Istio or Consul is a plus.
Use of observability tools (OpenTelemetry/OTEL preferred).
Company
WebstaurantStore
Since WebstaurantStore's start in 2004, we have worked hard to build an innovative, easy-to-use website that meets the purchasing needs of foodservice professionals throughout the world.
Funding
Current Stage
Late StageLeadership Team
Recent News
Company data provided by crunchbase