Sr Systems Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Lucasfilm ยท 3 months ago

Sr Systems Reliability Engineer

Lucasfilm is seeking a skilled Sr Systems Reliability Engineer to join their Skywalker Sound Development Group, which is focused on developing next-generation tools for audio soundtracks and media distribution. In this role, you will design and manage critical infrastructure, collaborate on cloud provisioning, and ensure the delivery of reliable, high-quality solutions for creative teams.

FilmSoftwareTV ProductionVideo

Responsibilities

Design, manage and maintain critical infrastructure for both software development and deployed global production resources
Collaborate on the provisioning of cloud infrastructure in AWS using Terraform to ensure consistency and scalability
Maintain and manage multiple Kubernetes clusters across both cloud and on-premise environments
Implement and enforce best practices for secure software development and deployment in alignment with industry standards
Monitor, troubleshoot, and optimize build and deployment processes to maximize efficiency and minimize downtime
Collaborate with cross-functional teams, including developers and security experts, to ensure systems meet operational requirements
Develop, maintain, and enhance CI/CD pipelines using GitLab to support build automation, unit testing, and integration testing
Continuously evaluate and implement tools and technologies to improve workflows and platform reliability

Qualification

AWSKubernetesCI/CD pipelinesTerraformPythonDockerGitLab CISecurity practicesBashProblem-solvingCollaboration

Required

BS Degree in Computer Science
5+ years of experience in DevOps, Site Reliability Engineering, or a related field
Extensive AWS knowledge: EC2, ECS/EKS, Lambda, ELB, ASGs, Route53, KMS, SSM, IAM, S3, ACM, VPC, RDS, Elasticache
Proficiency with modern observability practices: application monitoring, tracing, and profiling tools (e.g. Datadog, New Relic, OpenTelemetry, Splunk)
Proficiency with GitLab CI, Terraform, Helm, and Packer
Demonstrated experience designing and managing CI/CD pipelines for complex software platforms
In-depth knowledge of Containers and Container Orchestration technologies: Docker, Kubernetes
Experience with Terraform or other infrastructure as code tooling
Strong scripting skills in Python, Bash, or similar languages
Familiarity with modern security practices for protecting sensitive assets in distributed systems
Exceptional problem-solving skills, with a proactive and collaborative mindset

Preferred

Experience working with media and entertainment pipelines or pre-release content workflows
Proficiency with Golang, Python, or C++
Experience with modern AI/ML frameworks (e.g., TensorFlow, PyTorch, Hugging Face) and their integration into operational workflows
Knowledge of container security tools and systems, such as Falco or Aqua Security
Experience with emerging deployment systems like ArgoCD or Flux for GitOps workflows
Familiarity with serverless computing paradigms and technologies such as AWS Lambda or Google Cloud Run/Functions
Understanding of high-performance computing systems in cloud environments
Experience with administering VMWare vSphere clusters

Benefits

A bonus and/or long-term incentive units may be provided as part of the compensation package
The full range of medical, financial, and/or other benefits

Company

Lucasfilm

company-logo
Lucasfilm produces original content, postproduction effects, and audio for external clients, licensed products, and the gaming industry. It is a sub-organization of The Walt Disney Company.

Funding

Current Stage
Late Stage
Total Funding
unknown
2012-10-30Acquired

Leadership Team

leader-logo
Rob Bredow
SVP, Creative Innovation
linkedin
Company data provided by crunchbase