Brightstar Lottery · 9 hours ago
Cloud/Site Reliability Engineer
Brightstar Lottery is an innovative global leader in lottery solutions. They are seeking a Cloud/Site Reliability Engineer to join their Cloud Infrastructure Engineering, Operations & Automation team, focusing on building resilient systems and ensuring high availability of applications and services.
Responsibilities
Design and refine monitoring strategies using tools like Dynatrace, Prometheus, and ELK
Develop alerting standards that reduce noise and increase signal quality
Continuously improve observability to detect anomalies before they impact users
Assess application workloads key metrics for performance and reliability, together with infrastructure and middleware monitoring
Identify Public/Hybrid Cloud issues in services and resources
Correlate alerts with telemetry and logs to identify systemic issues and improvement opportunities
Work with L3 product engineers and with cloud vendors towards the resolution of the cases
Design, build, and maintain robust automation pipelines using tools such as Terraform, Ansible, Jenkins, Helm, and Bash to streamline cloud operations
Develop and implement self-healing capabilities that proactively detect and remediate issues, minimizing manual intervention and downtime
Analyze operational workflows to identify repetitive tasks and transform them into scalable, automated solutions
Collaborate with the Architecture team to enhance and enforce cloud baseline standards for consistency and reliability
Automate incident response and recovery processes leveraging tools like PagerDuty to accelerate resolution and improve system resilience
Manage Cloud infrastructure and services
Monitor and optimize Cloud resources usage
Open and manage Microsoft support tickets in collaboration with L3
Participate in 24x7 On-Call rotation with after-hours support for critical incident response
Qualification
Required
Hands-on experience in cloud operation or site reliability engineering field
Practical experience in public cloud infrastructure and services management (Azure / AWS public cloud knowledge would be preferred)
Proficiency in scripting and automation (Terraform, PowerShell, Python, Bash)
Experience with Infrastructure as Code (IaC) and GitOps principles
Hands-on experience on K8s and containers orchestration
Expertise in monitoring tools (Dynatrace, Datadog, Prometheus, ELK)
Strong analytical, troubleshooting, and communication skills
Preferred
Apply Agentic AI techniques to drive intelligent automation, optimize cloud services, accelerate troubleshooting and root-cause analysis, and enhance system resilience and recoverability
Familiarity with AI/ML Ops or AI-assisted observability tools
Thorough understanding of Java application workloads, and Java performance related topics
Deep knowledge of one programming language (Java/ Python / Go)
Strong Linux and networking skills
Understanding software architecture patterns and app-dev principles
Public cloud certifications would be considered as a plus
Experience in a 24/7 operations environment
Benefits
401(k) Savings Plan with Company contributions
Health, dental, and vision insurance
Life, accident, and disability insurance
Tuition reimbursement
Paid time off
Wellness programs
Identity theft insurance
Company
Brightstar Lottery
Brightstar Lottery is a global gaming company specializing in the design, manufacture, and marketing of electronic gaming.
Funding
Current Stage
Public CompanyTotal Funding
$1.3B2025-12-03Post Ipo Debt· $750M
2025-07-01Acquired
2024-09-10Post Ipo Debt· $551.02M
Recent News
2025-12-24
Company data provided by crunchbase