Staff Software Engineer, Site Reliability (SRE) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Character.AI · 1 day ago

Staff Software Engineer, Site Reliability (SRE)

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. As a Staff Software Engineer in Site Reliability, you will support infrastructure with thousands of nodes and ensure the reliability, scalability, and performance of the service as the user base grows.

AppsArtificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMobile AppsSoftware
check
H1B Sponsor Likelynote

Responsibilities

Maintain production services and keep them operational
Develop tools, Instrumentation and automation to monitor and optimize the performance and reliability of our service
Develop, implement and maintain automation tools and processes to prevent and mitigate service disruptions
Collaborate with development teams to design and implement scalable, reliable systems, CI/CD processes for deployment
Establish and support SLAs and SLOs for our site
Provide system monitoring and incident alerts
Participate in on-call rotations to provide support for critical incidents and outages
Develop plans for site reliability and disaster recovery

Qualification

PythonGolangCI/CDKubernetesTerraformSQLLinuxGCPIncident managementMonitoring toolsSoft skills

Required

5+ years of experience in a development focused DevOps/SRE role within a technology organization that has significant scale
Deep experience with and proven success in developing software tools and automation wherever needed using Python and Golang
Expertise with SQL, Linux, CI/CD, Kubernetes, Terraform to support a site/application within a large multi node infrastructure and a growing user base
Experience working with multiple cloud computing platforms such as GCP is also a must
Demonstrated experience to successfully and reliably troubleshoot technical issues and challenges across a range of platforms and systems
Experience with incident management and event postmortems

Preferred

Familiarity with GPU clusters and/or HPC environments is preferred
Experience with monitoring and logging tools such as Prometheus and Grafana
Hands-on experience scaling a consumer product from early days into hypergrowth

Company

Character.AI

twittertwittertwitter
company-logo
Character.ai provides open-ended conversational applications in which users create characters and converse with them.

H1B Sponsorship

Character.AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9)
2024 (16)
2023 (6)
2022 (1)

Funding

Current Stage
Growth Stage
Total Funding
$150.08M
Key Investors
Andreessen Horowitz
2023-03-23Series A· $150M
2023-01-24Seed· $0.08M

Leadership Team

leader-logo
Karandeep Anand
CEO
linkedin
Company data provided by crunchbase