Xyant Services · 1 day ago
Site Reliabilirty (Only Locals)
Xyant Services is seeking a Site Reliability Engineer to support their cloud-based services. The role involves deployment, capacity management, observability, and performance tuning while collaborating with security architecture teams and supporting engineering teams onboarding to new services.
Responsibilities
Perform SRE roles including deployment, capacity management, observability, and performance tuning
Collaborate with our Security Architecture team to define attestation for a variety of workloads spanning multiple compute platforms
Support engineering teams who will be onboarding to this new service
Qualification
Required
Proficiency in operating and supporting cloud-based services using IaC (infrastructure as code, Terraform)
Proven experience as a Service reliability engineer
Experience with CI/CD processes and source control mechanisms (GitHub)
Knowledge of federated trust models for identity and security
Understanding and use of public cloud infrastructure (AWS, Azure, GCP)
Strong focus on prioritizing customer experience and support
Ability to communicate clearly and efficiently with customers and leadership
Experience working with large enterprises with heterogeneous compute platforms
Generalist SRE profile
Strong AI-related skillset
Hands-on experience with Terraform
Containerized workloads (kubernetes)
Experience building and maintaining CI/CD pipelines (GitHub)
Proficient in monitoring and observability tools
Company
Xyant Services
Xyant is an AI innovation lab and data engineering company developing secure, production-grade Generative and Agentic AI systems and AI-ready products tailored to regulated industries.