Curative AI, Inc. · 6 hours ago
Senior DevOps Engineer
Curative AI, Inc. is an ambitious innovative early-stage startup revolutionizing the healthcare industry through cutting-edge AI-powered SaaS solutions. They are seeking a highly skilled and experienced Senior DevOps Engineer to build, maintain, and scale their cloud infrastructure and CI/CD pipelines, ensuring the reliability, security, and performance of their AI-powered healthcare platform.
Artificial Intelligence (AI)Cloud ComputingData VisualizationHealth CareHealth DiagnosticsMedicalMedical DevicemHealthSoftware
Responsibilities
Design, implement, and manage cloud infrastructure across multiple environments (AWS, Azure, GCP) to support our AI platform and applications
Develop and maintain CI/CD pipelines using GitHub Actions and other tools for automated build, testing, and deployment
Deploy and manage Kubernetes clusters and implement Istio for service mesh capabilities
Create and maintain Helm Charts for application deployment
Automate infrastructure deployment using Infrastructure as Code (IaC) practices with Terraform
Implement and manage DevSecOps practices, ensuring security and compliance (e.g., HIPAA, GDPR) in the deployment process
Set up end-to-end telemetry for the cloud platform and the applications on it, to monitor reliability and availability and also to troubleshoot when issues come up Monitor and optimize system performance, ensuring high availability and reliability
Collaborate with development, data science, and operations teams to streamline the release process and improve deployment efficiency
Troubleshoot and resolve infrastructure and application issues
Participate in on-call rotation
Stay up to date with the latest DevOps technologies and trends
Qualification
Required
You must currently be located in the Seattle Metro Region and able to work hybrid on-site a minimum of three days at our Bellevue location
Bachelor's degree in Computer Science or related field, or equivalent practical experience
5+ years of experience in a DevOps engineering role
Strong proficiency with at least one cloud service (AWS, Azure, GCP)
Experience with Infrastructure as Code (IaC) using Terraform
Strong experience with Kubernetes, Istio, and container orchestration
Proficiency with Docker containerization
Hands-on experience with GitHub and GitHub Actions
Strong scripting skills in Python, Bash, and/or JavaScript
Able to design and implement end-to-end telemetry across cloud infrastructure and applications, enabling comprehensive observability for system reliability, availability, and efficient root-cause analysis during incidents
Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK stack)
Experience with security best practices and compliance requirements
Knowledge of networking concepts and security protocols
Excellent problem-solving skills and a proactive attitude
Strong communication and teamwork skills
Experience working in Agile environments
Humble approach to learning, dedicated team player, and excited about innovation
Preferred
Knowledge of AI infrastructure and MLflow is a plus
Experience with healthcare or AI-related technologies is a plus
Benefits
Target Annual Performance Bonus
Equity Package: Generous equity participation in the company's future success
Comprehensive benefits package including medical, dental, vision, life and AD&D insurance; 401K; paid time off and holidays
Opportunity to work on cutting-edge AI projects and make an impact on the company's success
Chance to make a real impact on the company’s AI strategy and innovation