Straiker · 4 weeks ago
DevOps Engineer / Senior DevOps Engineer
Straiker is an AI startup backed by top Silicon Valley VCs with a mission to help enterprises embrace Gen AI by providing a layer of security, safety and trust. As a Senior DevOps Engineer at Straiker, you will be instrumental in building and maintaining the infrastructure that powers our AI detection cloud platform, ensuring high availability, scalability, and security of our AI services.
Artificial Intelligence (AI) · Network Security · Software
Responsibilities
Infrastructure as Code (IaC): Design, implement, and maintain infrastructure using tools like Terraform, CloudFormation, or Pulumi to ensure reproducible and scalable environments across development, staging, and production
CI/CD Pipeline Management: Build, optimize, and maintain continuous integration and deployment pipelines using tools like Jenkins, GitLab CI, GitHub Actions, or CircleCI to enable rapid and reliable software delivery
Container Orchestration: Deploy, manage, and scale containerized applications using Kubernetes, including cluster management, service mesh implementation, and optimization of container workloads
Cloud Architecture: Design and implement cloud-native solutions on AWS, Azure, or Google Cloud, including auto-scaling, load balancing, and disaster recovery strategies
Monitoring & Observability: Implement comprehensive monitoring, logging, and alerting solutions using tools like Prometheus, Grafana, ELK stack, or Datadog to ensure system health and performance
Security & Compliance: Implement security best practices, manage secrets and credentials, ensure compliance with industry standards, and conduct regular security audits of infrastructure
Automation & Scripting: Develop automation scripts and tools using Python, Bash, or Go to streamline operations, reduce manual tasks, and improve system reliability
AI/ML Infrastructure: Build and maintain specialized infrastructure for AI model training, fine-tuning, and deployment, including GPU cluster management and ML pipeline optimization
Performance Optimization: Analyze and optimize system performance, implement caching strategies, and ensure efficient resource utilization across all environments
Incident Response: Lead incident response efforts, perform root cause analysis, and implement preventive measures to minimize downtime and service disruptions
Collaboration: Work closely with software engineers, ML engineers, and security teams to ensure seamless integration of DevOps practices throughout the development lifecycle
Documentation: Create and maintain comprehensive documentation for infrastructure, deployment processes, and operational procedures
Qualifications
Required
Bachelor's or Master's degree in Computer Science, Engineering, or related field
3-6 years of experience in DevOps, Site Reliability Engineering, or Infrastructure Engineering roles (2+ years may be sufficient with a Master's degree)
Strong expertise in cloud platforms (AWS, Azure, or GCP) with relevant certifications preferred
Proficiency in Infrastructure as Code tools (Terraform, CloudFormation, Ansible)
Extensive experience with containerization (Docker) and orchestration (Kubernetes)
Strong scripting skills in Python, Bash, or Go
Experience with CI/CD tools and GitOps practices
Solid understanding of networking, security, and Linux system administration
Experience with monitoring and observability tools (Prometheus, Grafana, ELK, Datadog)
Strong problem-solving skills and ability to work in a fast-paced startup environment
Excellent communication skills and ability to work effectively with cross-functional teams
Preferred
Experience with AI/ML infrastructure and MLOps practices
Knowledge of service mesh technologies (Istio, Linkerd)
Experience with serverless architectures and event-driven systems
Familiarity with database administration (both SQL and NoSQL)
Experience with message queuing systems (Kafka, RabbitMQ, SQS)
Understanding of FinOps practices and cloud cost optimization
Experience with compliance frameworks (SOC2, HIPAA, GDPR)
Knowledge of chaos engineering and resilience testing practices
Contributions to open-source DevOps tools or infrastructure projects are a plus
Company
Straiker
Straiker is a software company that develops cybersecurity and AI applications via testing to identify threats and strengthen protection.
H1B Sponsorship
Straiker has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships: 2024 (1)
Funding
Current Stage: Early Stage
Total Funding: $21M
2025-03-27 · Series A · $21M
Recent News
2025-10-31
Company data provided by Crunchbase