4MindsAI Inc. · 15 hours ago
DevOps Engineer
4MindsAI Inc. is an enterprise AI fine-tuning platform that transforms how organizations build and operate private, domain-specific AI. The DevOps Engineer will build and maintain the infrastructure for the AI platform, design deployment pipelines, ensure system reliability, and support engineering teams in optimizing AI workloads.
Computer Software
Responsibilities
Design, implement, and maintain CI/CD pipelines for automated building, testing, and deployment of AI platform components
Manage infrastructure-as-code across AWS, GCP, Azure, and on-premises environments using Terraform, Pulumi, or similar tools
Build and maintain Kubernetes clusters optimized for AI/ML workloads, including GPU scheduling and resource management
Implement monitoring, logging, and alerting systems to ensure platform reliability and rapid incident response
Develop and enforce security best practices, including secrets management, access controls, and compliance automation
Collaborate with engineering teams to containerize applications and optimize deployment workflows
Create and maintain documentation for infrastructure, deployment procedures, and runbooks
Automate operational tasks to reduce toil and improve team velocity
Support enterprise customer deployments, including on-premises installations with unique infrastructure requirements
Optimize infrastructure costs while maintaining performance and reliability standards
Qualifications
Required
BS in Computer Science, Engineering, or related technical field
5+ years of experience in DevOps, SRE, or infrastructure engineering roles
Strong proficiency with cloud platforms (AWS, GCP, or Azure), including compute, networking, and security services
Hands-on experience with Kubernetes in production environments, including deployment, scaling, and troubleshooting
Expertise with infrastructure-as-code tools (Terraform, Pulumi, CloudFormation, or similar)
Experience building and maintaining CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)
Strong scripting skills in Python, Bash, or Go for automation
Solid understanding of networking fundamentals, including DNS, load balancing, and firewalls
Experience with monitoring and observability tools (Prometheus, Grafana, Datadog, or similar)
Ability to work autonomously and drive technical decisions in a fast-paced environment
Clear technical communication with both technical and non-technical stakeholders
Deep ownership mindset: you care about outcomes, not job titles
Preferred
MS in Computer Science, Engineering, or related technical field
7+ years of experience in DevOps, SRE, or infrastructure engineering roles
Experience supporting AI/ML infrastructure, including GPU clusters and model serving
Background with on-premises or hybrid cloud deployments for enterprise customers
Experience with data pipeline infrastructure (Kafka, Airflow, or similar)
Familiarity with security compliance frameworks (SOC 2, HIPAA, FedRAMP)
Track record of establishing DevOps practices and culture on engineering teams
Experience with service mesh technologies (Istio, Linkerd)
Contributions to open-source infrastructure projects
Previous enterprise software or B2B SaaS experience
Benefits
Comprehensive medical, dental, and vision coverage (80% employer-paid)
401(k) plan with company match
Unlimited PTO policy with a 15-day minimum
11 paid company holidays
Flexible Spending Account (FSA) and Health Savings Account (HSA) options
Annual training and certification budget
Access to online learning platforms
Conference attendance opportunities
Regular internal technical workshops and knowledge sharing sessions
Company
4MindsAI Inc.
Funding
Current Stage
Early Stage
Company data provided by Crunchbase