Fluidstack · 2 days ago
Software Engineer, Infrastructure Platform
Fluidstack is building the infrastructure for abundant intelligence, partnering with top AI labs and enterprises to deliver world-class infrastructure services. The Software Engineer, Infrastructure Platform will develop foundational platforms for global infrastructure and data center operations, focusing on internal tooling and automation to streamline deployment and management processes.
Cloud ComputingCloud StorageGenerative AIGPUInformation TechnologyMachine LearningPrivate CloudSoftware
Responsibilities
Design and build our next-generation CMDB system as the authoritative source of truth for infrastructure assets, network topology, and configuration data
Create DCIM platforms for rack operations, server/GPU deployment, OS installation, quality assurance, and white-screen operations
Develop end-to-end asset lifecycle management systems covering receiving, racking, inventory, break-fix, and decommissioning workflows
Build monitoring and observability platforms integrating telemetry from BMS, EPMS, and IT devices with intelligent alarming and incident management
Create self-service portals and automation for new region bootstrap, day-2 operations, and fleet-scale management
Eliminate manual toil through workflow automation and self-service tooling that empower operations and engineering teams
Build workflow orchestration systems for complex multi-step processes spanning incident, problem, and change management
Develop digital twin visualizations and operational dashboards surfacing actionable insights; partner with data teams on analytics
Create integration layers connecting internal platforms with external vendors and third-party systems
Collaborate with data center operations, system engineering, network engineering, and security teams to understand requirements and deliver high-impact solutions
Work with product and business stakeholders to prioritize features, define roadmaps, and balance competing needs
Align with support and operations teams to ensure platforms scale with organizational growth
Evaluate build vs. buy decisions for platform components, weighing in-house development against commercial SaaS and open-source solutions for scalability, cost, and flexibility
Champion modern development practices including CI/CD, infrastructure-as-code, automated testing, and observability-first design
Participate in architecture reviews and design discussions, contributing to technical direction and standards
Foster technical excellence through code reviews, documentation, and knowledge sharing
Design high-performance, fault-tolerant systems capable of handling thousands of QPS as our infrastructure footprint expands
Build comprehensive monitoring, logging, and debugging capabilities with robust error handling
Implement data migration strategies and manage upstream/downstream dependencies carefully during platform evolution
Own projects end-to-end from concept through deployment, ensuring production readiness and operational excellence
Qualification
Required
3+ years of professional software development experience building production systems
Strong programming skills in Python, Go, or similar languages with understanding of system design patterns
Experience designing and implementing RESTful APIs, data models, and distributed systems
Proficiency with relational and NoSQL databases (PostgreSQL, Redis, etc.)
Hands-on experience with containerization (Docker) and infrastructure-as-code tools (Terraform, Ansible)
Understanding of CI/CD pipelines and modern development workflows
Solid grasp of networking fundamentals (TCP/IP, DNS, HTTP) and Linux/Unix environments
Strong problem-solving abilities with attention to scalability, reliability, and operational concerns
Excellent communication skills—able to convey technical concepts to both technical and non-technical stakeholders
Experience with CMDB systems (NetBox, Device42) or asset management platforms
Background in infrastructure automation, DevOps, or platform engineering
Familiarity with workflow orchestration frameworks (Temporal, Airflow, Camunda)
Knowledge of monitoring and observability stacks (Prometheus, Grafana, OpenTelemetry)
Experience with time-series databases and data visualization
Understanding of ITSM frameworks (ITIL) and service management practices
Experience in data center operations, facilities management, or physical infrastructure
Contributions to open-source infrastructure projects
Bachelor's degree in Computer Science or equivalent practical experience
Benefits
Retirement or pension plan, in line with local norms.
Health, dental, and vision insurance.
Generous PTO policy, in line with local norms.
Company
Fluidstack
FluidStack is an AI cloud platform for frontier labs and startups.
H1B Sponsorship
Fluidstack has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (2)
Funding
Current Stage
Growth StageTotal Funding
$450MKey Investors
Seedcamp
2026-01-22Undisclosed· $450M
2025-06-01Undisclosed
2024-10-01Private Equity
Recent News
2026-02-03
2026-01-23
2026-01-20
Company data provided by crunchbase