Datum · 12 hours ago
Senior Engineer - Orchestration
Datum is on a mission to help 1k clouds thrive in the AI era by unlocking internet superpowers for every builder. They are seeking a Senior Engineer to build and run critical components of the Datum Cloud control plane. The role focuses on designing and building features for Milo, their open-source business operating system, and involves extensive work with distributed systems and cloud-native infrastructure.
Financial Services, Information Services, Information Technology, Software
Responsibilities
Design, implement, and run Datum's core orchestration stack
Build customer-facing solutions to help our alt-cloud ecosystem thrive
Scale the management, monitoring, and metering of our edge locations
Partner with leadership to advance projects with key customers, partners, and suppliers
Design distributed solutions that scale from startup to hyperscale usage patterns
Implement intelligent traffic routing, load balancing, and failover
Build observability, monitoring, and diagnostic tools for complex environments
Optimize control plane performance for AI workloads and high-bandwidth applications with our network team
Drive technical networking decisions in collaboration with our open-source community
Review and mentor contributions from external developers on networking components
Maintain high code quality standards and documentation for network APIs
Represent Datum at conferences and in technical working groups
Design networking solutions that integrate seamlessly with Kubernetes and AI patterns
Build network policies and security frameworks for multi-tenant cloud environments
Implement service mesh integration and east-west traffic optimization
Ensure compatibility with major cloud provider networking services (AWS, GCP, Azure)
Qualifications
Required
6+ years of experience operating large-scale production systems on Kubernetes, with security as a first principle
Strong experience with Kubernetes patterns and APIs, including writing custom resources and controllers; exposure to kubebuilder preferred
Strong experience with distributed systems design, security, auth, consensus algorithms, async reconciliation, and fault tolerance
Experience modeling data in Kubernetes, or transferable knowledge from RDBMS, GraphQL, or information retrieval
Extensive experience with multi-cloud networking and hybrid cloud connectivity
Deep knowledge of Kubernetes networking, CNI plugins, and service mesh architectures
Experience with infrastructure as code (Flux, Terraform, Pulumi) for provisioning
Understanding of edge computing, CDN architectures, and global traffic management
Track record of contributing to or maintaining networking-focused open-source projects
Experience mentoring engineers and driving technical decision-making in teams
Understanding of open-source governance, community building, and public development
Passion for building networking tools that other developers and operators love to use
Preferred
Familiarity with SRv6, eBPF, DPDK, VPP, MPTCP, and other advanced networking technologies would be a huge plus
Languages: Go, Rust
Data: PostgreSQL, GraphQL, Elasticsearch, Meilisearch
Infrastructure: Kubernetes, Flux, Pulumi
Cloud Platforms: Cloudflare, AWS, GCP, Azure, multi-cloud networking
Monitoring: Prometheus, Grafana, OpenTelemetry, network flow analysis
Development: GitHub, CI/CD, automated testing, network simulation