Alldus · 1 week ago
Staff Software Engineer, Distributed Systems (Backend Core)
Alldus is a fast-growing AI company developing advanced autonomous systems for complex domains like healthcare, legal, and finance. They are seeking a Staff Software Engineer to design and implement core distributed systems that support reliable computation at scale.
Responsibilities
You will design and build:
Core distributed services handling concurrency, coordination, and state management for their agent runtime
Custom messaging, replication, and scheduling mechanisms ensuring consistency and fault tolerance across nodes
Low-latency data and metadata stores optimized for high-throughput transactional workloads
Concurrency and synchronization primitives that make distributed execution predictable and safe
Observability and recovery mechanisms that provide deterministic replay and forensic auditing of AI decisions
Systems for high-availability deployment, cluster membership, and leader election without external dependencies
Qualifications
Required
5+ years of experience building (not just using) distributed systems, databases, or runtime infrastructure
Deep understanding of concurrency, consensus, replication, and durability, and the ability to implement them in code
Strong background in C++, Rust, or Go with emphasis on memory management, performance tuning, and correctness
Experience designing internal systems like queues, KV stores, schedulers, caching layers, or distributed file systems
Comfort working close to the metal (threads, sockets, async I/O, persistence layers)
Proven ability to reason about consistency models, CAP tradeoffs, and system invariants
Familiarity with observability and debugging of distributed systems in production
An appreciation for elegant, minimal designs and an instinct for measuring before optimizing
Preferred
Research or open-source contributions in distributed systems (e.g., databases, OS kernels, or storage engines)
Experience with Raft, Paxos, gRPC internals, or custom RPC frameworks
Prior work on systems such as Kafka, TiKV, or CockroachDB
Exposure to regulated or safety-critical environments (finance, healthcare, aerospace)