Member of Technical Staff – Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

CloudCruise · 14 hours ago

Member of Technical Staff – Infrastructure

CloudCruise is building a coding agent for enterprise computer automation, focusing on healthcare automation challenges. The role involves owning and optimizing distributed systems for running browser automations at scale, ensuring reliability and performance in critical healthcare processes.

Artificial Intelligence (AI)Data ManagementSaaS

Responsibilities

Dynamic EC2 provisioning with auto-scaling, multi-OS support (Linux/Windows), health monitoring, crash recovery, and priority-based dispatch across resource groups
Socket.io with Redis adapter for horizontally scalable WebSockets, custom distributed job queues with leader election and credential locking, pub/sub messaging for cross-instance communication
Evolve our single-leader dispatcher toward sharded or multi-leader architectures, implement dynamic worker provisioning based on queue depth, optimize connection pooling and caching layers
Deploy and optimize inference for vision-language models powering our agents – low latency, high throughput, cost-efficient GPU utilization
Expand our OpenTelemetry and Langfuse tracing into full metrics dashboards, alerting, and SLO tracking
Lambda functions for event processing, EC2/SSM for remote execution, S3 for artifact storage, IAM and security hardening

Qualification

Distributed systemsAWS infrastructureRedisEC2 provisioningObservabilitySoft skills

Required

You've built distributed systems that handle real scale – worker orchestration, job queues, leader election
You're fluent in Redis as more than a cache: pub/sub, distributed locks, state management
You've operated production AWS infrastructure (EC2, Lambda, SSM) and understand the cost/reliability tradeoffs
You care about observability – you've built dashboards, set up alerting, and debugged production issues with traces
You're the person who sees 'custom job queue' and immediately thinks about failure modes

Benefits

Meaningful equity

Company

CloudCruise

twittertwittertwitter
company-logo
Cloud Cruise provides AI enabled software to automate repetitive tasks like data prospecting and data extraction

Funding

Current Stage
Early Stage
Total Funding
$0.5M
Key Investors
Y Combinator
2024-04-03Pre Seed· $0.5M
Company data provided by crunchbase