AI Infrastructure Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

StackAI · 2 months ago

AI Infrastructure Engineer

StackAI is a fast-growing Series A startup focused on shaping the future of AI in enterprise workflows. They are seeking an AI Infrastructure Engineer to design and implement scalable backend architectures, manage infrastructure reliability, and partner with ML engineers to bring models to production at scale.

AppsArtificial Intelligence (AI)Developer PlatformSoftware
check
H1B Sponsor Likelynote

Responsibilities

Design and implement scalable backend architectures for AI workloads (inference, orchestration, monitoring)
Own distributed job orchestration with Temporal and related systems
Improve data pipeline performance by designing smarter caching strategies (e.g., file deduplication, hot/cold storage, Redis caching layers) to reduce redundant compute and API calls
Build observability, monitoring, retries, and fault tolerance into all workflows
Manage infrastructure reliability, incident response, and performance
Develop tooling and platform infrastructure to support rapid growth
Partner with ML engineers to bring models to production at scale

Qualification

Backend engineeringDistributed systemsConcurrencyParallelismTemporalPythonContainersOrchestrationCloud platformsStorage systemsSoft skills

Required

4+ years of backend engineering (Python is a must)
Strong background in distributed systems, job orchestration, and task queues
Deep knowledge of concurrency, parallelism, and multithreading—including async/await, event loops, thread pools, synchronization primitives, deadlocks, and race conditions—is a must
Hands-on experience with Temporal, Redis, Airflow, Celery, RabbitMQ (or similar)
Experience with LLM serving and routing fundamentals (rate limiting, streaming, load balancing, budgets)
Comfortable with containers & orchestration: Docker, Kubernetes
Familiarity with cloud platforms (AWS/GCP) and IaC (Terraform)
Experience with multiple storage systems: S3, Postgres, MongoDB, Redis, and Elasticsearch
Track record scaling systems in startups or fast-paced environments
Understanding of deploying, monitoring, and optimizing AI/ML systems in production with strong CI/CD practices

Company

StackAI

twittertwittertwitter
company-logo
Build and deploy Enterprise-Grade AI Agents.

H1B Sponsorship

StackAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (4)

Funding

Current Stage
Growth Stage
Total Funding
$19.08M
Key Investors
Y Combinator
2024-12-16Undisclosed· $16.08M
2023-04-05Pre Seed· $3M

Leadership Team

leader-logo
Antoni Rosinol
Co-Founder & CEO
linkedin
leader-logo
Bernardo Aceituno
Co-Founder & President
linkedin
Company data provided by crunchbase