Senior Software Engineer, AI Runtime jobs in United States

Apollo GraphQL · 4 months ago

Senior Software Engineer, AI Runtime

Apollo GraphQL is seeking a Senior Software Engineer to enhance its AI Runtime capabilities. The role involves scaling the MCP Server and Gateway to support multi-agent workflows, ensuring reliability and performance while tackling challenges in scalability and developer experience.

Developer APIs · Developer Platform · Developer Tools · Enterprise Software · Open Source · Software
H1B Sponsor Likely

Responsibilities

Scale an enterprise AI/MCP Server and Gateway that powers multi-agent workflows across Apollo, including routing, orchestration, and integration boundaries
Implement robust server infrastructure to ensure reliability, performance, and security at scale
Build and maintain tools for agent discovery, communication, and coordination
Define deployment strategies and runtime optimizations to maximize efficiency and minimize operational overhead
Develop frameworks and patterns that enable seamless multi-agent collaboration and AI-driven orchestration
Integrate observability, logging, and monitoring for full visibility into server and agent behavior
Explore and implement AI-enhanced developer workflows to optimize orchestration and agent interactions
Collaborate with teams within our org to ensure the MCP Server meets evolving product and developer needs
Build and scale the MCP Gateway—Apollo’s routing layer for agentic workflows—ensuring tools and services can be discovered, invoked, and orchestrated reliably across diverse environments
Design and implement high-performance routing infrastructure with reliability, scalability, and security at its core
Build and maintain routing patterns and coordination mechanisms that let agents interact with the right tools at the right time
Define deployment strategies and runtime optimizations to minimize latency and operational overhead
Explore and implement AI-driven routing strategies to optimize context retrieval, reduce cost, and improve decision accuracy
Collaborate with teams across Apollo to ensure the MCP Server and Gateway integrate seamlessly with Apollo’s control plane for AI tools
Integrate observability and monitoring into the routing layer to provide full visibility into traffic flows, tool availability, and agent interactions

Qualifications

Rust programming · Distributed systems · Server architecture · Agent-to-tool orchestration · Protocol design · Runtime infrastructure · Observability frameworks · Clean code · Technical leadership · Cross-team collaboration

Required

Expertise in agent-to-tool orchestration, routing, and coordination in scalable, fault-tolerant systems
Deep expertise in the Rust programming language
Strong background in distributed systems, server architecture, and high-performance backend development
Proven experience with protocol design, message routing, and server-side orchestration frameworks
Experience building and maintaining robust runtime infrastructure that supports AI-driven workflows and enables reliable agent-to-tool interactions
Proven experience with protocol design, message routing, and building server-side frameworks that enable scalable, reliable multi-tool agent workflows
Hands-on experience with observability, monitoring, and debugging frameworks for complex systems
Passion for clean, maintainable code, high system reliability, and scalable architecture
Experience in strategic system design, making architectural trade-offs, and planning for long-term scalability and maintainability
Strong technical leadership and mentorship, including guiding junior engineers and driving engineering best practices across teams
Ability to influence cross-team architecture decisions and align engineering efforts with product and business objectives
Production ownership experience: leading incident response, debugging, and performance optimization in high-impact backend systems

Preferred

Exposure to AI/ML-enabled developer tooling or autonomous system orchestration
Familiarity with cloud-native architectures, containerization, or orchestration frameworks
Experience with performance optimization and cost-efficient scaling of high-throughput distributed systems

Company

Apollo GraphQL

Apollo GraphQL helps developers build better software with a declarative, graph-based approach to API orchestration.

H1B Sponsorship

Apollo GraphQL has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (3)
2024 (4)
2023 (2)
2022 (5)
2021 (1)

Funding

Current Stage: Late Stage
Total Funding: $152M
Key Investors: Insight Partners
2022-03-01 · Series Unknown
2021-08-17 · Series D · $130M
2019-06-12 · Series C · $22M

Leadership Team

Matt DeBergalis, CEO and Co-founder
Nick Martin, Co-founder
Company data provided by Crunchbase