
Cerebras · 2 days ago

Deployment Engineer, AI Inference

Cerebras Systems builds the world's largest AI chip, focusing on providing industry-leading training and inference speeds for machine learning applications. The Deployment Engineer will be responsible for deploying and operating AI inference clusters, ensuring reliable and efficient deployment of workloads across their global infrastructure.

Artificial Intelligence (AI) · Computer Hardware · Semiconductor · Software
Growth Opportunities

Responsibilities

Deploy AI inference replicas and cluster software across multiple datacenters
Operate across heterogeneous datacenter environments undergoing rapid 10x growth
Maximize capacity allocation and optimize replica placement using constraint-solver algorithms
Operate bare-metal inference infrastructure while supporting transition to K8S-based platform
Develop and extend telemetry, observability and alerting solutions to ensure deployment reliability at scale
Develop and extend a fully automated deployment pipeline to support fast software updates and capacity reallocation at scale
Translate technical and customer needs into actionable requirements for the Dev Infra, Cluster, Platform and Core teams
Stay up to date with the latest advancements in AI compute infrastructure and related technologies

Qualifications

Python · Linux systems · Docker · Kubernetes · Telemetry · On-prem compute infrastructure · Observability · Spine-leaf networking · Ownership mindset · Fast-paced environment

Required

2-5 years of experience operating on-prem compute infrastructure (ideally in Machine Learning or High-Performance Computing), or developing and managing complex AWS infrastructure for hybrid deployments
Strong proficiency in Python for automation, orchestration, and deployment tooling
Solid understanding of Linux-based systems and command-line tools
Extensive knowledge of Docker containers and container orchestration platforms such as Kubernetes (K8s)
Familiarity with spine-leaf (Clos) networking architecture
Proficiency with telemetry and observability stacks such as Prometheus, InfluxDB and Grafana
Strong ownership mindset and accountability for complex deployments
Ability to work effectively in a fast-paced environment

Company

Cerebras

Cerebras Systems delivers the world's fastest AI inference. We are powering the future of generative AI.

Funding

Current Stage: Late Stage
Total Funding: $1.82B
Key Investors: Alpha Wave Ventures · Vy Capital · Coatue

2025-12-03 · Secondary Market
2025-09-30 · Series G · $1.1B
2024-09-27 · Series Unknown

Leadership Team

Andrew Feldman
Founder and CEO
Bob Komin
Chief Financial Officer
Company data provided by Crunchbase