Infrastructure Engineer @ Replicate | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
External
0
Infrastructure Engineer jobs in United States
157 applicants
company-logo

Replicate · 4 hours ago

Infrastructure Engineer

ftfMaximize your interview chances
Artificial Intelligence (AI)Cloud Infrastructure
check
H1B Sponsor Likelynote

Insider Connection @Replicate

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

Designing and building our deployment and model-serving platform.
Building technology to operate the latest advancements in the ML and AI space.
Designing systems to maximize the utilization and reliability of our Kubernetes clusters and GPUs, including multi-regional traffic shifting and failover capabilities.
Owning and optimizing fair and reliable task allocation and queuing across a diverse set of customers with heterogeneous workloads.
Working with our Models team to speed up model inference through techniques like caching, weights management, machine configurations, and runtime optimizations in Python and PyTorch.

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

PythonKubernetesTerraformMachine LearningGoNode.jsAPI DesignRedisGoogle BigQueryPostgreSQLTask AllocationModel Inference Optimization

Required

Experience building platforms at scale.
Worked in complex systems with many moving parts; you have opinions on monoliths vs. services.
Designed and implemented developer-friendly APIs to enable scalable and reliable integration.
Hands-on experience setting up and operating Kubernetes.
A passion for building tools that empower developers.
Strong communication and collaboration skills, with the ability to understand customer needs and distill complex topics into clear, actionable insights.
At least 3 years of full time software engineering experience.

Preferred

You have worked on machine learning platform teams in the past.
You have experience working with or on teams that have put ML/AI into production, even though this role does not entail building ML models directly.
You have some exposure to serving Generative AI features where GPUs are costly commodities and workloads can take significant time to finish.

Company

Replicate

twittertwittertwitter
company-logo
Replicate lets software developers run open-source AI with an API.

H1B Sponsorship

Replicate has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (2)

Funding

Current Stage
Growth Stage
Total Funding
$57.8M
Key Investors
Andreessen HorowitzSequoia Capital
2023-12-05Series B· $40M
2023-02-21Series A· $12.5M
2023-02-21Seed· $5.3M

Leadership Team

leader-logo
Ben Firshman
Founder & CEO
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot