Replicate · 2 hours ago
Infrastructure Engineer
Maximize your interview chances
Artificial Intelligence (AI)Cloud Infrastructure
H1B Sponsor Likely
Insider Connection @Replicate
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Designing and building our deployment and model-serving platform.
Building technology to operate the latest advancements in the ML and AI space.
Designing systems to maximize the utilization and reliability of our Kubernetes clusters and GPUs, including multi-regional traffic shifting and failover capabilities.
Owning and optimizing fair and reliable task allocation and queuing across a diverse set of customers with heterogeneous workloads.
Working with our Models team to speed up model inference through techniques like caching, weights management, machine configurations, and runtime optimizations in Python and PyTorch.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
Experience building platforms at scale.
Worked in complex systems with many moving parts; you have opinions on monoliths vs. services.
Designed and implemented developer-friendly APIs to enable scalable and reliable integration.
Hands-on experience setting up and operating Kubernetes.
A passion for building tools that empower developers.
Strong communication and collaboration skills, with the ability to understand customer needs and distill complex topics into clear, actionable insights.
At least 3 years of full time software engineering experience.
Preferred
You have worked on machine learning platform teams in the past.
You have experience working with or on teams that have put ML/AI into production, even though this role does not entail building ML models directly.
You have some exposure to serving Generative AI features where GPUs are costly commodities and workloads can take significant time to finish.
Company
Replicate
Replicate lets software developers run open-source AI with an API.
H1B Sponsorship
Replicate has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (2)
Funding
Current Stage
Growth StageTotal Funding
$57.8MKey Investors
Andreessen HorowitzSequoia Capital
2023-12-05Series B· $40M
2023-02-21Series A· $12.5M
2023-02-21Seed· $5.3M
Recent News
2024-10-25
Company data provided by crunchbase