Applied AI Inference Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Baseten · 2 weeks ago

Applied AI Inference Engineer

Baseten powers mission-critical inference for leading AI companies and is seeking an Applied AI Inference Engineer. In this role, you will partner with customers to architect, build, and deploy high-scale production AI applications, driving impact throughout the customer journey from initial exploration to production deployment.

Artificial Intelligence (AI)Developer ToolsMachine LearningSoftwareSoftware Engineering
check
H1B Sponsor Likelynote

Responsibilities

Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects
Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey including: sales, implementation, and expansion
Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers
Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs
Own products and customer projects end-to-end, functioning as both an engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution
Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity
Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates

Qualification

PythonAI/ML pipelinesSoftware developmentProject managementTechnical communicationUser empathyProblem framing

Required

Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field
1+ years of professional work experience in a fast-paced, high-growth environment
Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python
Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment
Strong communication skills, particularly on complex technical topics

Preferred

Experience in building or optimizing AI/ML projects is highly valued

Benefits

Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Company

Baseten

twittertwittertwitter
company-logo
Baseten is an AI infrastructure company that integrates machine learning into business operations, production, and processes.

H1B Sponsorship

Baseten has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (6)
2024 (8)
2023 (1)
2020 (1)

Funding

Current Stage
Late Stage
Total Funding
$285M
Key Investors
BondGreylock
2025-09-05Series D· $150M
2025-02-19Series C· $75M
2024-03-04Series B· $40M

Leadership Team

leader-logo
Aaron Relph
Design
linkedin
Company data provided by crunchbase