Apply on Employer Site

inference.net · 2 weeks ago

Applied Machine Learning Engineer

San Francisco

Full-time

Hybrid

Entry, Mid Level

$220K/yr - $320K/yr

2+ years exp

Inference is a company that trains and hosts specialized language models for businesses seeking high-quality AI solutions. They are looking for an Applied Machine Learning Engineer to build and improve core ML systems for their custom model training platform, ensuring model quality at scale and leading projects from data intake to trained model.

Artificial Intelligence (AI)Machine LearningSoftware

H1B Sponsor Likely

Responsibilities

Lead projects from from data intake through the full training pipeline, including processing, cleaning, and preparing datasets for model training

Build and maintain data processing pipelines for aggregating, transforming, and validating training data

Create dashboards and visualization tools to display training metrics, data quality, and model performance

Train models using our internal frameworks and iterate based on evaluation results

Develop robust benchmarks and evaluation frameworks that ensure custom models match or exceed frontier performance

Build systems to automate portions of the training workflow, reducing manual intervention and improving consistency

Take research features and ship them into production settings

Apply the latest techniques in SFT, RL, and model optimization to improve training quality and efficiency

Collaborate with infrastructure engineers to scale training across our GPU fleet

Deeply understand customer use cases to inform training strategies and surface edge cases

Qualification

PyTorchSFTRLTransformer architecturesNVIDIA GPUsETL pipelinesBenchmarksEvaluationsData visualization toolsModel distillationMultimodal modelsDistributed trainingOpen-source contributions

Required

2+ years of experience training AI models using PyTorch

Hands-on experience with post-training LLMs using SFT or RL

Strong understanding of transformer architectures and how they're trained

Experience with LLM-specific training frameworks (e.g., Hugging Face Transformers, DeepSpeed, Axolotl, or similar)

Experience training on NVIDIA GPUs

Strong data processing skills and comfortable building ETL pipelines and working with large datasets

Track record of creating benchmarks and evaluations

Ability to take research techniques and apply them to production systems

Preferred

Experience with model distillation or knowledge transfer

Experience building dashboards and data visualization tools

Familiarity with vision encoders and multimodal models

Experience with distributed training at scale

Contributions to open-source ML projects

Benefits

Equity in a high-growth startup

Comprehensive benefits

Company

inference.net

Inference.net helps teams ship AI that’s faster, smarter, and dramatically more cost-efficient.

Founded in 2023

Bozeman, Montana, USA

11-50 employees

https://usecontext.io/

H1B Sponsorship

inference.net has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (1)

2023 (1)

2022 (1)

2021 (1)

Funding

Current Stage

Early Stage

Total Funding

unknown

2023-05-03Pre Seed

Company data provided by crunchbase