Applied Machine Learning Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

inference.net · 2 weeks ago

Applied Machine Learning Engineer

Inference is a company that trains and hosts specialized language models for businesses seeking high-quality AI solutions. They are looking for an Applied Machine Learning Engineer to build and improve core ML systems for their custom model training platform, ensuring model quality at scale and leading projects from data intake to trained model.

Artificial Intelligence (AI)Machine LearningSoftware
check
H1B Sponsor Likelynote

Responsibilities

Lead projects from from data intake through the full training pipeline, including processing, cleaning, and preparing datasets for model training
Build and maintain data processing pipelines for aggregating, transforming, and validating training data
Create dashboards and visualization tools to display training metrics, data quality, and model performance
Train models using our internal frameworks and iterate based on evaluation results
Develop robust benchmarks and evaluation frameworks that ensure custom models match or exceed frontier performance
Build systems to automate portions of the training workflow, reducing manual intervention and improving consistency
Take research features and ship them into production settings
Apply the latest techniques in SFT, RL, and model optimization to improve training quality and efficiency
Collaborate with infrastructure engineers to scale training across our GPU fleet
Deeply understand customer use cases to inform training strategies and surface edge cases

Qualification

PyTorchSFTRLTransformer architecturesNVIDIA GPUsETL pipelinesBenchmarksEvaluationsData visualization toolsModel distillationMultimodal modelsDistributed trainingOpen-source contributions

Required

2+ years of experience training AI models using PyTorch
Hands-on experience with post-training LLMs using SFT or RL
Strong understanding of transformer architectures and how they're trained
Experience with LLM-specific training frameworks (e.g., Hugging Face Transformers, DeepSpeed, Axolotl, or similar)
Experience training on NVIDIA GPUs
Strong data processing skills and comfortable building ETL pipelines and working with large datasets
Track record of creating benchmarks and evaluations
Ability to take research techniques and apply them to production systems

Preferred

Experience with model distillation or knowledge transfer
Experience building dashboards and data visualization tools
Familiarity with vision encoders and multimodal models
Experience with distributed training at scale
Contributions to open-source ML projects

Benefits

Equity in a high-growth startup
Comprehensive benefits

Company

inference.net

twittertwittertwitter
company-logo
Inference.net helps teams ship AI that’s faster, smarter, and dramatically more cost-efficient.

H1B Sponsorship

inference.net has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2023 (1)
2022 (1)
2021 (1)

Funding

Current Stage
Early Stage
Total Funding
unknown
2023-05-03Pre Seed
Company data provided by crunchbase