Member of Technical Staff - ML Research Engineer, Data jobs in United States

Liquid AI · 1 week ago

Member of Technical Staff - ML Research Engineer, Data

Liquid AI is a company spun out of MIT CSAIL that builds AI systems for various industries. They are seeking a Member of Technical Staff to build and maintain data pipelines for their foundation models, focusing on data quality and independent problem-solving.

Artificial Intelligence (AI), Foundational AI, Generative AI, Information Technology, Machine Learning
H1B Sponsor Likely

Responsibilities

Build and maintain data processing, cleaning, filtering, and selection pipelines at scale
Create and maintain pipelines for pretraining, midtraining, SFT, and preference optimization datasets
Monitor and evaluate public datasets across text, vision, and audio domains
Create synthetic data generation and augmentation pipelines
Build crawlers to gather datasets from the web where public data is lacking
Run ablations to assess dataset quality and inform training decisions
Collaborate with pre-training, vision, and audio teams on modality-specific data needs

Qualifications

Data pipeline development, Data quality management, Python, Data curation, ML frameworks (PyTorch), Synthetic data generation, Staying current with datasets, Problem-solving, Collaboration

Required

5+ years of relevant work experience with a B.S., 3+ years with an M.S., or 1+ year with a Ph.D.
Expertise in data curation, cleaning, augmentation, and synthetic data generation
Experience with LLMs and ML frameworks (PyTorch)
Strong Python skills with emphasis on clean, scalable code
Builds production data pipelines: Our team processes web-scale datasets with trillions of tokens
Understands data quality: Filtering, deduplication, augmentation, bias detection, and synthetic generation
Stays current: Public datasets drop constantly. You watch Hugging Face and arXiv, and you know when something matters
Solves problems independently to power the team

Preferred

Experience with VLMs, computer vision, or audio data pipelines
Distributed training familiarity (DeepSpeed, FSDP, Megatron-LM)
First-author publications in top ML conferences (NeurIPS, ICML, ICLR, CVPR)
Contributions to open-source projects

Benefits

We pay 100% of medical, dental, and vision premiums for employees and dependents
401(k) matching up to 4% of base pay
Unlimited PTO plus company-wide Refill Days throughout the year

Company

Liquid AI

Build efficient general-purpose AI at every scale.

H1B Sponsorship

Liquid AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships: 2025 (2)

Funding

Current Stage: Growth Stage
Total Funding: $293.1M
Key Investors: AMD Ventures, OSS Capital L.P.
2024-12-13 · Series A · $250M
2023-12-01 · Seed · $37.5M
2023-05-05 · Seed · $5.6M

Leadership Team

Ramin Hasani, Co-founder and CEO
Mathias Lechner, Co-founder and CTO
Company data provided by Crunchbase