Liquid AI · 1 week ago
Member of Technical Staff - ML Research Engineer, Data
Liquid AI is a company spun out of MIT CSAIL that builds AI systems for various industries. They are seeking a Member of Technical Staff to build and maintain data pipelines for their foundation models, focusing on data quality and independent problem-solving.
Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
Responsibilities
Build and maintain data processing, cleaning, filtering, and selection pipelines at scale
Create and maintain pipelines for pretraining, midtraining, SFT, and preference optimization datasets
Monitor and evaluate public datasets across text, vision, and audio domains
Create synthetic data generation and augmentation pipelines
Build crawlers to gather datasets from the web where public data is lacking
Run ablations to assess dataset quality and inform training decisions
Collaborate with pre-training, vision, and audio teams on modality-specific data needs
Qualification
Required
5+ years of relevant work experience with a B.S., 3+ years with an M.S., or 1+ year with a Ph.D
Expertise in data curation, cleaning, augmentation, and synthetic data generation
Experience with LLMs and ML frameworks (PyTorch)
Strong Python skills with emphasis on clean, scalable code
Builds production data pipelines: Our team processes web-scale dataset with trillions of tokens
Understands data quality: Filtering, deduplication, augmentation, bias detection, and synthetic generation
Stays current: Public datasets drop constantly. You watch HuggingFace, arXiv, and know when something matters
Solve problems independently to power the team
Preferred
Experience with VLMs, computer vision, or audio data pipelines
Distributed training familiarity (DeepSpeed, FSDP, Megatron-LM)
First-author publications in top ML conferences (NeurIPS, ICML, ICLR, CVPR)
Contributions to open-source projects
Benefits
We pay 100% of medical, dental, and vision premiums for employees and dependents
401(k) matching up to 4% of base pay
Unlimited PTO plus company-wide Refill Days throughout the year
Company
Liquid AI
Build efficient general-purpose AI at every scale.
H1B Sponsorship
Liquid AI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
Funding
Current Stage
Growth StageTotal Funding
$293.1MKey Investors
AMD VenturesOSS Capital L.P.
2024-12-13Series A· $250M
2023-12-01Seed· $37.5M
2023-05-05Seed· $5.6M
Recent News
2025-12-06
Digital Commerce 360
2025-11-15
Company data provided by crunchbase