Liquid AI · 2 hours ago
Member of Technical Staff - ML Research Engineer, Data
Liquid AI is a company that builds general-purpose AI systems efficiently across various deployment targets. They are seeking a Senior ML Research Engineer to develop and maintain data pipelines for training foundation models, ensuring high-quality datasets for various modalities.
Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
Responsibilities
Build and maintain data processing, cleaning, filtering, and selection pipelines at scale
Create and maintain pipelines for pretraining, midtraining, SFT, and preference optimization datasets
Monitor and evaluate public datasets across text, vision, and audio domains
Create synthetic data generation and augmentation pipelines
Build crawlers to gather datasets from the web where public data is lacking
Run ablations to assess dataset quality and inform training decisions
Collaborate with pre-training, vision, and audio teams on modality-specific data needs
Qualification
Required
5+ years of relevant work experience with a B.S., 3+ years with an M.S., or 1+ year with a Ph.D
Expertise in data curation, cleaning, augmentation, and synthetic data generation
Experience with LLMs and ML frameworks (PyTorch)
Strong Python skills with emphasis on clean, scalable code
Preferred
Experience with VLMs, computer vision, or audio data pipelines
Distributed training familiarity (DeepSpeed, FSDP, Megatron-LM)
First-author publications in top ML conferences (NeurIPS, ICML, ICLR, CVPR)
Contributions to open-source projects
Benefits
We pay 100% of medical, dental, and vision premiums for employees and dependents
401(k) matching up to 4% of base pay
Unlimited PTO plus company-wide Refill Days throughout the year
Company
Liquid AI
Build efficient general-purpose AI at every scale.
H1B Sponsorship
Liquid AI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
Funding
Current Stage
Growth StageTotal Funding
$293.1MKey Investors
AMD VenturesOSS Capital L.P.
2024-12-13Series A· $250M
2023-12-01Seed· $37.5M
2023-05-05Seed· $5.6M
Recent News
2025-12-06
Digital Commerce 360
2025-11-15
Company data provided by crunchbase