Hugging Face · 2 months ago
Community ML Research Engineer, non-AI scientific fields - US Remote
Hugging Face is on a mission to democratize good AI, building a platform for AI builders with millions of users and organizations. The Community ML Research Engineer will bridge machine learning and scientific research, collaborating with researchers to develop ML tools and optimize data pipelines for scientific applications.
Artificial Intelligence (AI)Generative AIMachine LearningNatural Language ProcessingOpen SourceSoftware
Responsibilities
Building and optimizing datasets and data pipelines for scientific use cases, with a focus on fast, scalable reads across distributed filesystems (e.g., HPC, cloud, or hybrid environments)
Developing and adapting ML tools (not just models) to address real-world scientific challenges, from data preprocessing to model deployment
Collaborating with non-AI scientific communities to co-design solutions, publish datasets, and create open-source resources that lower the barrier to ML adoption in traditional sciences
Engaging with researchers and institutions to identify high-impact opportunities, whether through hands-on technical work or strategic partnerships
Qualification
Required
Built or optimized datasets, data pipelines, or tools for scientific applications, especially in distributed or high-performance computing environments
Worked with fast-reads, distributed storage, or large-scale data processing —bonus if you've tackled challenges like cross-filesystem data access or real-time scientific data workflows
Collaborated with non-AI research communities (e.g., biology, physics, chemistry) to translate their needs into technical solutions, whether through code, documentation, or open-source contributions
Experimented with diverse ML approaches (not just large models) to solve domain-specific problems, and enjoy iterating based on feedback from end-users
Are a technical generalist who loves both the 'weeds' (e.g., optimizing a dataset pipeline) and the 'big picture' (e.g., shaping a collaboration's long-term impact)
Thrive in fast-paced, ambiguous environments and can pivot between technical deep dives and cross-team communication in Hugging Face's decentralized culture
Believe the best solutions often come from iterative experimentation —whether it's testing a new data format, prototyping a tool, or refining a community workshop
Benefits
Reimbursement for relevant conferences, training, and education
Flexible working hours and remote options
Health, dental, and vision benefits for employees and their dependents
Flexible parental leave and paid time off
Relocation packages if necessary
Company equity as part of their compensation package
Company
Hugging Face
Hugging Face allows users to build, train, and deploy art models using the reference open source in machine learning.
H1B Sponsorship
Hugging Face has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (3)
2024 (5)
2023 (2)
2020 (2)
Funding
Current Stage
Late StageTotal Funding
$395.2MKey Investors
Salesforce VenturesLux CapitalAddition
2024-08-01Series Unknown
2024-01-16Series D
2023-08-23Series D· $235M
Recent News
The French Tech Journal
2025-12-25
The French Tech Journal
2025-12-18
MIT Technology Review
2025-12-15
Company data provided by crunchbase