Data/Infrastructure Advocate Engineer - US Remote jobs in United States
cer-icon
Apply on Employer Site
company-logo

Hugging Face · 5 hours ago

Data/Infrastructure Advocate Engineer - US Remote

Hugging Face is a company focused on democratizing AI, building a platform for AI builders with a vast user base. The Data/Infrastructure Advocate Engineer will bridge the gap between data infrastructure and the community, promoting best practices and tools for data workflows while collaborating with various teams to enhance user experience.

AI InfrastructureArtificial Intelligence (AI)Foundational AIGenerative AIMachine LearningNatural Language ProcessingOpen SourceSoftware
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Grow and nurture the open-source data/infra community—launch initiatives, collaborate with data-focused groups, and organize events or challenges. Engage with communities like Apache Parquet, Open Tables Formats, and data engineering forums to promote best practices and Hugging Face tools
Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration—curate and showcase datasets, benchmarks, and tools like Xet
Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub's value for data workflows
Create demos, benchmarks, and tools (e.g., Colab notebooks) to illustrate best practices for data storage and versioning
Experiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering
Produce high-quality tutorials, blog posts, and videos that make complex topics accessible
Share insights on storage optimization, dataset versioning, and deduplication to empower developers
Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration
Ensure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases

Qualification

PythonData librariesStorage systemsParquetDataset versioningCommunity engagementOpen source advocacyTechnical writing

Required

Have strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3)
Are a hands-on builder who loves experimenting with data tools, storage optimization, and dataset versioning
Can clearly explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks
Are active in developer communities (GitHub, Discord, forums) and passionate about open source and knowledge sharing
Thrive in fast-moving environments and enjoy building in public to inspire others

Benefits

Health, dental, and vision benefits for employees and their dependents
Parental leave
Flexible paid time off
Reimbursement for relevant conferences, training, and education
Company equity as part of their compensation package

Company

Hugging Face

twittertwittertwitter
company-logo
Hugging Face allows users to build, train, and deploy art models using the reference open source in machine learning.

H1B Sponsorship

Hugging Face has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (3)
2024 (5)
2023 (2)
2020 (2)

Funding

Current Stage
Late Stage
Total Funding
$395.2M
Key Investors
Salesforce VenturesLux CapitalAddition
2024-08-01Series Unknown
2024-01-16Series D
2023-08-23Series D· $235M

Leadership Team

leader-logo
Clément Delangue
Co-founder & CEO
linkedin
leader-logo
Julien Chaumond
Co-founder
linkedin
Company data provided by crunchbase