Data Scientist/Machine Learning Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Sumble · 4 months ago

Data Scientist/Machine Learning Engineer

Sumble is building a knowledge graph from web data with a focus on providing data for go-to-market teams. They are seeking a Data Scientist/Machine Learning Engineer to finetune language models, improve data quality, and push solutions into production environments.

AppsArtificial Intelligence (AI)Sales
check
H1B Sponsor Likelynote

Responsibilities

Finetuning small language models
Improving the quality of existing data using scalable approaches. Examples include: making sure URLs are associated the right company, we have the correct HQ address, we have mapped parents-subsidiary using techniques like LLM validation, SERP, and triangulating across sources
Adding new signals: this usually involves scrubbing, matching and normalizing new signals and matching to our existing ontology
Pushing solutions into production environments, which may involve touching data pipelines and/or backend systems

Qualification

Finetuning language modelsData normalizationData pipelinesLLM validationTeam collaboration

Required

Experience finetuning small language models
Ability to improve the quality of existing data using scalable approaches
Experience with scrubbing, matching, and normalizing new signals
Ability to push solutions into production environments
Familiarity with data pipelines and/or backend systems

Company

Sumble

twittertwitter
company-logo
Sumble provides a sales intelligence platform that crawls the web for technographic and contact data.

H1B Sponsorship

Sumble has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2023 (2)

Funding

Current Stage
Early Stage
Total Funding
$38.5M
Key Investors
Canaan PartnersCoatue
2025-10-22Series A· $30M
2025-01-01Seed· $8.5M
Company data provided by crunchbase