Computational Linguist with Gen AI experience jobs in United States

Sigma AI · 2 weeks ago

Computational Linguist with Gen AI experience

Sigma AI is a global training data collection, preparation and annotation services company. They are seeking a Senior Computational Linguist to collaborate with the Natural Language Processing team on the design and development of AI-based solutions, focusing on evaluating and supporting Generative and Agentic AI systems.

Artificial Intelligence (AI), Data Collection and Labeling, Generative AI, Information Services, Information Technology, Machine Learning, Natural Language Processing

Responsibilities

Collaborate with the Natural Language Processing team on the design and development of AI-based solutions for clients and grant-funded projects
Evaluate and support Generative and Agentic AI systems
Design annotation workflows, create and refine guidelines and internal documentation, prototype task-specific evaluation metrics, configure annotation tools, and analyze annotator, model and system performance using real-world data
Contribute to papers and articles as needed
Drive complex projects from concept to delivery
Work cross-functionally with researchers and annotators to design innovative, rigorous, and scalable evaluation processes for LLM-powered workflows

Qualifications

Computational Linguistics, Natural Language Processing, Generative AI, Python programming, Transformer-based models, NLP libraries, Linux environments, Model evaluation methodologies, Fluent in English, Written communication, Cross-functional teamwork

Required

Master's degree (or equivalent experience) in Computational Linguistics, NLP, Linguistics, or a related field
2+ years of experience in NLP or AI projects (industry or research)
At least one year of experience with Gen AI and/or Agentic AI
Experience using and fine-tuning transformer-based language models (e.g., BERT, GPT)
Proficiency in Python programming
Proficiency with NLP and data science libraries (pandas, NumPy, scikit-learn, NLTK)
Experience with generative AI SDKs and frameworks (e.g., OpenAI, Google, Anthropic, LangChain)
Comfortable with Linux environments and Bash scripting
Experience working with public datasets (e.g., from Hugging Face or Kaggle)
Familiarity with LLM behavior, prompt-based evaluation, and generative model outputs
Comfortable with structured data formats (JSONL, CSV), Jupyter notebooks, and pandas-based analysis
Experience using Git for version control and collaborative development
Understanding of model evaluation methodologies, including human-AI comparison and red teaming
Strong written communication skills for documenting experiments and results
Experience working in cross-functional or research-oriented teams
Fluent in English

Preferred

Strong understanding of current trends and techniques in generative AI
Experience with annotation tools (e.g., Label Studio, Prodigy) and quality metrics for human data
Experience designing annotation tasks and workflows (e.g., Label Studio or similar tools)
Experience creating and curating bespoke datasets
Familiarity with evaluation challenges in creative or subjective NLP tasks
Understanding of linguistic typology, multilingual NLP, or sociolinguistic variation
Experience working in WSL environments
Experience collaborating with annotation teams and working with QA processes

Company

Sigma AI

Sigma AI is a data labeling, annotation and data collection company.

Funding

Current Stage
Late Stage

Leadership Team

Daniel Tapias
Co-Founder & CEO
Nuria Gomez Bermejo
Founder & CFO
Company data provided by Crunchbase