LLM-GenAI Model Evaluator jobs in United States
cer-icon
Apply on Employer Site
company-logo

Pyramid Consulting, Inc · 9 hours ago

LLM-GenAI Model Evaluator

Pyramid Consulting, Inc is a leading IT Industry, and they are seeking a LLM-GenAI Model Evaluator. This role involves evaluating generative AI models and requires a strong understanding of LLMs and AI/ML frameworks.

ConsultingInformation TechnologyLegalProfessional ServicesSoftwareStaffing Agency
check
Growth Opportunities
badNo H1BnoteU.S. Citizen Onlynote

Qualification

Artificial IntelligenceMachine LearningPythonLLM understandingAI/ML frameworksData analysisModel evaluation frameworksPrompt engineeringBias detectionAdversarial testingEvaluation toolsPrompt testing tools

Required

Key Skills: - Artificial Intelligence, Machine Learning, AI/ML frameworks (PyTorch, TensorFlow, HuggingFace, LangChain)
Looking for GC and US Citizens
Strong understanding of LLMs, generative AI, and transformer-based architectures
Experience with Python, data analysis, and model evaluation frameworks
Familiarity with prompt engineering, embeddings, RLHF/RLAIF, and LLM-based scoring methods
Experience building evaluation datasets and working with annotation platforms
Understanding of safety alignment, bias detection, and adversarial testing
ML/AI frameworks: PyTorch, TensorFlow, HuggingFace, LangChain
Evaluation/annotation tools: Scale AI, GroundTruth, Labelbox, Prodigy
Prompt testing tools: Weights & Biases, MLflow, OpenAI evals, LLM-as-a-judge pipelines

Benefits

Health insurance (medical, dental, vision)
401(k) plan
Paid sick leave (depending on work location)

Company

Pyramid Consulting, Inc

company-logo
Pyramid Consulting, a global leader in workforce and technology solutions, empowers individuals and organizations to transform and thrive in the most challenging and competitive markets.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Ramesh Maturu
President and Co-Founder
linkedin
leader-logo
Manish Kaushik
Chief Financial Officer
linkedin
Company data provided by crunchbase