Senior Engineer – Model Evaluation & Generative AI jobs in United States

WorkGenius Group · 11 hours ago

Senior Engineer – Model Evaluation & Generative AI

WorkGenius Group, which operates in the generative AI and applied ML space, is seeking a Senior Engineer focused on model evaluation and generative AI. The role involves designing evaluation pipelines, benchmarking models, performing error analysis, and improving model fairness and robustness.

Computer Software

Responsibilities

Design, implement, and maintain evaluation pipelines for large generative AI models
Benchmark and compare public and proprietary models, analyzing trade-offs and performance characteristics
Perform deep error analysis to identify model failure patterns and generate actionable insights
Develop methods to detect and mitigate bias, improving fairness and equity in model outputs
Conduct robustness and adversarial testing to assess resilience to noise, edge cases, and real-world variation

Qualifications

Model Evaluation, Generative AI, Machine Learning, Python, Deep Learning, Evaluation Methodologies, Experiment Design, Robustness Testing, Bias Mitigation, AI Safety

Required

Bachelor's or Master's degree in Computer Science, Machine Learning, or related field
12+ years of software or ML engineering experience
Strong proficiency in Python and deep learning frameworks (e.g., PyTorch)
Deep understanding of ML evaluation methodologies and metrics (BLEU, ROUGE, F1, perplexity, etc.)
Proven ability to design rigorous experiments, analyze results, and draw statistically sound conclusions

Company

WorkGenius Group

WorkGenius Group is the global talent solution for today's fluid labour market.

Funding

Current Stage: Growth Stage
Company data provided by Crunchbase