WorkGenius Group · 11 hours ago
Senior Engineer – Model Evaluation & Generative AI
WorkGenius Group operates in the generative AI and applied ML industry and is seeking a Senior Engineer to focus on model evaluation and generative AI. The role involves designing evaluation pipelines, benchmarking models, performing error analysis, and improving model fairness and robustness.
Computer Software
Responsibilities
Design, implement, and maintain evaluation pipelines for large generative AI models
Benchmark and compare public and proprietary models, analyzing trade-offs and performance characteristics
Perform deep error analysis to identify model failure patterns and generate actionable insights
Develop methods to detect and mitigate bias, improving fairness and equity in model outputs
Conduct robustness and adversarial testing to assess resilience to noise, edge cases, and real-world variation
Qualifications
Required
Bachelor's or Master's degree in Computer Science, Machine Learning, or related field
12+ years of software or ML engineering experience
Strong proficiency in Python and deep learning frameworks (e.g., PyTorch)
Deep understanding of ML evaluation methodologies and metrics (BLEU, ROUGE, F1, perplexity, etc.)
Proven ability to design rigorous experiments, analyze results, and draw statistically sound conclusions
Company
WorkGenius Group
WorkGenius Group is the global talent solution for today's fluid labour market.
Funding
Current Stage
Growth Stage
Company data provided by Crunchbase