Apply on Employer Site

techire ai · 19 hours ago

Applied Scientist, Post-training (LLM, VLM, MLLM)

San Francisco Bay Area

Full-time

Hybrid

Mid, Senior Level

$200K/yr - $320K/yr

Techire AI is a well-funded startup focused on developing domain-specific reasoning systems and agentic AI. The Applied Scientist role involves creating models that can reason and make verifiable decisions, with a focus on post-training large multimodal models and ensuring AI systems are aligned with human reasoning.

Staffing & Recruiting

Hiring Manager

Marc Powell

Responsibilities

Develop models that can reason, explain their logic, and make verifiable decisions across complex, high-stakes industries

Focus on post-training large multimodal models, applying the latest techniques in RLHF, DPO, and preference learning

Design frameworks that turn raw model potential into transparent, trustworthy intelligence

Develop and optimise post-training pipelines, implement reward modelling for reasoning depth and factual accuracy, and build evaluation frameworks for verifiable, human-aligned behaviour

Run end-to-end experiments and deploy methods directly into production

Qualification

Transformer-based model trainingPost-training alignmentPythonPyTorchReward modellingMultimodal reasoningEvaluation frameworksCuriosity about reasoning agents

Required

Background in transformer-based model training (LLM, VLM, MLLM)

Experience in post-training or alignment (RLHF, DPO, reward modelling)

Strong practical skills in Python and PyTorch

Curiosity about reasoning agents, hybrid learning, and interpretability research

Preferred

Experience in multimodal reasoning

Experience in evaluation and verification

Prior research contributions in alignment or reasoning systems

Benefits

Bonus

Stock

Benefits

Company

techire ai

Techire AI - Your Gen AI Hiring Partner.

London, GB

2-10 employees

http://www.techire.ai

Funding

Current Stage

Early Stage

Company data provided by crunchbase