SIGN IN
AI Research Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

BluePill AI · 1 day ago

AI Research Engineer

BluePill AI builds AI Consumers — digital twins of real audiences. They are seeking an AI Research Engineer who will design, experiment with, and optimize LLM-based systems to simulate human judgment and decision-making.
Artificial Intelligence (AI)MarketingDigital MarketingMarket Research

Responsibilities

Design, experiment with, and optimize LLM-based systems for simulating human judgment, preference, and decision-making
Go beyond 'prompting' — work with fine-tuning, embeddings, retrieval, memory, reasoning scaffolds, and evaluation frameworks
Design and run rigorous experiments to measure model improvements using sound experimental design and statistical analysis
Build and iterate on model evaluation pipelines to measure realism, consistency, bias, drift, calibration, and alignment with human data
Analyze LLM failure modes and edge cases, including issues related to uncertainty, truthfulness, and overconfidence, and design interventions to fix them
Translate research insights into production-ready systems used by real customers
Collaborate closely with product, behavioral science, and engineering to ship end-to-end features
Stay close to the frontier: experiment with new models, papers, and techniques — and decide what’s actually worth using

Qualification

LLMsNLPPythonExperimental designStatistical analysisNeural networksMachine learningFine-tuningPyTorchTensorFlowJAXCuriosityCollaboration

Required

Deep hands-on experience working with LLMs (OpenAI, Anthropic, open-source, or similar)
Strong intuition for how LLMs behave internally — not just how to use them
Experience building real products or systems with LLMs in production
Strong foundation in NLP, neural networks, and machine learning fundamentals
Proficiency with Python and modern ML tooling
Demonstrated proficiency in experimental design and statistical analysis for evaluating and improving models
Understanding of uncertainty estimation, calibration, and truthfulness in model outputs
Comfort moving between messy experiments and clean, scalable implementations

Preferred

Experience working in deep tech environments (hard problems, long feedback loops, non-obvious failure modes)
Experience training LLMs from scratch or at significant scale
Experience with fine-tuning, RLHF-style techniques, or large-scale evaluation systems
Familiarity with PyTorch, TensorFlow, JAX, or distributed training setups
Experience working with noisy, real-world human data
Bachelor's or Master's degree in Computer Science, AI, ML, or a related field preferred

Company

BluePill AI

twittertwitter
company-logo
BluePill builds AI consumers — digital twins of real audiences trained on real-world social, survey, and research data — that think, react, and engage like your customers.

Funding

Current Stage
Early Stage
Total Funding
$6M
Key Investors
Ubiquity Ventures
2025-11-17Seed· $6M
Company data provided by crunchbase