Sully.ai · 1 day ago
Applied Research Scientist
Sully.ai is on a mission to revolutionize healthcare by building AI teammates that support clinicians. They are seeking an Applied Research Scientist to develop and scale automated evaluation pipelines for their AI models, focusing on improving clinical outcomes and operational efficiency.
Artificial Intelligence (AI)Health CareHospitalMachine LearningSoftware
Responsibilities
Build and scale automated evaluation pipelines (LLM-as-judge + human review) with clinical-grade benchmarks
Audit existing evaluation approaches for clinical and agentic tasks
Define initial benchmarks and build early automated pipelines
Partner with engineering to land first set of CI gates for accuracy, factuality, and safety
Deliver a repeatable evaluation framework with automated pipelines in production
Demonstrate measurable improvements in robustness, hallucination reduction, or safety
Publish or present internal research findings that directly shape product reliability
Qualification
Required
Proven experience designing agentic processes and LLM evaluation/benchmarking frameworks
Strong Python and ML background (PyTorch/TensorFlow, Hugging Face, LangChain/LlamaIndex)
Demonstrated ability to design rigorous experiments and translate findings into production
Track record of published research or deep applied work in LLMs and agent evaluation
Strong communication and technical writing skills to articulate complex findings clearly
Company
Sully.ai
AutonomousOS for healthcare organizations
Funding
Current Stage
Growth StageTotal Funding
$29.96MKey Investors
Amity Ventures
2025-01-24Series A· $21.83M
2024-04-01Seed· $8.13M
Recent News
2026-01-13
Company data provided by crunchbase