Proximity Works · 4 months ago
Senior Data Scientist - LLMs, RAG & Multimodal AI (Remote | Immediate joiner)
Proximity Works is one of the world’s most ambitious AI technology companies, shaping the future of Sports, Media, and Entertainment. They are seeking a Senior Data Scientist with expertise in large language models, retrieval-augmented generation, and multimodal learning to design and optimize intelligent search systems, collaborating closely with engineering and product teams.
Responsibilities
Design, fine-tune, and optimize LLMs for applied multimodal generation use cases
Build and productionize RAG pipelines that combine embedding-based search, metadata filtering, and LLM-driven re-ranking/summarization
Apply prompt engineering, RAG techniques, and model distillation to improve grounding, reduce hallucinations, and ensure output reliability
Define and implement evaluation metrics across semantic search (nDCG, Recall@K, MRR) and generation quality (grounding accuracy, hallucination rate)
Optimize inference pipelines for latency-sensitive use cases, targeting sub-100ms responses through strategies such as token budgeting and prompt compression
Train and adapt models via transfer learning, LoRA/QLoRA, and checkpoint reloading, ensuring robust deployment in production environments
Collaborate with product and research teams to explore innovative multimodal integrations for user-facing applications
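For candidates who want a concrete reference point, the semantic search metrics named in the responsibilities (Recall@K, MRR, nDCG) can be sketched roughly as below. This is an illustrative sketch only: the function names, document IDs, and relevance labels are hypothetical and not taken from Proximity Works' systems, and nDCG is shown in its linear-gain form, which is one of several common variants.

```python
import math

def recall_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the relevant items that appear in the top-k results."""
    if not relevant_ids:
        return 0.0
    hits = sum(1 for doc_id in ranked_ids[:k] if doc_id in relevant_ids)
    return hits / len(relevant_ids)

def reciprocal_rank(ranked_ids, relevant_ids):
    """1 / rank of the first relevant result, or 0.0 if none is retrieved."""
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            return 1.0 / rank
    return 0.0

def mean_reciprocal_rank(queries):
    """MRR over an iterable of (ranked_ids, relevant_ids) pairs."""
    scores = [reciprocal_rank(ranked, relevant) for ranked, relevant in queries]
    return sum(scores) / len(scores) if scores else 0.0

def ndcg_at_k(ranked_ids, gains, k):
    """nDCG@k with linear gains; `gains` maps doc_id -> graded relevance."""
    dcg = sum(
        gains.get(doc_id, 0.0) / math.log2(rank + 1)
        for rank, doc_id in enumerate(ranked_ids[:k], start=1)
    )
    ideal_gains = sorted(gains.values(), reverse=True)[:k]
    idcg = sum(
        gain / math.log2(rank + 1)
        for rank, gain in enumerate(ideal_gains, start=1)
    )
    return dcg / idcg if idcg > 0 else 0.0

# Hypothetical example: one query, four retrieved documents, two relevant.
ranked = ["d3", "d1", "d7", "d2"]
relevant = {"d1", "d2"}
gains = {"d1": 1.0, "d2": 1.0}

print(recall_at_k(ranked, relevant, k=2))
print(mean_reciprocal_rank([(ranked, relevant)]))
print(ndcg_at_k(ranked, gains, k=4))
```

Generation-quality metrics such as grounding accuracy and hallucination rate usually depend on labeled judgments or a judge model, so they are not sketched here.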
Qualifications
Required
Deep expertise in large language models (LLMs)
Experience with retrieval-augmented generation (RAG)
Knowledge of multimodal learning
Hands-on experience in designing, fine-tuning, and optimizing large-scale language and multimodal models
Ability to productionize retrieval-augmented pipelines
Experience in developing ranking and relevance techniques
Capability to define robust evaluation frameworks
Experience in applying prompt engineering, RAG techniques, and model distillation
Knowledge of evaluation metrics across semantic search (nDCG, Recall@K, MRR) and generation quality (grounding accuracy, hallucination rate)
Experience in optimizing inference pipelines for latency-sensitive use cases
Ability to train and adapt models via transfer learning, LoRA/QLoRA, and checkpoint reloading
Experience in collaborating with product and research teams
Company
Proximity Works
We are Proximity — a global team of coders, designers, product managers, geeks and experts.
Funding
Current Stage
Growth Stage
Recent News
2025-10-07
Company data provided by Crunchbase