Mindrift · 5 days ago
MCP & Tools Python Developer - Agent Evaluation Infrastructure
Mindrift is an innovative company that utilizes collective human intelligence to shape the future of AI. They are seeking a hands-on Python engineer to develop Model Context Protocol (MCP) servers and tools for evaluating agent behavior, focusing on ensuring compatibility with existing infrastructures and enhancing testing processes.
Computer Software
Responsibilities
Developing and maintaining MCP-compatible evaluation servers
Implementing logic to check agent actions against scenario definitions
Creating or extending tools that writers and QAs use to test agents
Working closely with infrastructure engineers to ensure compatibility
Occasionally helping with test writing or debug sessions when needed
Qualification
Required
4+ years of Python development experience, ideally in backend or tools
Solid experience building APIs, testing frameworks, or protocol-based interfaces
Understanding of Docker, Linux CLI, and HTTP-based communication
Ability to integrate new tools into existing infrastructures
Familiarity with how LLM agents are prompted, executed, and evaluated
Clear documentation and communication skills - you'll work with QA and writers
Preferred
Experience with Model Context Protocol (MCP) or similar structured agent-server interfaces
Knowledge of FastAPI or similar async web frameworks
Experience working with LLM logs, scoring functions, or sandbox environments
Ability to support dev environments (devcontainers, CI configs, linters)
JS experience
Benefits
Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
Participate in an advanced AI project and gain valuable experience to enhance your portfolio
Influence how future AI models understand and communicate in your field of expertise
Company
Mindrift
Welcome to Mindrift — a space where innovation meets opportunity.
Funding
Current Stage
Late StageCompany data provided by crunchbase