Merge Labs · 18 hours ago
Senior/Staff Data Engineer (Scientific Data Engineer)
Merge Labs is a frontier research lab focused on bridging biological and artificial intelligence. The senior-most data engineer will define and own the data pipelines that support molecular optimization, collaborating with experimentalists and ML engineers to transform laboratory outputs into structured datasets for scientific analysis.
BiotechnologyHuman ResourcesSoftware
Responsibilities
Build and operate ingestion pipelines from laboratory instruments into centralized storage
Design schemas and metadata capture standards for experimental data
Implement post-processing pipelines that produce analysis-ready datasets for scientists
Establish monitoring, alerting, and structured logging for both pipeline and data quality
Partner with biologists to map experimental workflows to data models
Build interfaces (APIs, dashboards, and LLM-enabled tools) that make data easily accessible
Drive continuous improvement of data infrastructure as new protocols and data types emerge
Qualification
Required
5–10+ years of experience building and operating data pipelines or backend systems in production
Strong software fundamentals in Python, SQL, and data modeling
Experience designing schemas and metadata frameworks for complex, evolving datasets
Proven ability to partner with non-technical users to understand needs and ship usable systems
Comfort owning systems end-to-end—from design and implementation to deployment and monitoring
Background in computational biology, bioinformatics, or scientific data systems
Preferred
Familiarity with C++, low-latency data pipelines and on-premises deployments
Company
Merge Labs
Merge Labs provides a single API to connect software applications with multiple third-party platforms.
Funding
Current Stage
Early StageTotal Funding
$252MKey Investors
OpenAI
2026-01-15Seed· $252M
Recent News
Company data provided by crunchbase