Senior/Staff Data Engineer (Scientific Data Engineer) jobs in United States
This job has closed.

Merge Labs · 18 hours ago

Senior/Staff Data Engineer (Scientific Data Engineer)

Merge Labs is a frontier research lab focused on bridging biological and artificial intelligence. The senior-most data engineer will define and own the data pipelines that support molecular optimization, collaborating with experimentalists and ML engineers to transform laboratory outputs into structured datasets for scientific analysis.

Biotechnology, Human Resources, Software

Responsibilities

Build and operate ingestion pipelines from laboratory instruments into centralized storage
Design schemas and metadata capture standards for experimental data
Implement post-processing pipelines that produce analysis-ready datasets for scientists
Establish monitoring, alerting, and structured logging for both pipeline and data quality
Partner with biologists to map experimental workflows to data models
Build interfaces (APIs, dashboards, and LLM-enabled tools) that make data easily accessible
Drive continuous improvement of data infrastructure as new protocols and data types emerge

Qualifications

Data pipelines, Python, SQL, Data modeling, Schema design, Metadata frameworks, Computational biology, Bioinformatics, APIs, Dashboards, Monitoring, Collaboration

Required

5–10+ years of experience building and operating data pipelines or backend systems in production
Strong software fundamentals in Python, SQL, and data modeling
Experience designing schemas and metadata frameworks for complex, evolving datasets
Proven ability to partner with non-technical users to understand needs and ship usable systems
Comfort owning systems end-to-end—from design and implementation to deployment and monitoring
Background in computational biology, bioinformatics, or scientific data systems

Preferred

Familiarity with C++, low-latency data pipelines, and on-premises deployments

Company

Merge Labs

Merge Labs provides a single API to connect software applications with multiple third-party platforms.

Funding

Current Stage: Early Stage
Total Funding: $252M
Key Investors: OpenAI
2026-01-15 · Seed · $252M
Company data provided by Crunchbase