Staff Software Engineer, Science jobs in United States
cer-icon
Apply on Employer Site
company-logo

Biohub · 10 hours ago

Staff Software Engineer, Science

Biohub is leading the new era of AI-powered biology to cure or prevent disease through its 501c3 medical research organization, with the support of the Chan Zuckerberg Initiative. As a software engineer on the Data Engineering team, you will contribute to architecture and implement data needs for platforms to enable scientists to interrogate large datasets without requiring computational expertise.

BioinformaticsBiotechnologyGenetics

Responsibilities

Own, maintain and continuously improve upon the data pipeline architecture
Design, build, and maintain robust, scalable data pipelines for ingesting, processing, and storing large volumes of structured and unstructured data
Develop and optimize ETL processes, ensuring data quality, validation, and consistency across diverse sources
Implement and manage data storage solutions, including data warehouses, data lakes, and distributed databases, ensuring secure and performant to handle massive volumes of single-cell transcriptomics data and imaging data
Monitor and troubleshoot data pipelines, build proactive exception handling, and ensure high reliability and uptime of production systems
Document processes, maintain data models, and support data governance, lineage, and compliance initiatives
Utilize modern tools and technologies, such as Argo Workflows, Kubernetes, AWS, Docker, and CI/CD pipelines
Actively contribute to team problem-solving, project planning, and process improvements with a mindset for innovation and social impact
Create user-friendly APIs to enable researchers and scientists to easily access and explore the curated data
Develop scalable, maintainable, and testable software systems and participate in team conversations and efforts on engineering excellence
Collaborate with data scientists, computational biologists, researchers, analysts, and other engineers to understand data requirements and deliver practical solutions that drive analytics, research, and AI/ML applications

Qualification

PythonAWSETLData Pipeline ArchitectureSQLDockerArgo WorkflowsData ModelingJavaCI/CDAnalytical Problem-SolvingCommunicationTeamworkSelf-Driven

Required

8+ years of experience as Software Engineer with data building data pipelines
Proficiency in programming languages (Python, Java) and SQL
Experience with big data, AWS(EC2, S3, EKS, IAM, SQS etc), Docker, and Argo Workflows
Strong data modeling, database design, and data integration skills, including ETL and pipeline orchestration tools
Strong fundamentals in systems design, data structures, algorithms, and object oriented programming principles
Experience with CI/CD, data governance, and observability/monitoring tools
Excellent communication, teamwork, and analytical problem-solving abilities
Passion for the CZI mission, innovation, and open, collaborative culture
Computer Science Engineering degree
Strong problem solving and analytical skills
Excellent written and verbal communication skills
Enthusiasm to ramp up on technologies and learn a new science domain
Must be self-driven and comfortable supporting data needs of multiple systems and products

Preferred

Experience working with Biology, Imaging or Sequencing data
Experience working with data formats related to biodata and solving challenges with that data
Experience building AI Agents related to data movement or ETL

Benefits

Provides a generous employer match on employee 401(k) contributions to support planning for the future.
Paid time off to volunteer at an organization of your choice.
Funding for select family-forming benefits.
Relocation support for employees who need assistance moving

Company

Biohub

twittertwittertwitter
company-logo
Our mission is to help scientists cure or prevent all disease.

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Rafael Gómez-Sjöberg
Chief Technology Officer
linkedin
leader-logo
Eunitz Beganovic
Executive Assistant to the Chief Legal Officer
linkedin
Company data provided by crunchbase