Berkeley Lab · 3 weeks ago
Data Science Engineer
Lawrence Berkeley National Laboratory is hiring a Data Science Engineer within the Scientific Data division. The role involves developing new methods and software tools for scientific knowledge discovery using modern data management and machine learning technologies, focusing on multi-modal data modeling and analysis in bioscience research.
Research
Responsibilities
Design and develop user-friendly software packages for scientific data analysis and management
Develop machine learning and AI solutions for analysis of biological data in close collaboration with diverse teams of scientists
Work with domain experts to develop FAIR data models and management solutions for bioscience applications
Work closely with the community of developers of the Neurodata Without Borders and LinkML open source data ecosystems, as well as the Joint Genome Institute
Maintain and manage open source software products, including managing development priorities, software releases, continuous integration, and testing
Design, implement and maintain software tools for creating and running parallel data intensive analysis workflows
Design, implement and maintain high performance computing and cloud solutions for visualization and analysis of complex biological data
Train scientists and research software engineers in the use of the developed software products at workshops and conferences
Work on and resolve problems of diverse scope where analysis of data requires evaluation of identifiable factors
Demonstrate good judgment in selecting methods and techniques for obtaining solutions
Network with senior internal and external personnel in their own area of expertise
Qualification
Required
Typically requires a minimum of 5 years of related experience with a Bachelor's degree in computer science, data science, machine learning, bioinformatics, or equivalent.; or 3 years and a Master's degree; or equivalent work experience designing and developing software for data modeling or analysis
Experience developing complex software solutions
Experience testing large code bases
Experience contributing to community-driven open source software
Demonstrated experience in one or more of the following areas: machine learning, data management, scientific data analysis
Works well in a collaborative team environment
Demonstrated capability with programming languages, such as Python, C++, or Javascript
Demonstrated capability with the Git version control and continuous integration systems, such as GitHub or GitLab=
Ability to troubleshoot and solve problems of diverse scope where analysis of data requires evaluation of identifiable factors
Ability to network with senior internal and external personnel in their own area of expertise
Excellent oral and written communication skills
Demonstrated ability to work effectively as part of a cross-disciplinary team
Preferred
Master's or PhD in Computer Science or related field, with 5 or more years of professional experience designing and developing scientific data modeling or analysis software
Experience working with modern scientific data formats and database systems, such as HDF5, Zarr, MongoDB, PostgreSQL, MySQL, and Redis
Experience with Neurodata Without Borders, LinkML, or similar software ecosystems
Experience working with large biological data, such as in the areas of neurophysiology, microbiology, genomics, or protein design
Experience working with modern parallel compute technologies, such as Cloud, High-Performance Computing, containerization, and parallel programming environments (MPI, OpenMP, CUDA, or pthreads)
Experience developing web-based graphical user interfaces (GUIs) or application programming interfaces (APIs) for scientific data analysis and management
Benefits
Exceptional health and retirement benefits, including pension or 401K-style plans
A culture where you’ll belong - we are invested in our teams!
In addition to accruing vacation and sick time, we also have a Winter Holiday Shutdown every year.
Parental bonding leave (for both mothers and fathers)
Pet insurance
Company
Berkeley Lab
Berkeley Lab is a national laboratory that creates advanced new tools for scientific discovery.
H1B Sponsorship
Berkeley Lab has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (154)
2024 (159)
2023 (163)
2022 (154)
2021 (165)
2020 (107)
Funding
Current Stage
Late StageLeadership Team
Recent News
MIT Climate Portal - Massachusetts
2025-07-18
Help Net Security
2025-04-15
Company data provided by crunchbase