Staff Business Systems Analyst (Joint Genome Institute) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Berkeley Lab · 2 weeks ago

Staff Business Systems Analyst (Joint Genome Institute)

Lawrence Berkeley National Laboratory's Joint Genome Institute is seeking a Staff Business Systems Analyst to transform scientific data into AI-ready assets. The role involves leading data integration efforts, designing automated solutions, and collaborating with scientists to optimize data management processes.

Research
badNo H1Bnote

Responsibilities

Analyze and evaluate complex business problems and design automated system solutions
Provide technical expertise in identifying, evaluating, and developing cost-effective systems and procedures that meet user requirements
Lead the design and implementation of data integration processes for the JGI's Data Lakehouse, ensuring large scientific datasets are structured for efficient querying and analysis
Design, build, and maintain fault-tolerant, scalable, and efficient ETL/ELT data pipelines to ingest, transform, and load genomic data and associated metadata into the Data Lakehouse
Plan and perform unit, integration, and acceptance testing
Create system specifications aligned with business requirements
Provide consultation and guidance to domain scientists and other users on the use of automated systems
Collaborate closely with cross-functional teams to resolve business and system-related issues

Qualification

Data LakehouseETL/ELT toolsData engineering languagesPythonSQLData integrationAnalytical skillsGenomics knowledgeCommunication skillsInterpersonal skillsCollaboration

Required

A Bachelor's Degree (or equivalent knowledge/training) in Computer Science, Data Engineering, or a related technical field and a minimum of 8 years of demonstrated experience structuring large-scale datasets for efficient use in Data Lakehouse environments, leveraging technologies such as Parquet, Iceberg, Dremio, Spark, or similar lakehouse and data warehousing platforms or an equivalent combination of education and experience
Demonstrated proficiency with modern Extract, Transform, Load (ETL) and Extract, Load, Transform (ELT) tools and frameworks
Strong scripting skills in data engineering languages, including Python (Pandas, Polars, etc) and advanced SQL for data manipulation and performance optimization
Strong analytical skills including the ability to identify problems, troubleshoot, and demonstrate good judgement in selecting methods and techniques for obtaining solutions
Excellent oral and written communication skills, including experience organizing and presenting technical information to varying audiences
Demonstrated interpersonal skills including experience collaborating with an interdisciplinary research team

Preferred

A Master's Degree (or equivalent knowledge/training) in Computer Science, Data Engineering, or a related technical field
Experience with Data Lakehouse technologies like Dremio or Spark
Domain knowledge of genomics data

Benefits

Exceptional health and retirement benefits, including pension or 401K-style plans
A culture where you’ll belong - we are invested in our teams!
In addition to accruing vacation and sick time, we also have a Winter Holiday Shutdown every year.
Parental bonding leave (for both mothers and fathers)
Pet insurance

Company

Berkeley Lab

twittertwittertwitter
company-logo
Berkeley Lab is a national laboratory that creates advanced new tools for scientific discovery.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Mary Barnum, MBA
Business Manager, COO Office
linkedin
leader-logo
Rebecca Rishell
Deputy Chief Operating Officer
linkedin
Company data provided by crunchbase