Senior Business Systems Analyst (Joint Genome Institute) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Berkeley Lab · 20 hours ago

Senior Business Systems Analyst (Joint Genome Institute)

Berkeley Lab’s Joint Genome Institute is seeking a Senior Business Systems Analyst to play a critical role in transforming raw scientific outputs into high-value, AI-ready data assets. The role involves developing and maintaining a robust Data Lakehouse and leading integration efforts for scientific data to ensure it is well-structured and accessible for domain scientists and AI applications.

Research
badNo H1Bnote

Responsibilities

Analyze and evaluate complex business problems and design automated system solutions
Provide technical expertise in identifying, evaluating, and developing cost-effective systems and procedures that meet user requirements
Lead the design and implementation of data integration processes for the JGI's Data Lakehouse, ensuring large scientific datasets are structured for efficient querying and analysis
Design, build, and maintain fault-tolerant, scalable, and efficient ETL/ELT data pipelines to ingest, transform, and load genomic data and associated metadata into the Data Lakehouse
Plan and perform unit, integration, and acceptance testing
Create system specifications aligned with business requirements
Provide consultation and guidance to domain scientists and other users on the use of automated systems
Collaborate closely with cross-functional teams to resolve business and system-related issues

Qualification

Data LakehouseETL/ELT toolsData engineering languagesPythonSQLGenomics dataAnalytical skillsCommunication skillsInterpersonal skills

Required

A Bachelor's Degree (or equivalent knowledge/training) in Computer Science, Data Engineering, or a related technical field and a minimum of 8 years of demonstrated experience structuring large-scale datasets for efficient use in Data Lakehouse environments, leveraging technologies such as Parquet, Iceberg, Dremio, Spark, or similar lakehouse and data warehousing platforms or an equivalent combination of education and experience
Demonstrated proficiency with modern Extract, Transform, Load (ETL) and Extract, Load, Transform (ELT) tools and frameworks
Strong scripting skills in data engineering languages, including Python (Pandas, Polars, etc) and advanced SQL for data manipulation and performance optimization
Strong analytical skills including the ability to identify problems, troubleshoot, and demonstrate good judgement in selecting methods and techniques for obtaining solutions
Excellent oral and written communication skills, including experience organizing and presenting technical information to varying audiences
Demonstrated interpersonal skills including experience collaborating with an interdisciplinary research team

Preferred

A Master's Degree (or equivalent knowledge/training) in Computer Science, Data Engineering, or a related technical field
Experience with Data Lakehouse technologies like Dremio or Spark
Domain knowledge of genomics data

Benefits

Exceptional health and retirement benefits, including pension or 401K-style plans
A culture where you’ll belong - we are invested in our teams!
In addition to accruing vacation and sick time, we also have a Winter Holiday Shutdown every year.
Parental bonding leave (for both mothers and fathers)
Pet insurance

Company

Berkeley Lab

twittertwittertwitter
company-logo
Berkeley Lab is a national laboratory that creates advanced new tools for scientific discovery.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Mary Barnum, MBA
Business Manager, COO Office
linkedin
leader-logo
Rebecca Rishell
Deputy Chief Operating Officer
linkedin
Company data provided by crunchbase