Researcher III – Data Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

National Laboratory of the Rockies · 2 hours ago

Researcher III – Data Engineer

The National Laboratory of the Rockies (NLR) is a leading laboratory for energy systems research and development. They are seeking a skilled Data Engineer to design, build, and maintain scalable data pipelines and architectures to enhance analytics and data-driven decision-making capabilities in energy research.

Clean EnergyCleanTechEnergyRenewable Energy

Responsibilities

Develop bespoke applications using diverse technologies and hybrid (on-prem/cloud/HPC) solutions that meet specific requirements of our varied client projects
Design, construct, test, and maintain scalable data architectures, data lakes, databases, and datasets, as well as large-scale data processing systems
Develop high-quality software solutions to manage data workflow, optimization, and retrieval
Design and implement secure and compliant data models, ETL and analytical pipelines in a distributed and/or hybrid computing environment
Work closely with data scientists, analysts, and stakeholders to improve data collection, processing methods, and develop tailored software solutions
Ensure data quality and integrity by adhering to data privacy policies and compliance with data protection regulations
Proactively research and integrate new technologies, tools, and best practices in data and software engineering
Train and deploy machine learning models to accelerate data processes for energy research
Explore the use of large language models to enable better energy systems research
Develop documentation of data, software, processes and procedures as appropriate

Qualification

PythonSQLETL processesData managementStatistical analysisData qualityBig data toolsMachine learningInterpersonal skillsCommunication skillsProblem-solving skillsTeam collaboration

Required

Relevant PhD. Or, relevant Master's Degree and 3 or more years of experience. Or, relevant Bachelor's Degree and 5 or more years of experience
Demonstrates complete understanding and wide application of scientific technical procedures, principles, theories and concepts in the field
General knowledge of other related disciplines
Demonstrates leadership in one or more areas of team, task or project lead responsibilities
Demonstrated experience in management of projects
Very good technical writing, interpersonal and communication skills
Must meet educational requirements prior to employment start date
Familiarity with data engineering technologies (e.g., Python, SQL, ETL processes)
Extensive experience developing software and analyzing data with Python
Ability to translate technical requirements into structured data configurations
Strong understanding of data management concepts, data quality, and data cleaning
Experience with a diversity of data technologies
Strong analytical and problem-solving skills
Excellent interpersonal skills for engaging with both technical and non-technical users
Strong written and verbal communication skills for documenting workflows and explaining technical concepts
Ability to collaborate effectively in a team setting to meet project objectives

Preferred

The ideal candidate will have a strong blend of both data engineering and software engineering skills with the ability to work on hybrid tech stacks
Experience with big data (tens to hundreds of TB)
Experience with version control (git/GitHub)
Basic understanding of data management practices, such as multi-source data collection, workflow management, data storage, security, and availability, data governance & privacy
Familiarity with agile development
Software and/or data quality assurance (verification and validation, testing, etc.)
Experience with big data tools (e.g., Hadoop, Spark, Kafka, etc.), data pipelines, and software development frameworks
Experience with parallel programming (High Performance Computing experience is a plus) and hybrid computing (on prem and in the cloud)
Familiarity with cloud services (such as AWS S3/Glue/Athena/Lambda, Azure Blob Storage, GCP Storage/BigQuery), data warehousing solutions, and containerization technologies (Docker, Kubernetes)
Experience with SQL, relational databases, and NoSQL databases
Knowledge of machine learning frameworks, statistical analysis, and algorithm optimization
Experience working with meteorological data (e.g., wind or solar data in NetCDF of HDF5 format). And cloud-friendly formats, like Parquet
Excellent analytical, problem-solving, and troubleshooting skills
Strong communication skills and the ability to collaborate effectively in a multi-project environment with a multidisciplinary team
Web and API development experience is a bonus
Prior experience in the energy sector or a research environment is a plus

Benefits

Medical, dental, and vision insurance
Short- and long-term disability insurance
Pension benefits
403(b) Employee Savings Plan with employer match
Life and accidental death and dismemberment (AD&D) insurance
Personal time off (PTO) and sick leave
Paid holidays
Tuition reimbursement

Company

National Laboratory of the Rockies

company-logo
The U.S. Department of Energy's primary national laboratory for energy systems research and development.

Funding

Current Stage
Late Stage
Total Funding
$166.09M
Key Investors
US Department of EnergyARPA-E
2024-09-04Grant
2023-09-21Grant· $1M
2023-05-22Grant· $150M

Leadership Team

leader-logo
Clay Sumner
Deputy Chief Financial Officer
linkedin
leader-logo
Jennifer L.
Chief Financial Officer
linkedin
Company data provided by crunchbase