University of Minnesota · 5 days ago

Research Data Engineer

The University of Minnesota is a large public research university that provides advanced research computing infrastructure and expertise. They are seeking a Research Data Engineer responsible for maintaining and improving data workflows for the HBCD release process, coordinating efforts with data managers and operations staff.

Education · Higher Education · Universities
H1B Sponsored

Responsibilities

Run, maintain, and optimize the data pipelines used to produce the HBCD releases as well as aspects of the release process for ABCD and other studies
Build containers containing scientific software to process the data, execute these containers efficiently in an HPC environment, and manage data movement across distributed storage platforms
Work closely with subject matter experts, data managers and stewards, and operations staff to achieve these and other objectives. Develop new data pipelines in coordination with subject matter experts
Monitor and report the state of the data with regard to upcoming releases
Coordinate efforts with operations staff, data managers and stewards
Investigate agentic approaches to process management
Provide consultation to other teams for their data processing needs
Contribute effort to other MIDB projects’ data processing needs
Work with subject matter experts and external partners to resolve data errors, including rebuilding containers and pipelines to incorporate changes in the underlying software
Other tasks as assigned

Qualification

Linux, Cloud Computing, High Performance Computing, Python, Containers, Data Analysis, ETL, Git, R, Shell Scripting, Data Reporting, Apache Airflow, SQL, AWS S3, FreeSurfer, Machine Learning, Deep Learning

Required

BA/BS in Computer Science, Data Science, Neuroscience, or Bioinformatics plus at least four years of experience, or master's degree in Computer Science, Data Science, Neuroscience, or Bioinformatics plus at least two years of experience
Experience and a high level of comfort with Linux and the command line
Experience with various computing modalities:
Containers (e.g. Docker, Kubernetes, Apptainer/Singularity, etc.)
High Performance Computing (HPC)
Cloud Computing (AWS, Azure, OpenStack, etc.)
Programming experience in one or more of: Python, R, Shell Scripting
Experience with data in one of the following capacities: Analysis, ETL, Reporting
Experience with Git

Preferred

Experience with data workflow management and related tools (Apache Airflow, DataLad, SQL, etc.)
Familiarity with object stores (AWS S3, Ceph, etc.)
Experience working with academic PIs and research projects
Experience working with NIfTI and DICOM medical imaging data
Familiarity with neuroimaging analysis tools such as FreeSurfer, FSL, Workbench, etc.
Familiarity with machine learning and deep learning, along with annotation workflows

Benefits

Visa sponsorship, including H-1B, may be considered for qualified candidates whose visa circumstances do not require payment of the $100,000 H-1B fee.

Company

University of Minnesota

University of Minnesota is an educational institution that offers master's and doctoral degrees in medicine, law, and engineering fields.

H1B Sponsorship

University of Minnesota has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (298)
2024 (240)
2023 (289)
2022 (215)
2021 (201)
2020 (185)

Funding

Current Stage
Late Stage
Total Funding
$97.08M
Key Investors
American Academy of Orthopaedic Surgeons, National Science Foundation, U.S. Environmental Protection Agency
2023-12-01 · Grant · $0.03M
2023-05-04 · Grant · $20M
2023-04-13 · Grant · $10M

Leadership Team

Shane Stennes
Chief Sustainability Officer
Company data provided by Crunchbase