Senior Data Engineer, Data Warehouse jobs in United States

GeneDx · 9 hours ago

Senior Data Engineer, Data Warehouse

GeneDx is a company delivering personalized health insights to inform diagnosis and improve drug discovery. They are looking for a Senior Data Engineer to join their Unified Data Warehouse team, responsible for developing and optimizing data pipelines and collaborating with various stakeholders.

Analytics, Artificial Intelligence (AI), Health Care, Health Diagnostics, Machine Learning, Medical, Predictive Analytics
H1B Sponsor Likely

Responsibilities

Design, build, and maintain scalable ETL/ELT pipelines for structured and unstructured data
Contribute to and maintain the enterprise data model – the source of truth in our Snowflake warehouse
Write and optimize complex SQL queries (including window functions, temp tables, and query performance tuning) to support analytics and reporting needs
Take part in designing and maintaining the centralized model layer
Support data warehousing solutions via Snowflake + dbt
Develop automation scripts in Bash, Python, or other programming languages
Manage cloud environments (AWS, OCI) in collaboration with infrastructure teams
Maintain and optimize the Kubernetes (EKS) cluster for scalable workloads
Implement and maintain infrastructure-as-code using tools like Terraform, YAML, and Argo for reproducible and reliable deployments
Debug and troubleshoot data pipelines and data quality issues across systems
Collaborate with stakeholders of varying technical backgrounds to translate business requirements into scalable technical solutions
Be an active contributor to our ETL/ELT framework. We contribute features, fixes, and improvements almost daily. Everyone is encouraged and empowered to propose improvements and optimizations to our framework
Contribute to best practices for data modeling, governance, and quality control
Explore and recommend AI tools and modern data solutions for efficiency and automation

Qualifications

Data engineering concepts, Advanced SQL, ETL/ELT pipelines, Cloud platforms, Python, Kubernetes, Infrastructure-as-code, Git, Communication skills, Collaboration skills, Problem-solving skills

Required

Strong understanding of data engineering concepts and data warehousing fundamentals
Advanced SQL skills, including debugging and performance tuning
Proficiency in at least one general-purpose programming language (e.g., Python, Java, Scala). We use Python
Familiarity with Kimball (Dimensional) Modeling
Basic scripting knowledge (Bash) for automation and operational workflows
Familiarity with cloud platforms (AWS, GCP, or OCI)
Solid communication and collaboration skills to work effectively with technical and non-technical stakeholders
Familiarity with Git

Preferred

Experience with distributed computing frameworks such as Dask (preferred) or Spark
Hands-on experience managing and deploying workloads in Kubernetes
Exposure to infrastructure-as-code (Terraform, Helm, Argo, etc.)
Experience with any of the popular workflow orchestration systems (Airflow, Dagster, Argo Workflows, etc.)
Experience implementing Change Data Capture (CDC) pipelines
Strong debugging and problem-solving skills for troubleshooting complex data issues
Knowledge of AI tools and when to apply them in a data engineering context

Benefits

Paid Time Off (PTO)
Health, Dental, Vision and Life insurance
401k Retirement Savings Plan
Employee Discounts
Voluntary benefits

Company

GeneDx uses artificial intelligence and machine learning to analyze patient data to provide insights to transform the practice of medicine.

H1B Sponsorship

GeneDx has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. (Data from the US Department of Labor.)
Total sponsorships by year: 2024 (1)

Funding

Current Stage: Public Company
Total Funding: $941M
Key Investors: BlackRock Innovation Capital
2023-01-26 · Post-IPO Equity · $150M
2023-01-10 · IPO
2022-01-18 · Post-IPO Equity · $200M

Leadership Team

Katherine Stueland, Chief Executive Officer
Kevin Feeley, Chief Financial Officer
Company data provided by Crunchbase