Validate Health · 15 hours ago
Data Engineer / Scientist - for 2024 University Grads (Small, Fun Team 🤓)
Maximize your interview chances
ConsultingHealth Care
No H1B
Insider Connection @Validate Health
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Build complex data transformations and statistical models using Python, SQL, SAS.
Perform EDA, scenario modeling, forecasting and simulations on large datasets.
Ingest, transform, clean and augment internal and external data assets.
Automate data pipelines to process patient-level and aggregated public health data.
Leverage an ever evolving range of cloud services, data science libraries and database platforms, such as AWS (EC2, RDS, S3, Lambda, Redshift, Athena, Glue), Databricks/Spark, PostgreSQL, SparkSQL, Python, Pandas, PySpark, Apache Airflow, and DST.
Continuously learn by investigating new technologies and mentor other team members.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
BS or MS in Computer Science or Mathematics from a top university program.
Experience with data science concepts, building statistical models deployed on scalable data platforms.
Experience with EDA (exploratory data analysis) using a variety of Python libraries or BI platform tools.
Experience with hands on SQL, writing complex data transformation queries and optimizing for performance on large scale data.
Solid understanding of data pipelines, processing parallelization, and ETL concepts.
Experience with AWS or other PaaS cloud services (such as AWS S3, RDS, Redshift, EMR, and Lambda) are a plus.
Desire to be an expert in healthcare economics and passionate about making an impact in this field.
Preferred
Knowledge of Databricks or other Spark environments.
Familiarity of the following items are optional, but preferred: Concepts used by ETL tools such as DBT and SSIS; Apache AirFlow and other job orchestration tools; Linux command line, shell utilities and Git / GitHub; Docker and environment management.
Benefits
Fully remote team with an environment and company culture that's optimized to provide a fulfilling work experience and career growth opportunities to every team member, regardless the location
Daily whiteboard sessions
Weekly "deep thought" days
Opportunity for continuous personal growth and development through research projects and agile experimentation mindset towards product development
Quarterly personal educational goals and budget
Quarterly individual performance bonus (~10% at start, ~20% after 4 years)
Quarterly company offsite retreats and virtual town halls
Quarterly anonymous leadership feedback and evaluations
Profit sharing bonus (~5%)
Health coverage and 401K
Generous Vacation Policy