Planned Systems International · 2 days ago
Data Engineer
Responsibilities
Collaborate with stakeholders, DevSecOps, and data scientists to execute on the analytics roadmap.
Understand the business domain and document requirements for data pipelines enabling descriptive, predictive, and prescriptive analytics.
Analyze and explore data from large data systems and sources, perform complex manipulations including federated joins, imputation, deduping, etc.
Use SQL, Python, and PySpark to analyze, clean, transform, and persist data from databases and various data formats.
Write and optimize SQL and SparkSQL queries against databases and Data Lakes.
Productionize data pipelines and add monitoring, support, and operational metrics.
Create visualizations and dashboards using Tableau.
Write unit, integration, and regression tests for data pipeline jobs.
Create entity-relationship diagrams (ERDs) and validate designs with prototype data models.
Collaborate with the data science team to develop optimized data models for machine learning.
Experiment with the latest technologies and present data-driven solutions to stakeholders, including executive management.
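The data-manipulation work listed above (deduping, imputation, persistence) might look like the following pandas sketch; the table, column names, and values are purely illustrative, not part of this role's actual systems:

```python
import pandas as pd

# Toy records with a duplicate row and a missing value,
# standing in for raw feed data (all names are hypothetical).
raw = pd.DataFrame({
    "customer_id": [1, 1, 2, 3],
    "region": ["east", "east", "west", "west"],
    "spend": [100.0, 100.0, None, 250.0],
})

# Dedupe on the business key, keeping the first occurrence.
clean = raw.drop_duplicates(subset=["customer_id"]).copy()

# Impute missing spend with the region-level mean.
clean["spend"] = clean["spend"].fillna(
    clean.groupby("region")["spend"].transform("mean")
)
```

The same dedupe/impute pattern carries over to PySpark via `dropDuplicates` and window functions when the data no longer fits on one machine.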
Qualifications
Required
U.S. Citizenship with the ability to obtain a U.S. Government Security Clearance
Intermediate to advanced level hands-on knowledge of SQL
3-5 years of experience with at least one major relational database: Oracle, PostgreSQL, MySQL
Hands-on experience with Python, including either the pandas DataFrame API for data manipulation or PySpark and Spark DataFrames
Experience with structured, unstructured, and semi-structured data in multiple file formats, including text, CSV, and JSON
Experience implementing various data engineering patterns
Experience with one of the major BI tools (Tableau, QlikView, Power BI, etc.)
Experience writing unit, integration, and regression tests
Understanding of the machine learning life cycle
Experience with one major notebook environment (Jupyter, Colab, Databricks, etc.) for Python
Effective communication, documentation and problem-solving skills
Ability to work in a fast-paced environment with a can-do attitude
Preferred
Experience with Databricks Unified Analytics Platform for data engineering is strongly preferred
Experience with dbt (data build tool)
Experience with AWS DMS (Database Migration Service)
Experience with Spark and Spark SQL, including the structured DataFrame API, is strongly preferred
Experience with AWS cloud and AWS big data services such as SageMaker, Athena, EMR, Glue, and S3
Experience with machine learning using Python and/or Spark libraries
Experience with big data file formats such as Parquet and Delta
Experience creating a Delta Lake or data lake for large, complex datasets
Benefits
Paid leave
Employer-sponsored group medical, dental, and vision
Short-term and long-term disability
Life insurance
AD&D coverage
Legal services
Identity theft insurance
Accident insurance
401(k) with employer contribution match
Flexible spending account
Health savings account
Professional growth opportunities through courses, certifications, and tuition reimbursement