Senior Data Engineer @ Credible | Jobright.ai
Senior Data Engineer jobs in New York, NY
32 applicants · Posted by Agency
Credible · 16 hours ago

Senior Data Engineer

Computer Software

Responsibilities

Collaborate with cross-functional teams to define the data engineering strategy aligned to business objectives, including data modeling that unifies data assets across a range of source systems used to manage the operations of our partnering hospitals.
Define and execute the processes needed to develop, test, deploy, and maintain high-quality data pipelines.
Oversee the end-to-end development of data pipelines from source data extraction through to production-grade analytical dataset delivery, ensuring data quality and security throughout the pipeline.
Continuously monitor and optimize data processing performance and efficiency.
Identify and address bottlenecks, optimize query performance, and improve overall system stability.
Establish and enforce data quality management policies, data access controls, and data privacy standards.
Stay abreast of the latest developments in engineering tools and best practices.
Provide guidance to the team on technical challenges.
Maintain clear and comprehensive documentation of data pipelines, architecture, and processes to ensure knowledge sharing and team continuity.
Evaluate and manage relationships with third-party vendors and tools, making informed decisions about when to leverage external solutions.

Qualifications

Data Engineering, Python, SQL, Data Modeling, Apache Spark, Databricks, Data Architecture, Data Integration, Data Governance, Git, Machine Learning Pipelines, CI/CD Pipelines, Data Security, Data Privacy, Azure Cloud, Apache Spark Structured Streaming, MLflow, Project Management

Required

3+ years in data engineering roles in a production environment
Advanced proficiency in Python and SQL for data engineering
Up-to-date knowledge of and 1+ years of experience using Databricks for Lakehouse management
Deep understanding of data modeling, data architecture, and data integration best practices
Strong hands-on experience with Apache Spark
Familiarity with data governance, security, and privacy principles
Comfort using Git or equivalent to manage the software development life cycle
Exceptional ability to learn and use new software development techniques and tools
Ability to manage multiple projects simultaneously
High-energy, humble team player with a “get it done” attitude who seeks collaboration with colleagues

Preferred

Experience with the Azure cloud ecosystem
Experience developing production-ready, real-time machine learning model serving pipelines
Comfort developing in the Apache Spark Structured Streaming paradigm
Experience working in a private equity-backed services company
Experience deploying machine learning models with MLflow or equivalent
Experience developing CI/CD pipelines

Company

Credible

Welcome to Credible, the next generation of ATS platforms, providing employers with cutting-edge technology to find their next great hire in as little as one day.

Funding

Current Stage
Early Stage
Company data provided by Crunchbase