Houghton Mifflin Harcourt · 18 hours ago
Senior Analytics Engineer
Maximize your interview chances
ContentE-Learning
H1B Sponsor Likely
Insider Connection @Houghton Mifflin Harcourt
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Model raw data into clean, tested, and reusable datasets, making it easier for other stakeholders to view and understand data in a data warehouse or database. Since data models are created around business needs, the job of analytics engineers is to define the rules and requirements for the formats and attributes of data.
Translate user and product requirements into data model requirements to execute against and make critical decisions regarding the business rules and how they’re implemented.
Builds ETL pipelines that can efficiently process very large datasets.
Design, implement and maintain online and offline feature stores to support ML training and inference. Senior Analytics Engineers will be responsible for managing low latency (online) and high latency (offline) systems.
Develop and maintain data and design documentation to ensure that everyone on the team uses the same definitions and language and is executing against the same architectural vision. This involves providing identifiable and understandable descriptions of data and data system components as well as exposing them in a way for all consumers to easily comprehend. Senior analytics engineers create design and data documents and utilize them to communicate effectively with stakeholders and drive innovation.
Draft and maintain documents that describe how the data flows from data sources to consumption by visualizing them with directed acyclic graphs (DAGs). From a technical user perspective, the lineage helps them to determine the root cause of an error in the whole data flow.
Define metrics and implement tests to guarantee data meets operational and analytics needs. Responsible for implementing data quality standards —how data should be formatted, shown, and used across the organization.
Develop and maintain automation, scheduling and monitoring of processes designed to gather data from disparate sources and preparing them for data analysis.
Use CI/CD processes throughout the data model development lifecycle to develop higher quality code and data models without disruption to production.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
Over 4 years of hands-on experience in data engineering, analytics, or data science, with a strong focus on supporting data pipelines for machine learning models deployed in production environments.
Bachelor’s degree in statistics, mathematics, computer science, software engineering, or related field.
Proficient in SQL and Python.
Practical experience to handle various data orchestration tasks is required.
Data modeling: Experience developing data models for specific business processes. Familiarity with common data modeling techniques including Star Schema (Kimball’s), One Big Table (OBT) and Data Vault.
Extensive hands-on experience with tools for building data pipelines like Snowflake, Amazon Redshift, and Google BigQuery; ETL tools like AWS Glue, Talend, or others; Business Intelligence tools like Tableau, Looker, or equivalent.
Comfortable with software engineering best practices: version control (git), writing unit testing, code review, and CI/CD.
Demonstrates exceptional interpersonal and communication skills, facilitating seamless collaboration throughout the organization. Proficient in understanding and anticipating stakeholder needs, effectively engaging with key stakeholders to convey the value of analytics initiatives and align them with business objectives. Committed to fostering and maintaining positive, productive relationships with colleagues and customers.
Preferred
Master’s degree is a plus.
Experience with the ML lifecycle is preferred, in particular feature stores.
Experience with cloud-based development and infrastructure as code principles.
Company
Houghton Mifflin Harcourt
Houghton Mifflin Harcourt is a global learning company specialized in pre-K–12 education content, services, and technology solutions.
H1B Sponsorship
Houghton Mifflin Harcourt has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (6)
2022 (4)
2021 (5)
2020 (10)
Funding
Current Stage
Public CompanyTotal Funding
$800MKey Investors
ABRY Partners
2022-02-22Acquired· by Veritas Capital ($2.8B)
2015-05-29Post Ipo Debt· $800M
2013-11-22IPO· nasdaq:HMHC
Recent News
Weekly Post Gazette
2024-06-05
Weekly Post Gazette
2024-06-05
2024-04-24
Company data provided by crunchbase