Data Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

The New York Times · 22 hours ago

Data Engineer

The New York Times is committed to independent journalism and seeks to help people understand the world. They are looking for a Data Engineer to design and implement complex data pipelines, manage data storage across cloud platforms, and optimize data for analytic consumption.

Digital MediaJournalismNews
check
H1B Sponsor Likelynote

Responsibilities

Design, model, and implement complex ELT/ETL pipelines for the cleansed and curated data layers in the medallion architecture, taking full ownership of the data product's structure, partitioning, documentation, and performance characteristics
Develop advanced data transformations using dbt (data build tool) for relational data modeling and PySpark for large-scale data processing within the Lakehouse, ensuring outputs meet strict Service Level Agreements and quality standards
Collaborate across teams to define requirements and translate them into robust and scalable data models suitable for analytic consumption
Manage the physical data storage across both GCP and AWS, selecting optimal file formats and designing efficient partitioning and clustering strategies
Administer and tune Spark compute resources (e.g., Dataproc, EMR, or managed services) to optimize job execution time and cost
Own core components of our centralized analytics environment, specifically focused on Hex, integrations, and the methods of data exposure and access controls; and support data activation strategies, ensuring seamless data consumption by analytic tools
Optimize user queries and access patterns to maintain platform performance and cost efficiency
Implement centralized data quality checks and observability mechanisms within the data pipeline to proactively identify and resolve data issues
Contribute to the implementation of metadata management, data lineage, and role-based access control (RBAC) initiatives across the Lakehouse environment
Demonstrate support and understanding of our value of journalistic independence and a strong commitment to our mission to seek the truth and help people understand the world

Qualification

Data EngineeringSQLPythonCloud Data WarehouseData ModelingPySparkWorkflow OrchestrationInfrastructure-as-CodeA/B TestingData Quality Standards

Required

2+ years of hands-on experience in a Data Engineering, Data Warehousing, Analytics Engineering or equivalent role
Proficiency in SQL and experience with complex, production-level data modeling (dimensional modeling, Kimball, OBT, or Data Vault)
Demonstrated experience designing, developing, and deploying end-to-end data products through the full Software Development Lifecycle
Experience with a Cloud Data Warehouse, like BigQuery
Proficiency in Python for scripting and data manipulation, including knowledge of PySpark or other Spark APIs
Familiarity with cloud services and data storage components in at least one major cloud provider (GCP or AWS)
Experience with workflow orchestration tools (e.g., Airflow, Cloud Composer, or Prefect) and version control systems (Git)

Preferred

Experience operating in a dual-cloud environment (GCP/AWS)
Experience with Infrastructure-as-Code (IaC) tools like Terraform
Experience with advanced Lakehouse file formats like Iceberg or Delta Lake
Familiarity with experimentation or A/B testing platforms and the data required to support them
Experience in data product quality standards through integration advanced testing, quality checks, and monitoring into the CI/CD pipeline

Benefits

Medical, dental and vision benefits
Flexible Spending Accounts (F.S.A.s)
A company-matching 401(k) plan
Paid vacation
Paid sick days
Paid parental leave
Tuition reimbursement
Professional development programs

Company

The New York Times

twittertwittertwitter
company-logo
The New York Times is powered by the idea that independent, deeply reported journalism fuels a healthy and engaged society.

H1B Sponsorship

The New York Times has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (40)
2024 (20)
2023 (21)
2022 (36)
2021 (27)
2020 (36)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
William Bardeen
Chief Financial Officer
linkedin
leader-logo
Crystal Chien
VP of Engineering
linkedin
Company data provided by crunchbase