Khan Academy · 1 week ago
Senior Platform Engineer I, ML Data Systems (24 months fixed-term)
Khan Academy is a nonprofit with the mission to deliver a free, world-class education to anyone, anywhere. They are seeking a Senior Platform Engineer focused on ML Data Systems to evolve dataset tools for AI-based tutoring, ensuring data quality and integration with various systems.
AppsE-LearningEducationInternet
Responsibilities
Evolve and maintain pipelines for transforming raw trace data into ML-ready datasets
Clean, normalize, and enrich data while preserving semantic meaning and consistency
Prepare and format datasets for human labeling, and integrate results into ML datasets
Develop and maintain scalable ETL pipelines using Airflow, DBT, Go, and Python running on GCP
Implement automated tests and validation to detect data drift or labeling inconsistencies
Collaborate with AI engineers, platform developers, and product teams to define data strategies in support of continuously improving the quality of Khan’s AI-based tutoring
Contribute to shared tools and documentation for dataset management and AI evaluation
Inform our data governance strategies for proper data retention, PII controls/scrubbing, and isolation of particularly sensitive data such as offensive test imagery
Qualification
Required
Bachelor's or Master's degree in Computer Science, Data Engineering, related field, or equivalent professional experience
5 years of Software Engineering experience, including significant time working with large ML datasets
Strong programming skills in Go, Python, SQL, and at least one data pipeline framework (e.g., Airflow, Dagster, Prefect)
Experience with data versioning tools (e.g., DVC, LakeFS) and cloud storage systems
Familiarity with machine learning workflows — from training data preparation to evaluation
Familiarity with the architecture and operation of large language models, and a nuanced understanding of their capabilities and limitations
Attention to detail and an obsession with data quality and reproducibility
Preferred
Experience with labeling platforms (e.g., Label Studio, Scale AI, Toloka) or human-in-the-loop systems
Understanding of ML evaluation techniques, including prompt-based and generative model metrics
Exposure to MLOps practices such as model registry, feature store, or continuous evaluation
Background in education technology or other human-centered AI applications
Benefits
Ample paid time off as needed – Your well-being is a priority
8 pre-scheduled Wellness Days in 2026 occurring on a Monday or a Friday for a 3-day weekend boost
Remote-first culture - that caters to your time zone, with open flexibility as needed, at times
Generous parental leave
An exceptional team that trusts you and gives you the freedom to do your best
The chance to put your talents towards a deeply meaningful mission and the opportunity to work on high-impact products that are already defining the future of education
Opportunities to connect through affinity, ally, and social groups
401(k) + 4% matching & comprehensive insurance, including medical, dental, vision, and life
Company
Khan Academy
Khan Academy is a nonprofit organization that provides a free world class education for anyone, anywhere.
H1B Sponsorship
Khan Academy has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2023 (2)
2022 (1)
2021 (2)
2020 (5)
Funding
Current Stage
Growth StageTotal Funding
$16.21MKey Investors
Amgen FoundationOmidyar Network
2022-01-01Series A· $0.01M
2020-07-21Grant· $3M
2017-10-12Grant· $3M
Leadership Team
Recent News
Hindu Business Line
2025-12-15
Tech Funding News
2025-10-31
Company data provided by crunchbase