SIGN IN
Data Engineer, AWS & AI/ML Enablement jobs in United States
cer-icon
Apply on Employer Site
company-logo

The College Board · 3 months ago

Data Engineer, AWS & AI/ML Enablement

College Board is a nonprofit organization that provides educational opportunities and services. They are seeking a Data Engineer to design, build, and operate scalable data platforms that support analytics and AI/ML use cases, collaborating with various teams to transform raw data into actionable insights.
Education Management
check
H1B Sponsor Likelynote

Responsibilities

Design, build, and maintain scalable batch and streaming data pipelines using AWS services such as S3, Glue, Lambda, Kinesis, Step Functions, Redshift, Athena, and DynamoDB
Develop and optimize data models and complex SQL queries to support analytics, reporting, and downstream consumers
Build and operate serverless ETL frameworks for automated ingestion, transformation, and loading of structured and semi-structured data
Implement cloud-first, microservices-based architectures, ensuring high availability, performance, and cost efficiency
Ensure data quality, reliability, and observability through automated testing, validation, monitoring, and alerting
Integrate BI and analytics tool such as QuickSight to enable real-time and self-service analytics
Contribute to CI/CD pipelines, infrastructure automation, and secure development practices to deliver production-grade data systems
Partner with Data Science and AI teams to productionize ML-ready datasets, including training, evaluation, and inference data pipelines
Build and maintain feature pipelines and embedding workflows that support ML models and experimentation
Support MLOps/LLMOps workflows, including dataset versioning, experiment tracking, and capturing inference data for continuous improvement
Enable AI use cases such as recommendation systems, personalization, and retrieval-augmented generation (RAG) through robust data foundations
Apply a thoughtful approach to AI feasibility, fairness, and effectiveness, especially when working with sensitive or regulated data
Participate actively in Agile/Scrum ceremonies, design reviews, and peer code reviews
Collaborate cross-functionally with Product, UX, Infrastructure, and Security teams
Mentor junior engineers by providing guidance on data architecture, coding standards, and best practices
Produce clear documentation, runbooks, and technical guides to support long-term platform sustainability

Qualification

AWS servicesData EngineeringPythonSQLETL/ELT pipelinesMicroservices architectureCI/CDMachine LearningCommunication skillsContinuous learningCollaboration

Required

4+ years of experience in Data Engineering or Software Engineering in a production environment using AWS services such as S3, Glue, Lambda, Athena, DynamoDB, Step Functions, Redshift and Kinesis
Strong proficiency in Python and SQL, including performance tuning for large datasets
1+ years of hands-on experience designing, building, and deploying production-grade ML and generative AI solutions using AWS SageMaker and Amazon Bedrock
Experience designing and operating ETL/ELT pipelines, data models, and analytics-ready datasets
Solid understanding of cloud computing, DevOps, CI/CD, and microservices architectures
Strong security and privacy mindset, especially when working with sensitive data
Demonstrated interest in continuous learning, including keeping up with evolving data engineering and AI/ML best practices
Excellent communication skills with the ability to explain technical concepts to both technical and non-technical stakeholders
A passion for expanding educational and career opportunities and mission-driven work
Authorization to work in the United States for any employer
Curiosity and enthusiasm for emerging technologies, with a willingness to experiment with and adopt new AI-driven solutions and a comfort learning and applying new digital tools independently and proactively
Clear and concise communication skills, written and verbal
A learner's mindset and a commitment to growth: welcoming diverse perspectives, giving and receiving timely, respectful feedback, and continuously improving through iterative learning and user input
A drive for impact and excellence: solving complex problems, making data-informed decisions, prioritizing what matters most, and continuously improving through learning, user input, and external benchmarking
A collaborative and empathetic approach: working across differences, fostering trust, and contributing to a culture of shared success

Preferred

Experience with event-driven architectures and real-time analytics
Front-end or API experience (e.g., React, Node.js) is a plus
Exposure to observability and monitoring for data pipelines, including freshness, volume, and performance metrics
Experience collaborating with product managers and analytics partners to translate business requirements into well-designed data solutions

Benefits

A meaningful career
A supportive team
A comprehensive package designed to help you thrive
Fair and competitive compensation
Open, transparent conversations about compensation, benefits, and what it’s like to work at College Board

Company

The College Board

company-logo
College Board is a not-for-profit organization that clears a path for all students to own their future through the Advanced Placement Program, the SAT, Official SAT Practice on Khan Academy, BigFuture, and more.

H1B Sponsorship

The College Board has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (7)
2024 (9)
2023 (8)
2022 (12)
2021 (5)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
David Coleman
Chief Executive Officer
linkedin
leader-logo
Daniela Berger Pollack
Chief Financial Officer
linkedin
Company data provided by crunchbase