Data Engineer II jobs in United States
cer-icon
Apply on Employer Site
company-logo

GoGuardian · 2 hours ago

Data Engineer II

GoGuardian is a company focused on improving learning environments for K-12 education through innovative technology solutions. They are seeking a Data Engineer II to design, build, and enhance their analytics and AI/ML ecosystem, collaborating with various teams to drive data-driven products and capabilities.

E-LearningEdTechEducationInternetSoftware
check
H1B Sponsor Likelynote

Responsibilities

Design, build, and optimize ETL pipelines that power analytics, data science, and ML workflows using tools such as Databricks, PySpark, and Airflow
Develop and maintain labeling and retraining pipelines for machine learning models, ensuring quality, reproducibility, and observability
Implement and support MLOps practices, including model versioning, CI/CD for ML, and model monitoring in production environments
Collaborate with data scientists to productionize and scale model training, inference, and evaluation pipelines
Contribute to the design and evolution of the data lakehouse, including schema design, partitioning strategies, and performance optimization
Document and communicate data architecture, lineage, and dependencies to ensure transparency and maintainability across teams
Champion data quality and governance, ensuring that datasets are accurate, well-structured, and compliant with organizational standards
Leverage infrastructure-as-code and containerization to build reproducible, maintainable environments
Participate in code reviews and continuous improvement of engineering best practices within the team

Qualification

ETL pipelinesPythonSQLMLOpsDatabricksAirflowAWS servicesInfrastructure as CodeCollaboration skillsProblem-solving skillsCommunication skills

Required

Bachelor's degree in Computer Science, Engineering, or related field
2–4 years of experience building and operating large-scale data systems, ideally supporting analytics and ML workloads
Proficiency in Python and SQL, with experience in PySpark, pandas, or similar data processing frameworks
Experience with DBT
Experience with modern data warehousing and lakehouse platforms, preferably Databricks
Hands-on experience with workflow orchestration tools such as Airflow, Dagster, or Prefect
Strong understanding of data modeling, ETL design, and distributed data systems
Experience with AWS data and compute services (S3, Lambda, ECS, CloudWatch, etc.) or equivalent cloud platforms
Familiarity with MLOps concepts (e.g., feature stores, model registries, CI/CD for ML)
Experience using Infrastructure as Code, preferably Terraform
Excellent problem-solving, collaboration, and communication skills; comfortable working in a dynamic, fast-paced environment

Benefits

Complete health insurance
401(k) matching
Employee equity plan
Flexible time off
Paid holidays
Paid parental leave
Paid year-end holiday break
Work from home funds
Fertility & adoption reimbursement

Company

GoGuardian

company-logo
GoGuardian is an edtech startup that makes subscription software for K-12 schools.

H1B Sponsorship

GoGuardian has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (2)
2022 (1)
2021 (1)

Funding

Current Stage
Late Stage
Total Funding
$200M
Key Investors
Tiger Global ManagementSumeru Equity PartnersTodd Mackey
2021-08-05Private Equity· $200M
2018-05-24Private Equity
2018-02-01Secondary Market

Leadership Team

leader-logo
Rich Preece
CEO at GoGuardian
linkedin
leader-logo
Advait Shinde
Co-Founder and Exec Chairman
linkedin
Company data provided by crunchbase