Central Data Platform Engineer - Software Dev Engineer I jobs in United States
cer-icon
Apply on Employer Site
company-logo

Yahoo · 18 hours ago

Central Data Platform Engineer - Software Dev Engineer I

Yahoo is an American web portal that provides various services including Yahoo Search and Yahoo Mail. They are seeking a motivated entry-level AI Engineer to join their AI & ML team, where the role involves designing and building scalable tools for data governance and integrating AI features into production systems.

EmailInternetNative AdvertisingOnline PortalsSearch EngineSocial Media
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Assist in building AI features: use Python, LLM APIs (OpenAI, Anthropic, etc.), vector embedding pipelines
Support prompt engineering and RAG workflows: design, test, iterate prompt templates, integrate vector search
Help build and maintain AI-model monitoring/observability dashboards: track model accuracy, latency, drift and work with backend engineers to integrate AI services into the product
Participate in experimenting with AI workflows: multi-agent orchestration, model fine-tuning, system prompts
Working through documents and conversations with colleagues to understand product requirements for new features
Work closely with cross-functional teams to understand product and technical roadmaps, identifying potential impacts on system operability and proposing proactive solutions for Cloud environments
Lead initiatives to enhance and optimize existing cloud infrastructure, drive improvements in scalability, efficiency, and resilience, and oversee large-scale projects related to cloud platforms, automation, and performance optimization
Foster cross-functional collaboration between development, infrastructure, and operations teams to improve the overall performance, reliability, and security of services on cloud

Qualification

PythonLarge Language ModelsCloud-native AI servicesDistributed systemsSQLData pipeline orchestrationAWSGCPInfrastructure as CodeAnalytical skillsProblem-solving skillsCross-functional collaboration

Required

A solid Computer Science foundation in data structures and algorithms, object oriented programming, and modern software engineering practices from your achievement of obtaining a degree in CS or a similar engineering pursuit
Proactive in staying updated with evolving AI trends and new LLM releases
Skilled at diagnosing and solving complex, ambiguous problems with curiosity and a product-focused mindset
Experience working with the latest Large Language Models (LLMs) and AI advancements, cloud native AI services like Sagemaker, VertexAI, LangChain, LlamaIndex, or other LLM-orchestration libraries
The ability to use an object oriented programming language like Java or C++ or scripting languages like Python or Perl, and Unix or Linux systems
Knowledge of SQL and distributed query engines (e.g., Presto, Trino, Athena, BigQuery). Familiarity with data concepts such as joins, aggregation, projection, and explosion
The ability to work with large-scale distributed systems
Strong analytical and problem-solving skills with the ability to work effectively in a cross-functional, collaborative environment

Preferred

Working knowledge of AWS and GCP cloud environments, including core data and compute services (e.g., EMR, MWAA, S3, Lambda, ECS, BigQuery, Dataproc)
Experience with data pipeline orchestration tools and frameworks such as Oozie and Airflow
Query Execution and Optimization: Designing and optimizing queries to run efficiently on platforms such as BigQuery, Hive, Pig, and Spark, ensuring high performance and scalability
Familiarity with modern data architectures, including lakehouse and Medallion design patterns
Understanding of data processing/data governance concepts
Familiarity with AI-assisted engineering tools (e.g., Cursor, MCP, Copilot, agentic AI frameworks) and emerging AI/ML technologies that enhance data engineering productivity
Experience working with IaC (eg. Terraform, Ansible)
Experience working with Infrastructure as Code (IaC) tools, such as Terraform, or CloudFormation, to automate and manage cloud infrastructure deployments and automations
Familiarity & working experience with Kubernetes and container-based orchestration

Benefits

Healthcare
A great 401k
Backup childcare
Education stipends
Much (much) more

Company

Yahoo is a technology and media company that serves users through its portfolio of digital platforms, products, and services. It is a sub-organization of Verizon Media.

H1B Sponsorship

Yahoo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (197)
2022 (646)
2021 (381)
2020 (463)

Funding

Current Stage
Public Company
Total Funding
$6.8M
Key Investors
SoftBank GroupSequoia Capital
2021-05-03Acquired
1996-04-12IPO
1995-11-30Series B· $4.8M

Leadership Team

leader-logo
Monica Vorn Mijaleski
Chief Financial Officer
linkedin
leader-logo
Mike Gupta
SVP Finance / Chief Treasury Officer
linkedin
Company data provided by crunchbase