Data Engineer III, Amazon Leo AI Foundations jobs in United States
cer-icon
Apply on Employer Site
company-logo

Amazon · 1 week ago

Data Engineer III, Amazon Leo AI Foundations

Amazon is a leading global technology company, and they are seeking a Data Engineer III to design, implement, and operate globally distributed systems for their Leo AI Foundations. This role involves building cloud services and APIs to enable intelligent software operations across Leo devices, ensuring low-latency and scalable architectures for high-quality internet service and AI capabilities.

Artificial Intelligence (AI)DeliveryE-CommerceFoundational AIRetail
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Architect and implement a scalable, cost-optimized S3-based Data Lakehouse that unifies structured and unstructured data from disparate sources
Architect and implement a scalable, cost-performance-optimized OLAP-based analytics layer
Establish metadata management with automated data classification and lineage tracking
Design and enforce standardized data ingestion patterns with built-in quality controls and validation gates
Architect a centralized metrics repository that becomes the source of truth for all Leo metrics
Implement robust data quality frameworks with staging-first policies and automated validation pipelines
Design extensible metrics schemas that support complex analytical queries while optimizing for AI retrieval patterns
Develop intelligent orchestration for metrics generation workflows with comprehensive audit trails
Lead the design of semantic data models that balance analytical performance with AI retrieval requirements
Implement cross-domain federated query capabilities with sophisticated query optimization techniques
Architect a globally distributed vector database infrastructure capable of managing billions of embeddings with consistent sub-100ms retrieval times
Design and implement hybrid search strategies combining dense vectors with sparse representations for optimal semantic retrieval
Establish automated compliance validation frameworks ensuring data handling meets Amazon's security standards

Qualification

Data engineeringData modelingETL pipelinesSQLPythonBig data technologiesMentoring

Required

5+ years of data engineering experience
Experience with data modeling, warehousing and building ETL pipelines
Experience with SQL
Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
Experience mentoring team members on best practices

Preferred

Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
Experience operating large data warehouses

Benefits

Equity
Sign-on payments
Full range of medical, financial, and/or other benefits

Company

Amazon is a tech firm with a focus on e-commerce, cloud computing, digital streaming, and artificial intelligence.

Funding

Current Stage
Public Company
Total Funding
$8.11B
Key Investors
AmazonKleiner Perkins
2023-01-03Post Ipo Debt· $8B
2001-07-24Post Ipo Equity· $100M
1997-05-15IPO

Leadership Team

leader-logo
Douglas J. Herrington
CEO, Worldwide Amazon Stores
linkedin
leader-logo
Werner Vogels
VP & CTO
linkedin
Company data provided by crunchbase