Amazon · 1 week ago
Data Engineer III, Amazon Leo AI Foundations
Amazon is a leading global technology company, and they are seeking a Data Engineer III to design, implement, and operate globally distributed systems for their Leo AI Foundations. This role involves building cloud services and APIs to enable intelligent software operations across Leo devices, ensuring low-latency and scalable architectures for high-quality internet service and AI capabilities.
Artificial Intelligence (AI)DeliveryE-CommerceFoundational AIRetail
Responsibilities
Architect and implement a scalable, cost-optimized S3-based Data Lakehouse that unifies structured and unstructured data from disparate sources
Architect and implement a scalable, cost-performance-optimized OLAP-based analytics layer
Establish metadata management with automated data classification and lineage tracking
Design and enforce standardized data ingestion patterns with built-in quality controls and validation gates
Architect a centralized metrics repository that becomes the source of truth for all Leo metrics
Implement robust data quality frameworks with staging-first policies and automated validation pipelines
Design extensible metrics schemas that support complex analytical queries while optimizing for AI retrieval patterns
Develop intelligent orchestration for metrics generation workflows with comprehensive audit trails
Lead the design of semantic data models that balance analytical performance with AI retrieval requirements
Implement cross-domain federated query capabilities with sophisticated query optimization techniques
Architect a globally distributed vector database infrastructure capable of managing billions of embeddings with consistent sub-100ms retrieval times
Design and implement hybrid search strategies combining dense vectors with sparse representations for optimal semantic retrieval
Establish automated compliance validation frameworks ensuring data handling meets Amazon's security standards
Qualification
Required
5+ years of data engineering experience
Experience with data modeling, warehousing and building ETL pipelines
Experience with SQL
Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
Experience mentoring team members on best practices
Preferred
Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
Experience operating large data warehouses
Benefits
Equity
Sign-on payments
Full range of medical, financial, and/or other benefits
Company
Amazon
Amazon is a tech firm with a focus on e-commerce, cloud computing, digital streaming, and artificial intelligence.
Funding
Current Stage
Public CompanyTotal Funding
$8.11BKey Investors
AmazonKleiner Perkins
2023-01-03Post Ipo Debt· $8B
2001-07-24Post Ipo Equity· $100M
1997-05-15IPO
Recent News
The Motley Fool
2026-01-09
2026-01-09
2026-01-09
Company data provided by crunchbase