Tredence Inc. · 1 day ago
AWS Data Architect
Tredence Inc. is seeking a highly skilled AWS Data Engineer with extensive experience to design, build, and optimize large-scale data pipelines and cloud-based data solutions. The role involves collaborating with data architects, implementing data quality measures, and providing technical leadership to the data engineering team.
Responsibilities
Design, develop, and maintain scalable, reliable, and high-performance data pipelines using AWS services (Glue, EMR, Redshift, Lambda, Kinesis, Step Functions, etc.)
Build and manage ETL/ELT workflows for structured, semi-structured, and unstructured data using Glue, Spark, Python, and SQL
Implement data ingestion frameworks including real-time streaming (Kinesis) and batch data processing
Integrate data from diverse sources (RDBMS, APIs, streaming sources, on-prem systems) into cloud-based data platforms
Develop complex data transformation logic using PySpark, Glue ETL, SQL, and EMR jobs
Work with AWS data migration tools such as DMS, DataSync, SCT, and MWAA to support migration and modernization initiatives
Collaborate with data architects to design scalable data lake and data warehouse architectures on AWS
Apply strong knowledge of data modeling (star/snowflake schemas), dimensional modeling, and data warehousing concepts
Create optimized table structures and schemas in Redshift and Delta Lake formats
Implement and automate data quality checks, validation rules, and reconciliation frameworks
Ensure data governance and security best practices, including IAM permissions, encryption, access controls, and compliance with regulatory standards
Maintain data lineage, metadata, and documentation for auditability
Monitor data pipelines using CloudWatch, Glue job metrics, EMR logs, and custom observability dashboards
Troubleshoot pipeline failures, performance bottlenecks, and data inconsistencies
Optimize data pipelines for performance, reliability, and cost-efficiency, leveraging AWS best practices
Work closely with data scientists, analysts, architects, and business stakeholders to understand data needs and propose technical solutions
Provide technical leadership and mentorship to junior engineers, conducting reviews, training sessions, and knowledge-sharing
Contribute to continuous improvements in engineering processes, CI/CD practices, and DevOps automation for data pipelines
Qualification
Required
10+ years of hands-on experience in data engineering, with at least 4+ years working extensively in AWS
Proven expertise with AWS services: Core Data Services: Glue, Redshift, EMR, Athena, Lake Formation
Storage & Compute: S3, Lambda, EC2, Step Functions
Migration Tools: DMS, DataSync, MWAA
Strong proficiency in Python, PySpark, SQL, and data pipeline frameworks
Experience building distributed data processing systems using Apache Spark on EMR or Glue
Strong understanding of data modeling, data warehousing, ETL frameworks, and big data ecosystems
Hands-on experience with CI/CD pipelines (CodePipeline, GitHub Actions, Jenkins) for data workloads
Solid understanding of security best practices, IAM policies, encryption (KMS), and network configuration in AWS
Company
Tredence Inc.
Tredence is a global data science solutions provider focused on solving the last mile problem in AI.
H1B Sponsorship
Tredence Inc. has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (143)
2024 (103)
2023 (103)
2022 (74)
2021 (69)
2020 (75)
Funding
Current Stage
Late StageTotal Funding
$205MKey Investors
Advent InternationalChicago Pacific Founders
2022-12-22Series B· $175M
2020-12-10Series A· $30M
Recent News
2026-01-06
2025-11-13
Company data provided by crunchbase