Neotech Global · 11 hours ago
Databricks architect with AI/ML & AWS
Neotech Global is seeking an experienced AI/ML Architect with expertise in Databricks on AWS to lead the design and implementation of scalable data and machine learning platforms. The role involves working with large datasets, optimizing pipelines, and driving analytical and ML capabilities across the organization.
Responsibilities
Develop, train, and optimize ML models using Python, PySpark, MLflow, and Databricks Machine Learning
Conduct exploratory data analysis (EDA) to identify patterns, trends, and insights in large datasets
Deploy ML models into production using MLflow, Databricks Workflows, or other MLOps pipelines
Build analytics solutions such as forecasting, anomaly detection, segmentation, or recommendation systems
Design ML architectures aligned with Databricks Lakehouse on AWS
Architect and build scalable ETL/ELT pipelines using PySpark, SQL, and Databricks Workflows
Implement Delta Lake best practices, including OPTIMIZE, ZORDER, partitioning, and schema evolution
Design lakehouse layers (Bronze/Silver/Gold) with strong separation of compute and serving layers
Optimize cluster performance and jobs using Spark tuning, caching, and shuffle minimization
Work with multi-terabyte, time-series, high velocity data in a distributed environment
Ensure robust data availability for downstream ML and analytics workloads
Architect end-to-end data and ML solutions using AWS services, including: S3 for storage, IAM for identity & access, Glue Catalog for metadata management, Networking for secure, high throughput data movement
Integrate Databricks with AWS-native compute, API layers, and low-latency endpoints
Translate business problems into scalable analytical or ML architectures
Communicate complex statistical and architectural concepts to non technical stakeholders
Collaborate with product, engineering, and business leaders to drive data-informed initiatives
Provide design leadership while remaining hands-on in execution
Qualification
Required
Bachelor's or Master's in Computer Science, Data Science, Engineering, Statistics, or related field
10+ years of experience in data engineering, ML engineering, or AI/ML architecture roles
Deep expertise in Databricks on AWS, including:
PySpark / Spark SQL
Databricks Notebooks
Delta Lake
Unity Catalog
MLflow
Databricks Jobs & Workflows
Strong programming ability in Python (pandas, numpy, scikit-learn)
Demonstrated experience with large-scale, multi-terabyte data processing
Strong understanding of ML algorithms, distributed systems, and data optimization
Preferred
Experience with MLOps and production deployment pipelines
Strong grasp of AWS-native data and compute services
Understanding of CI/CD using GitHub Actions, GitLab CI, or similar
Familiarity with deep learning frameworks (TensorFlow, PyTorch)
Company
Neotech Global
Neo Tech delivers advanced engineering, product development, and technology services to global OEMs and enterprises across automotive, medical devices, aerospace, oil & gas and BFSI sectors.
H1B Sponsorship
Neotech Global has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (24)
2024 (51)
2023 (45)
2022 (41)
2021 (30)
2020 (37)
Funding
Current Stage
Growth StageCompany data provided by crunchbase