USA_Data Scientist jobs in United States
cer-icon
Apply on Employer Site
company-logo

VARITE INC · 12 hours ago

USA_Data Scientist

VARITE INC is seeking a skilled Data Engineer to design, build, and maintain high-quality, scalable data pipelines. The role focuses on implementing data models, ensuring data quality, and maintaining CI/CD pipelines.

Information Technology & Services
check
Growth Opportunities

Responsibilities

Design and implement Silver and Gold layer data models following medallion architecture best practices
Perform data cleansing, standardization, enrichment, and aggregation to support analytics and reporting
Build optimized PySpark-based transformations for large-scale data processing
Ensure data consistency, performance, and scalability across datasets
Build and maintain CI/CD pipelines using Git-based workflows (Azure DevOps / GitHub)
Use ARM templates (or IaC equivalents) for automated infrastructure provisioning
Enable automated deployment of data pipelines, notebooks, and configurations
Follow DevOps best practices for version control, branching, and release management
Develop reusable Python-based automation scripts for pipeline orchestration, validation, and monitoring
Create modular, maintainable, and testable Python code
Support automation of metadata, logging, alerting, and operational tasks
Implement data quality libraries such as Great Expectations
Define and automate data quality rules (completeness, accuracy, freshness, consistency)
Integrate data quality checks into data pipelines and CI/CD workflows
Monitor, log, and troubleshoot data quality issues proactively
Work closely with data architects, analysts, QA, and business stakeholders
Translate business and analytical requirements into robust data engineering solutions
Participate in Agile ceremonies and support sprint-based delivery
Provide production support, performance tuning, and troubleshooting

Qualification

PythonPySparkCI/CD pipelinesGreat ExpectationsARM templatesData modelingMedallion architectureData qualityAgile methodologyLarge datasets

Required

5–10+ years of experience in data engineering or analytics engineering
Strong hands-on experience with Silver & Gold layer development, Python (automation and data processing) and PySpark
Experience with Great Expectations (data quality framework) and CI/CD pipelines using Git-based tools
Hands-on experience with ARM templates or infrastructure-as-code concepts
Strong understanding of data modeling and medallion architecture
Experience working with large datasets in distributed environments

Preferred

Microsoft Certified: Azure Data Engineer Associate or Azure Enterprise Data Analyst Associate
Waste management or oil and gas domain knowledge

Company

VARITE INC

company-logo
VARITE has a definite spirit.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Adarsh Katyal
President & CEO
linkedin
leader-logo
Sue Patel Arora
Vice President Of Strategic Partnerships
linkedin
Company data provided by crunchbase