SoftStandard Solutions · 15 hours ago
Data Engineer
SoftStandard Solutions is seeking a Data Engineer to design and maintain scalable data pipelines. The role involves collaborating with Data Scientists and ML Engineers, optimizing ETL workflows, and ensuring data quality across platforms.
Responsibilities
Design, build, and maintain scalable, high-performance data pipelines for structured and unstructured data
Develop and optimize ETL/ELT workflows to support analytics, AI, and machine learning use cases
Work closely with Data Scientists and ML Engineers to productionize AI/ML models
Design and manage data lakes, data warehouses, and lakehouse architectures
Ensure data quality, reliability, governance, and security across platforms
Optimize data ingestion, transformation, and storage for large-scale, real-time, and batch processing
Implement streaming data pipelines for near-real-time analytics
Enable feature engineering and feature stores for AI/ML workflows
Collaborate with Product, Analytics, and Engineering teams in an Agile environment
Mentor junior data engineers and drive data engineering best practices
Support CI/CD pipelines and infrastructure automation for data platforms
Qualification
Required
9–12+ Years of experience
Design, build, and maintain scalable, high-performance data pipelines for structured and unstructured data
Develop and optimize ETL/ELT workflows to support analytics, AI, and machine learning use cases
Work closely with Data Scientists and ML Engineers to productionize AI/ML models
Design and manage data lakes, data warehouses, and lakehouse architectures
Ensure data quality, reliability, governance, and security across platforms
Optimize data ingestion, transformation, and storage for large-scale, real-time, and batch processing
Implement streaming data pipelines for near-real-time analytics
Enable feature engineering and feature stores for AI/ML workflows
Collaborate with Product, Analytics, and Engineering teams in an Agile environment
Mentor junior data engineers and drive data engineering best practices
Support CI/CD pipelines and infrastructure automation for data platforms
Proficiency in Python, SQL, Scala, Apache Spark, PySpark, Databricks, Airflow, Prefect, Dagster
Experience with Apache Kafka, Spark Streaming, Flink, Kinesis, Pub/Sub, real-time & batch processing
Familiarity with AWS, Azure, GCP (S3, Glue, EMR, Redshift, Data Factory, Synapse, Data Lake, BigQuery, Dataflow)
Knowledge of Data Lakes, Lakehouse Architecture, Delta Lake, Iceberg, Hudi, Snowflake, Redshift, BigQuery
Experience with SQL & NoSQL, PostgreSQL, MySQL, MongoDB, Cassandra
Experience with Feature Engineering, Feature Stores (Feast, Databricks), ML Pipelines (MLflow, Kubeflow), LLM & GenAI Data Preparation
Experience with CI/CD, Docker, Kubernetes, Terraform, Data Quality, Metadata Management, Security & Compliance
Willingness to work onsite and relocate as per client requirements
Company
SoftStandard Solutions
SoftStandard Solutions is a leading consulting, business solution and systems integration firm with a unique blend of services.
H1B Sponsorship
SoftStandard Solutions has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (10)
2024 (23)
2023 (10)
2022 (25)
2021 (21)
2020 (12)
Funding
Current Stage
Growth StageCompany data provided by crunchbase