Polar IT · 16 hours ago
AWS Python Developer with PySpark
Polar IT is seeking an experienced Python Developer with strong expertise in AWS and PySpark to join their data engineering team. The ideal candidate will have hands-on experience developing scalable data pipelines and processing large data sets in cloud-based environments.
Big Data · Business Intelligence · Cloud Computing · Information Technology · Software
Responsibilities
Design, develop, and maintain data pipelines and ETL workflows using Python, PySpark, and AWS services
Build and optimize large-scale data processing and data transformation solutions
Integrate various data sources and ensure data quality, performance, and reliability
Collaborate with data engineers, analysts, and architects to deliver end-to-end data solutions
Implement best practices for code optimization, error handling, and data validation
Participate in code reviews, documentation, and deployment automation
Ensure adherence to data security and compliance standards
Qualifications
Required
Bachelor's degree in Computer Science, Data Engineering, or a related field
10+ years of experience in software development with a strong focus on Python
Hands-on experience with PySpark for distributed data processing
Solid understanding of AWS cloud services such as S3, Glue, Lambda, EMR, Redshift, and Athena
Strong experience in ETL development and data pipeline orchestration
Familiarity with SQL and relational/non-relational databases
Excellent analytical, debugging, and communication skills
Preferred
Experience with Airflow, Databricks, or other workflow management tools
Knowledge of CI/CD pipelines and version control tools like Git
Exposure to data lake or data warehouse architectures
Familiarity with Docker or Kubernetes for deployment
Company
Polar IT
Polar IT Services is a global information technology company with offices in Elkridge, MD; Atlanta, GA; and India. It provides high-quality software solutions to large and medium-sized customers.
Funding
Current Stage
Growth Stage
Company data provided by Crunchbase