![company-logo](https://images.crunchbase.com/image/upload/t_cb-default-original/v1457577286/b21gytiooizrf410ghx3.png)
State Street Global Advisors · 3 days ago
Data Engineer
Wonder how qualified you are to the job?
FinanceFinancial Services
Insider Connection @State Street Global Advisors
Responsibilities
Design, develop, and maintain scalable data pipelines using pyspark on Databricks.
Implement and optimize stream processing workflows using Kafka for real-time data ingestion and processing.
Utilize Parquet and Avro-formatted data files for efficient storage and retrieval.
Leverage Databricks platform on AWS to build and manage data processing workflows and analytics.
Harness the power of Databricks Delta Lake and Parquet files for data warehousing and query optimization.
Collaborate closely with data analysts and scientists to provide reliable data solutions.
Implement robust testing methodologies and contribute to the pyspark/Python ecosystem.
Monitor data pipelines, identify and resolve issues, and ensure data integrity.
Stay up-to-date with the latest trends and technologies in data engineering.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
Bachelor’s or master’s degree in computer science or a related field.
Minimum 5 years of real world Data Engineering experience working on large scale data projects.
Strong proficiency in pySpark, Python and shell scripting, with a focus on software engineering best practices and a deep understanding of development lifecycle.
Experience working with workflow management tools such as Airflow
Experience with stream processing technologies, preferably Kafka.
Familiarity with Avro data serialization format and its usage in data engineering workflows.
Expertise in using Databricks platform on AWS for data processing and analytics.
Solid understanding of data warehousing concepts and experience with Delta Lake and Parquet files.
Proficiency in SQL and experience with relational databases.
Strong testing skills, with experience in implementing and executing unit tests, integration tests, and end-to-end tests using Python packages such as pytest.
Familiarity with the Python ecosystem, including PyPI packages and their integration into data engineering workflows.
Excellent problem-solving skills and ability to work in a fast-paced, collaborative environment.
Strong communication skills and ability to effectively communicate complex technical concepts to non-technical stakeholders.
Working experience with Databricks and pyspark
Proficiency in writing complex SQLs
Working experience with cloud platforms like AWS or Azure (preferably AWS)
Working Experience with Airflow
Experience working with very large datasets
Preferred
Experience working with reporting tools such as Tableau
Past experience working on Machine Learning projects
Past experience working in finance
Benefits
Medical care
Insurance
Savings plans
Flexible Work Programs
Development programs
Educational support
Company
State Street Global Advisors
![company-logo](https://images.crunchbase.com/image/upload/t_cb-default-original/v1457577286/b21gytiooizrf410ghx3.png)
State Street Global Advisors is the investment management division of State Street Corporation
Funding
Current Stage
Late StageLeadership Team
Recent News
2024-05-20
Company data provided by crunchbase