GCP Data Engineer @ Mastech Digital | Jobright.ai
GCP Data Engineer jobs in San Jose, CA
200+ applicants · Posted by Agency

Mastech Digital · 1 day ago

GCP Data Engineer

Information Technology
Growth Opportunities
H1B Sponsor Likely
Hiring Manager
Vivek Shrivastava

Responsibilities

Ability to design and develop a high-performance data pipeline framework from scratch
Data ingestion across systems
Data quality and curation
Data transformation and efficient data storage
Data reconciliation, monitoring and controls
Support reporting model and other downstream application needs
Skill in technical design documentation, data modeling, and application performance tuning
Lead and manage a team of data engineers, contribute to code reviews, and guide the team in designing and developing complex data pipelines that adhere to the defined standards.
Be hands-on: perform POCs on open-source and licensed tools in the market and share recommendations.
Provide technical leadership and contribute to the definition, development, integration, testing, documentation, and support across multiple platforms (GCP, Python, HANA)
Establish a consistent project management framework and develop processes to deliver high-quality software, in rapid iterations, for business partners in multiple geographies
Participate in a team that designs, develops, troubleshoots, and debugs software programs for databases, applications, tools, etc.
Experience in balancing production platform stability, feature delivery and reduction of technical debt across a broad landscape of technologies.
Skill in the following platforms, tools, and technologies:
GCP cloud platform – GCS, BigQuery, streaming (Pub/Sub), Dataproc and Dataflow, NiFi
Python, PySpark, Kafka, SQL, shell scripting, and stored procedures
Data warehouse, distributed data platforms and data lake
Database definition, schema design, Looker views and models
CI/CD pipeline
Proven track record in scripting code in Python, PySpark and SQL
Excellent structured thinking skills, with the ability to break down multi-dimensional problems
Ability to navigate ambiguity and work in a fast-moving environment with multiple stakeholders
Good communication skills and ability to coordinate and work with cross-functional teams.

Qualification

GCP cloud platform, Python, Data pipeline development, PySpark, SQL, Data warehouse, CI/CD pipeline, Kafka, Shell scripting, Stored procedures, Data modeling, Looker Views, Technical design documentation

Company

Mastech Digital

Mastech Digital provides IT associates in digital and mainstream technologies, as well as Digital Transformation Services around Salesforce.com and SAP.

H1B Sponsorship

Mastech Digital has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2023 (398)
2022 (900)
2021 (896)
2020 (594)

Funding

Current Stage
Public Company
Total Funding
unknown
IPO: 2008-09-26

Leadership Team

Vivek Gupta
Member of the board, President and CEO
Ashok Trivedi
Co-Founder & Co-Chairman
Company data provided by Crunchbase