Sr. Platform and DataLake Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

TetraScience · 4 months ago

Sr. Platform and DataLake Engineer

TetraScience is the Scientific Data and AI Cloud company, leading the market in AI-native scientific data sets and lab data management solutions. As a Senior Platform and Data Lake Engineer, you will be crucial in building and maintaining data infrastructure, collaborating with cross-functional teams to manage the ingestion, processing, and storage of large volumes of scientific data.

BiotechnologyData IntegrationData ManagementInternet of ThingsLife SciencePharmaceuticalSoftware
check
Growth Opportunities
badNo H1Bnote

Responsibilities

Design, develop, and optimize data lake solutions to support our scientific data pipelines and analytics capabilities
Design, develop, and optimize data pipelines and workflows within the Databricks platform
Design and architect services to meet customer data processing needs
Implement data quality and governance frameworks to ensure data integrity and compliance

Qualification

Data pipeline infrastructureDatabricks ecosystemCloud-based data technologiesLake House architecturePythonJavaTypescriptAWS servicesData governanceDevOps principlesMLOps principlesSpark/GlueDelta tables/icebergSnowflakeData warehousing solutionsETL toolsSoft skills

Required

8+ years of experience in the software development industry, preferably in data engineering, data warehousing or data analytics companies and teams
Experienced in designing and implementing complex, scalable data pipelines/ETL services
Expert level of Python, Java, and Typescript
Extensive in cloud-based data storage and processing technologies, particularly AWS services such as S3, Step Functions, Lambda, and Airflow
Expert level of understanding and hands-on experience with Lake House architecture
Ability to articulate ideas clearly, present findings persuasively, and build rapport with clients and team members

Preferred

Knowledge of basic DevOps and MLOps principles
3+ year of experience with the DataBricks ecosystem
Expert level of experience with Spark/Glue and Delta tables/iceberg
Working knowledge of Snowflake
Experience in working with Data Scientists and ML Developers
Experience in management and lead developer roles from technology services companies
Hands-on experience with data warehousing solutions and ETL tools

Benefits

100% employer-paid benefits for all eligible employees and immediate family members
Unlimited paid time off (PTO)
401K
Flexible working arrangements - Remote work
Company paid Life Insurance, LTD/STD
A culture of continuous improvement where you can grow your career and get coaching

Company

TetraScience

twittertwittertwitter
company-logo
TetraScience is an R&D cloud data management company that empowers transformation in life sciences and drug discovery.

Funding

Current Stage
Growth Stage
Total Funding
$99.14M
Key Investors
Underscore VCWaters CorporationDigital Science
2021-04-15Series B· $80M
2020-05-01Series A· $11M
2019-10-31Series A· $8M

Leadership Team

leader-logo
Patrick Grady
CEO
linkedin
leader-logo
Siping Wang
President & CTO
linkedin
Company data provided by crunchbase