TriCom Technical Services · 6 days ago
Web Data Engineer
TriCom Technical Services is seeking a Web Data Engineer who is passionate about extracting structured insights from unstructured web data. The role involves designing, building, and maintaining scalable web scraping pipelines to gather healthcare provider information from multiple online sources.
Digital MarketingProfessional NetworkingProfessional Services
Responsibilities
Develop, deploy, and maintain robust web scraping pipelines for collecting healthcare provider data
Work with agentic frameworks (e.g. Firecrawl) to automate dynamic data extraction workflows
Use tools such as Selenium to extract and parse structured/unstructured web data
Ensure data accuracy, completeness, and freshness through validation, deduplication, and error-handling processes
Collaborate with data engineers to integrate scraped data into our existing data pipelines and storage systems
Monitor scraping performance and troubleshoot issues with site structure changes, blocking mechanisms, or throttling
Follow best practices for ethical and compliant data collection
Qualification
Required
3+ years of professional experience in Python-based web scraping or data engineering, preferably in a SaaS based environment
Strong proficiency with Python and libraries such as Selenium, BeautifulSoup or Playwright
Familiarity with agentic scraping frameworks (e.g., Firecrawl) or autonomous browser-based extraction systems
Experience handling large-scale scraping, asynchronous requests, and data normalization
Working knowledge of data storage formats and systems (e.g., JSON, Parquet, SQL, or cloud databases)
Strong problem-solving skills and ability to debug complex scraping workflows
Understanding web protocols, HTML structures, and REST APIs
Bachelor's degree in data science, Computer Science, Statistics, Mathematics, or a related quantitative field
Preferred
Experience with cloud-based data pipelines (Databricks)
Knowledge of healthcare provider data or healthcare data standards
Familiarity with AI-driven or LLM-powered data collection frameworks
Benefits
100% Paid employee Medical/Dental Benefits
Paid time off
Paid Holidays
401(k) (with immediately-vested company match)