Web Scraping Specialist jobs in United States
cer-icon
Apply on Employer Site
company-logo

Wynd Labs ยท 2 months ago

Web Scraping Specialist

Wynd Labs builds infrastructure that delivers massive amounts of web data to support AI model training. They are seeking a Web Scraping Specialist to lead data gathering and analysis efforts, optimize scraping processes, and contribute to transforming internet data accessibility.

Artificial Intelligence (AI)BlockchainInternet

Responsibilities

Write, test, and refine code that extracts data from various online sources, ensuring reliability and efficiency
Perform data retrieval tasks, handling complexities such as pagination and dynamic content loaded with AJAX
Clean and format extracted data, ensuring it meets quality standards for further analysis or processing
Database management: Store and manage the scraped data in appropriate databases, optimizing for access speed and data integrity
Regularly monitor the scraping processes, identify and resolve any issues to maintain continuous data flow

Qualification

Web scrapingPythonJavaScriptBeautifulSoupScrapySeleniumAsynchronous programmingMultithreadingNoSQL databasesCloud servicesHTMLMachine learningCSSDocument Object Model (DOM)Open-source participation

Required

Demonstrated ability to extract data from complex websites with minimal supervision, with a portfolio or examples of past projects
Proficiency in languages such as Python or JavaScript, with strong skills in libraries and frameworks like BeautifulSoup, Scrapy, or Selenium
Knowledge of asynchronous programming, multithreading, and distributed scraping
In-depth knowledge of HTML, CSS, JavaScript, and the Document Object Model (DOM)
Experience with NoSQL databases (MongoDB, Cassandra), capable of designing efficient storage solutions and managing data integrity
Ability to apply machine learning algorithms for data cleaning, categorization, or predictive analysis adds significant value
Experience with cloud services (AWS, Google Cloud, Azure) for deploying and managing scraping jobs at scale
Active participation in open-source projects related to web scraping, data processing, or similar fields

Benefits

Work Remotely
Benefits and equity package

Company

Wynd Labs

twittertwittertwitter
company-logo
Wynd Labs is a platform that makes public web data accessible for AI.

Funding

Current Stage
Early Stage
Company data provided by crunchbase