A web scraping project that retrieves TV and movie data from two sources, then transforms and stores data in a MySQL database.
To easily scrape any data from the youtube homepage, a youtube channel/user, search results, playlists, and a single video itself.
An experiment in open-sourcing the web scrapers that feed the Los Angeles Times' California coronavirus tracker.
SpiderX allows you to watch movies by scraping data from the internet. This is android/ios client repo built with React native and Expo.
Script that crawls meta data from ICLR OpenReview webpage. Tutorials on installing and using Selenium and ChromeDriver on Ubuntu.
gazpacho is a web scraping library. It replaces requests and BeautifulSoup for most projects. gazpacho is small, simple, fast, and consistent. You should use it!
The web is full of data. Transistor is a web scraping framework for collecting, storing, and using targeted data from structured web pages.