Top 21 scrape open source projects

Cloudflare Scrape
A Python module to bypass Cloudflare's anti-bot page.
Instagram Scraper
scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot
Twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
cero
Scrape domain names from SSL certificates of arbitrary hosts
Crawler pubg.op.gg
This is a web crawler for pubg.op.gg, written by Ruichong Liu. 绝地求生游戏数据抓取
Spider
Spider项目将会不断更新本人学习使用过的爬虫方法!!!
ha-multiscrape
Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
GChan
Scrape boards and threads from 4chan (8kun WIP). Downloads images, videos and HTML if desired.
Scrape-Finance-Data-v2
A standalone package to scrape financial data from listed Vietnamese companies via Vietstock
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
visdom
A library use jQuery like API for html parsing & node selecting & node mutation, suitable for web scraping and html confusion.
scrapers
scrapers for building your own image databases
readability-cli
A CLI for Mozilla Readability. Get clean, uncluttered, ready-to-read HTML from any webpage!
pupflare
A webpage proxy that request through Chromium (puppeteer) - can be used to bypass Cloudflare anti bot / anti ddos on any application (like curl)
1-21 of 21 scrape projects