Instagram Scraperscrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot
TwintAn advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
imgur-scraperRetrieve years of imgur.com's data without any authentication.
PastaBeanPython Script to Scrape Pastebin with Regex
ceroScrape domain names from SSL certificates of arbitrary hosts
Crawler pubg.op.ggThis is a web crawler for pubg.op.gg, written by Ruichong Liu. 绝地求生游戏数据抓取
SpiderSpider项目将会不断更新本人学习使用过的爬虫方法!!!
ha-multiscrapeHome Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
GChanScrape boards and threads from 4chan (8kun WIP). Downloads images, videos and HTML if desired.
Scrape-Finance-Data-v2A standalone package to scrape financial data from listed Vietnamese companies via Vietstock
diffbot-php-client[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
fanslySimply scrape / download all the media from an fansly account
visdomA library use jQuery like API for html parsing & node selecting & node mutation, suitable for web scraping and html confusion.
scrapersscrapers for building your own image databases
stweetAdvanced python library to scrap Twitter (tweets, users) from unofficial API
pyscrapersScrapers for vk, facebook, instagram and more
readability-cliA CLI for Mozilla Readability. Get clean, uncluttered, ready-to-read HTML from any webpage!
pupflareA webpage proxy that request through Chromium (puppeteer) - can be used to bypass Cloudflare anti bot / anti ddos on any application (like curl)