Top 392 scraper open source projects

youtube-playlist
❄️ Extract links, ids, and names from a youtube playlist
ha-multiscrape
Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
civic-scraper
Tools for downloading agendas, minutes and other documents produced by local government
CourseCake
By serving course 📚 data that is more "edible" 🍰 for developers, we hope CourseCake offers a smooth approach to build useful tools for students.
OnlyFans
Scrape all the media from an OnlyFans account - Updated regularly
instagram-get-images
Instagram get images 🌄 (hashtags, account, locations) with puppeteer
YouTube-MA
💾 YouTube video metadata archiver written in Golang
scrapman
Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
jsonHunter
在线爬虫,online web scraper
rymscraper
Python API to extract data from rateyourmusic.com.
instagram-hashtag-scraper
NodeJS application for scraping recent top posts from Instagram by hashtag without API access.
lux
👾 Fast and simple video download library and CLI tool written in Go
linkedin-employee-scraper
Extract all employees from LinkedIn. Especially useful for companies with thousands of employees.
YT-DLP-SCRIPTS
...Just a place for me to share my various YT-DLP & related bash scripts.
stream-list-updater
Automation for updating an index of live George Floyd protest streams
nest-crawler
An easiest crawling and scraping module for NestJS
youtube-trending-videos-scraper
A scraper for videos that are trending on YouTube (https://www.youtube.com/feed/trending)
TinderBotz
Automated Tinder bot and scraper using selenium in python.
Image-Scraper
Fast concurrent image scraper
immo-feed
A extensible app for scraping property listings
EsriRESTScraper
A Python class that scrapes ESRI Rest Endpoints and exports data to a geodatabase
gesetze-tools
Scripts to maintain German law git repository
TikTokDownloader PyWebIO
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音|TikTok数据爬取工具,支持API调用,在线批量解析及下载。
dorkscout
DorkScout - Golang tool to automate google dork scan against the entiere internet or specific targets
ceiba-dl
NTU CEIBA 資料下載工具
LinkedIn-Scraper
A LinkedIn Scraper to scrape up to 10k LinkedIn profiles from company profile links and save their e-mail addresses if available!
4scanner
Continuously search imageboards threads for images/webms and download them
Recipe-Scraper
A JS package for scraping recipes from the web.
scraper
图片爬取下载工具,极速爬取下载 站酷https://www.zcool.com.cn/, CNU 视觉 http://www.cnu.cc/ 设计师/用户 上传的 图片/照片/插画。
GChan
Scrape boards and threads from 4chan (8kun WIP). Downloads images, videos and HTML if desired.
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
ant
A web crawler for Go
scrapeer
Essential PHP library that scrapes HTTP(S) and UDP trackers for torrent information.
pyitau
Unofficial client to access your Itaú bank data
rose
Analyse all kinds of data for a TV series
twpy
Twitter High level scraper for humans.
gHarvester
Proof of concept for a security issue (in my opinion) that I found in accounts.google.com
Federal-Parliament-Scraper
A scraper for obtaining information on the workings of the Belgian Federal Parliament.
trawler
scraper for facebook, gab, google and tiktok
PTTmineR
Parallel Searching and Crawling Data from PTT 🚀
wordpress-scraper
Simple, easy-to-use scraper to scrape data from WordPress JSON API
scripts
A collection of random scripts I coded up
301-360 of 392 scraper projects