Headless Chrome Crawler: Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+6.04%)
Mutual labels: crawler, scraper, scraping, crawling, chrome
Crawly: Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-90.9%)
Mutual labels: crawler, scraper, scraping, crawling
Lulu: [Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (-83.69%)
Mutual labels: crawler, scraper, scraping, crawling
proxycrawl-python: ProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (-98.95%)
Mutual labels: scraper, scraping, crawling, scraping-websites
Scrapy: Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+775.4%)
Mutual labels: hacktoberfest, crawler, scraping, crawling
Linkedin Profile Scraper: 🕵️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-96.46%)
Mutual labels: crawler, scraper, scraping, crawling
Colly: Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+221.17%)
Mutual labels: crawler, scraper, scraping, crawling
bots-zoo: No description or website provided.
Stars: ✭ 59 (-98.78%)
Mutual labels: crawler, scraper, scraping, crawling
Npkill: List any node_modules directories in your system, as well as the space they take up. You can then select which ones you want to erase to free up space.
Stars: ✭ 5,325 (+10.09%)
Mutual labels: cli, hacktoberfest, tool
LeetCode: Currently contains scraped data for around 1,500 problems from the site. More to follow...
Stars: ✭ 45 (-99.07%)
Mutual labels: data-mining, scraper, scraping-websites
Instagram-to-discord: Monitor an Instagram user account and automatically post new images to a Discord channel via a webhook. Working 2022!
Stars: ✭ 113 (-97.66%)
Mutual labels: scraper, scraping, scraping-websites
scrapman: Retrieve real (with JavaScript executed) HTML code from a URL, ultra fast, with support for loading multiple pages in parallel
Stars: ✭ 21 (-99.57%)
Mutual labels: scraper, scraping, scraping-websites
diffbot-php-client: [Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-98.9%)
Mutual labels: scraper, scraping, crawling
gochanges: [ARCHIVED] Website changes tracker 🔍
Stars: ✭ 12 (-99.75%)
Mutual labels: scraper, scraping, scraping-websites
wget-lua: Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-98.92%)
Mutual labels: scraper, scraping, crawling
document-dl: Command line program to download documents from web portals.
Stars: ✭ 14 (-99.71%)
Mutual labels: scraper, scraping, scraping-websites
papercut: Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-99.69%)
Mutual labels: crawler, scraper, scraping
Musoq: Use SQL on various data sources
Stars: ✭ 252 (-94.79%)
Mutual labels: cli, scraping, tool
Sasila: A flexible, friendly crawler framework
Stars: ✭ 286 (-94.09%)
Mutual labels: crawler, scraping, crawling
Gopa: [WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-94.27%)
Mutual labels: crawler, scraping, crawling