Lulu: [Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+1417.31%)
Colly: Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+29775%)
Linkedin Profile Scraper: 🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+228.85%)
diffbot-php-client: [Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (+1.92%)
Crawly: Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+746.15%)
Gopa: [WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+432.69%)
zcrawl: An open source web crawling platform
Stars: ✭ 21 (-59.62%)
Fbcrawl: A Facebook crawler
Stars: ✭ 536 (+930.77%)
scrapy facebooker: Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-57.69%)
scrapy-distributed: A series of distributed components for Scrapy, including RabbitMQ-based, Kafka-based, and RedisBloom-based components.
Stars: ✭ 38 (-26.92%)
fetchurls: A bash script to spider a site, follow links, and fetch URLs (with built-in filtering) into a generated text file.
Stars: ✭ 97 (+86.54%)
Dataflowkit: Extract structured data from websites. Website scraping.
Stars: ✭ 456 (+776.92%)
Ferret: Declarative web scraping
Stars: ✭ 4,837 (+9201.92%)
Grab Site: The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+1207.69%)
Zeiver: A Scraper, Downloader, & Recorder for static open directories.
Stars: ✭ 14 (-73.08%)
proxycrawl-python: ProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (-1.92%)
bots-zoo: No description or website provided.
Stars: ✭ 59 (+13.46%)
Geziyor: Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+2296.15%)
Anime Dl: Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (+265.38%)
Jsonframe Cheerio: Simple multi-level scraper with JSON input/output for Cheerio
Stars: ✭ 196 (+276.92%)
Querylist: 🕷️ The elegant, progressive PHP crawler framework!
Stars: ✭ 2,392 (+4500%)
copycat: A PHP Scraping Class
Stars: ✭ 70 (+34.62%)
Goose Parser: Universal scraping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+305.77%)
Scrape Linkedin Selenium: `scrape_linkedin` is a Python package that allows you to scrape personal LinkedIn profiles & company pages, turning the data into structured JSON.
Stars: ✭ 239 (+359.62%)
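BaiduSpider: The project has moved to https://github.com/BaiduSpider/BaiduSpider !! A crawler for Baidu search results. It currently supports Baidu web search, Baidu Images, Baidu Zhidao (Q&A), Baidu Video, Baidu News, Baidu Wenku (documents), Baidu Jingyan (how-to guides), and Baidu Baike (encyclopedia).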
Stars: ✭ 29 (-44.23%)
Goribot: [Crawler/Scraper for Golang] 🕷 A lightweight, distributed-friendly Golang crawler framework.
Stars: ✭ 190 (+265.38%)
Serpscrap: SEO Python scraper to extract data from major search engine result pages. Extract data like URL, title, snippet, rich snippet, and the type from search results for given keywords. Detect ads or make automated screenshots. You can also fetch the text content of URLs provided in search results or supplied by you. It's useful for SEO and business-related research tasks.
Stars: ✭ 153 (+194.23%)
Annie: 👾 Fast and simple video download library and CLI tool written in Go
Stars: ✭ 16,369 (+31378.85%)
Scrapysharp: reborn of https://bitbucket.org/rflechner/scrapysharp
Stars: ✭ 226 (+334.62%)
google-scraper: This class can retrieve search results from Google.
Stars: ✭ 33 (-36.54%)
Phpscraper: PHP Scraper - a highly opinionated web-interface for PHP
Stars: ✭ 148 (+184.62%)
Website-downloader: 💡 Download the complete source code of any website (including all assets: JavaScript, stylesheets, images) using Node.js
Stars: ✭ 615 (+1082.69%)
gospider: ⚡ Lightweight Golang spider framework
Stars: ✭ 183 (+251.92%)
Pahe.ph-Scraper: Pahe.ph [Pahe.in] Movies Website Scraper
Stars: ✭ 57 (+9.62%)
gochanges: [ARCHIVED] website changes tracker 🔍
Stars: ✭ 12 (-76.92%)
warcworker: A dockerized, queued, high-fidelity web archiver based on Squidwarc
Stars: ✭ 48 (-7.69%)
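gathertool: gathertool is a Golang library for script-style development that aims to speed up program development for the scenarios it targets: a lightweight crawler library, an API testing and stress testing library, a DB operations library, and more.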
Stars: ✭ 36 (-30.77%)
turtle: Instagram Photo Downloader
Stars: ✭ 15 (-71.15%)
stweet: Advanced Python library to scrape Twitter (tweets, users) from unofficial API
Stars: ✭ 287 (+451.92%)
socials: 👨👩👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (-28.85%)
Udemycoursegrabber: Your will to enroll in a Udemy course is here, but the money isn't? Search no more! This Python program searches for your desired course in more than [insert big number here] websites, compares the last updated date, and gives you the download link of the latest one back, but you also have the choice to see the other ones as well!
Stars: ✭ 137 (+163.46%)
double-agent: A test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (+136.54%)
scrapers: scrapers for building your own image databases
Stars: ✭ 46 (-11.54%)
fansly: Simply scrape / download all the media from a Fansly account
Stars: ✭ 351 (+575%)
scrapy-fieldstats: A Scrapy extension to log item coverage when the spider shuts down
Stars: ✭ 17 (-67.31%)
saveddit: Bulk Downloader for Reddit
Stars: ✭ 130 (+150%)
vsco-scraper: Easily allows for scraping a VSCO
Stars: ✭ 106 (+103.85%)
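scraper: An image crawling and download tool that quickly crawls and downloads images, photos, and illustrations uploaded by designers and users on ZCOOL (https://www.zcool.com.cn/) and CNU (http://www.cnu.cc/).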
Stars: ✭ 64 (+23.08%)
4scanner: Continuously search imageboard threads for images/webms and download them
Stars: ✭ 103 (+98.08%)
crawling-framework: Easily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-57.69%)
yutto: 🧊 A cute and capricious Bilibili video downloader (bilili V2)
Stars: ✭ 383 (+636.54%)
openMIC: Meter Information Collection System
Stars: ✭ 15 (-71.15%)
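TikTokDownloader PyWebIO: 🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous Douyin | TikTok data scraping tool that supports API calls and online batch parsing and downloading.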
Stars: ✭ 919 (+1667.31%)