Grab Site: The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+2025%)
Fscrawler: Elasticsearch File System Crawler (FS Crawler)
Stars: ✭ 906 (+2731.25%)
Creeper: 🐾 Creeper - The Next Generation Crawler Framework (Go)
Stars: ✭ 762 (+2281.25%)
Fbcrawl: A Facebook crawler
Stars: ✭ 536 (+1575%)
Scrapit: Scraping scripts for various websites.
Stars: ✭ 25 (-21.87%)
Icrawler: A multithreaded crawler framework with many built-in image crawlers.
Stars: ✭ 629 (+1865.63%)
Ccrawl: Simple CORPORA list crawler
Stars: ✭ 11 (-65.62%)
Fess: Fess is a very powerful and easily deployable Enterprise Search Server.
Stars: ✭ 561 (+1653.13%)
Python: Python scripts for simulated Zhihu login, web crawling, Excel operations, WeChat official accounts, and remote power-on.
Stars: ✭ 7,355 (+22884.38%)
Crawler: A high performance web crawler in Elixir.
Stars: ✭ 781 (+2340.63%)
Go jobs: A look at the Golang job market.
Stars: ✭ 526 (+1543.75%)
Sqliv: Massive SQL injection vulnerability scanner
Stars: ✭ 840 (+2525%)
Magnet Dht: ✌️ Python3 BitTorrent DHT crawler
Stars: ✭ 692 (+2062.5%)
Axegrinder: Crawl websites for accessibility issues from the command line.
Stars: ✭ 12 (-62.5%)
Scrapyrt: HTTP API for Scrapy spiders
Stars: ✭ 637 (+1890.63%)
Tumblthree: A Tumblr Blog Backup Application
Stars: ✭ 923 (+2784.38%)
Papercrawler: Crawler used to crawl papers
Stars: ✭ 20 (-37.5%)
Netdiscovery: NetDiscovery is a general-purpose crawler framework/middleware built on Vert.x, RxJava 2, and other frameworks.
Stars: ✭ 573 (+1690.63%)
Zhihu Crawler: zhihu-crawler is a high-performance, horizontally scalable, distributed crawler project written in Java, with support for a free HTTP proxy pool.
Stars: ✭ 890 (+2681.25%)
Wechatsogou: A crawler API for WeChat official accounts, based on Sogou WeChat search.
Stars: ✭ 5,220 (+16212.5%)
Xsrfprobe: The Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.
Stars: ✭ 532 (+1562.5%)
Gospider: Gospider - Fast web spider written in Go
Stars: ✭ 785 (+2353.13%)
Haipproxy: 💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis
Stars: ✭ 4,993 (+15503.13%)
Pic Gather: [ Closed ] 🎨 Image collector that supports custom acquisition source configuration and is compatible with macOS and Windows.
Stars: ✭ 842 (+2531.25%)
Pxer: A tool for pixiv.net: a pixiv crawler that anyone can use.
Stars: ✭ 776 (+2325%)
Pypergrabber: Fetches PubMed article IDs (PMIDs) from an email inbox, then crawls PubMed, Google Scholar and Sci-Hub for the respective PDF files.
Stars: ✭ 14 (-56.25%)
Fetchbot: A simple and flexible web crawler that follows the robots.txt policies and crawl delays.
Stars: ✭ 753 (+2253.13%)
Xalpha: A backtesting engine for fund investment management.
Stars: ✭ 683 (+2034.38%)
Spidr: A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+1950%)
Appcrawler: An automated app traversal tool based on Appium.
Stars: ✭ 925 (+2790.63%)
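Price Monitor: JD.com product price monitor: tracks the prices of user-selected products and sends email/WeChat alerts on price drops. Tech: Python crawler, IP proxy pool, JS API crawling, Selenium page crawling.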
Stars: ✭ 634 (+1881.25%)
Sina Stock Crawler: Sina stock options crawler with CSV output (crawls Sina's Shanghai Stock Exchange ETF options data)
Stars: ✭ 12 (-62.5%)
Course Crawler: 🎓 Course downloader for the MOOC platforms China University MOOC, XuetangX, NetEase Cloud Classroom, CNMOOC, and iCourse.
Stars: ✭ 611 (+1809.38%)
Mzitu: 👧 Crawler for model photo sets (part 2)
Stars: ✭ 920 (+2775%)
Newcrawler: Free Web Scraping Tool with Java
Stars: ✭ 589 (+1740.63%)
Autocrawler: Google, Naver multiprocess image web crawler (Selenium)
Stars: ✭ 957 (+2890.63%)
Douyin: API of DouYin for Humans, used to crawl popular videos and music
Stars: ✭ 580 (+1712.5%)
Finalrecon: The Last Web Recon Tool You'll Need
Stars: ✭ 888 (+2675%)
Filemasta: A search application to explore, discover and share online files
Stars: ✭ 571 (+1684.38%)
Disec: Distributed Image Search Engine Crawler
Stars: ✭ 11 (-65.62%)
Xxl Crawler: A distributed web crawler framework (XXL-CRAWLER).
Stars: ✭ 561 (+1653.13%)
Psi Report: Crawls a website, gets PageSpeed Insights data for each page, and exports an HTML report.
Stars: ✭ 6 (-81.25%)
Scrapy Redis: Redis-based components for Scrapy.
Stars: ✭ 4,998 (+15518.75%)
Torbot: Dark Web OSINT Tool
Stars: ✭ 821 (+2465.63%)
Pyptt: A PTT API that supports both PTT and PTT2.
Stars: ✭ 527 (+1546.88%)
Instagram Profilecrawl: 📝 Quickly crawl the information (e.g. followers, tags) of an Instagram profile.
Stars: ✭ 816 (+2450%)
Vw Crawler: 🐞 A simple, lightweight Java crawler framework; a little knowledge of regular expressions and CSS selectors is enough to collect data easily.
Stars: ✭ 32 (+0%)
Onion Crawler: Tor website crawler (specific for Alphabay at the time)
Stars: ✭ 15 (-53.12%)
Lulu: [Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+2365.63%)