estate-crawler: Scraping the real estate agencies for up-to-date house listings as soon as they arrive!
Stars: ✭ 20 (+42.86%)
ant: A web crawler for Go
Stars: ✭ 264 (+1785.71%)
dht-spider: A simple BitTorrent magnet link crawler based on the DHT protocol
Stars: ✭ 16 (+14.29%)
double-agent: A test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (+778.57%)
Magic google: Google search results crawler; get the Google search results that you need
Stars: ✭ 247 (+1664.29%)
Inventus: Inventus is a spider designed to find subdomains of a specific domain by crawling it and any subdomains it discovers.
Stars: ✭ 80 (+471.43%)
Core: 🔞 JAVClub - never let your big sisters get lost again
Stars: ✭ 2,728 (+19385.71%)
Ppspider: A web spider framework built on puppeteer. It provides flexible task-queue management and task scheduling via decorators, convenient data storage (nedb / mongodb), and support for data visualization and user interaction.
Stars: ✭ 237 (+1592.86%)
ZSpider: A crawler program based on Electron
Stars: ✭ 37 (+164.29%)
Killshot: A Penetration Testing Framework, Information gathering tool & Website Vulnerability Scanner
Stars: ✭ 237 (+1592.86%)
scrapy-LBC: LeBonCoin spider built with Scrapy and ElasticSearch
Stars: ✭ 14 (+0%)
vietnam-ecommerce-crawler: Crawling data from lazada, websosanh, compare.vn, cdiscount and cungmua with flexible configs
Stars: ✭ 28 (+100%)
Laravel Crawler Detect: A Laravel wrapper for CrawlerDetect - the web crawler detection library
Stars: ✭ 227 (+1521.43%)
seenreq: Generate an object for testing whether a request has been sent; request is Mikeal's request.
Stars: ✭ 42 (+200%)
scrapy-html-storage: Scrapy downloader middleware that stores response HTML to disk.
Stars: ✭ 17 (+21.43%)
TaobaoSpider: This taobao spider has been archived
Stars: ✭ 28 (+100%)
Syncplaylist: Sync playlists between music platforms
Stars: ✭ 218 (+1457.14%)
Jd mask robot: JD.com face mask stock monitoring crawler (no selenium): QR-code login, price checks, add to cart, order placement, flash sales
Stars: ✭ 216 (+1442.86%)
crawler: A collection of Python crawler projects
Stars: ✭ 29 (+107.14%)
Lspider: LSpider, a front-end crawler tailored for passive scanners
Stars: ✭ 214 (+1428.57%)
asyncpy: A lightweight asynchronous coroutine web crawler framework built with asyncio and aiohttp
Stars: ✭ 86 (+514.29%)
Biliutil: Toolkit for batch-downloading videos from Bilibili.com
Stars: ✭ 212 (+1414.29%)
grapy: Grapy, a fast high-level web crawling framework for Python 3.3 or later, based on asyncio.
Stars: ✭ 18 (+28.57%)
Dht: BitTorrent DHT Protocol && DHT Spider.
Stars: ✭ 2,459 (+17464.29%)
scrapy-fieldstats: A Scrapy extension to log item coverage when the spider shuts down
Stars: ✭ 17 (+21.43%)
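DeadPool: A crawler application built on celery as its core framework. Crawl tasks can be added flexibly and crawlers for multiple sites can run at the same time. All components natively support scaled concurrency and distribution, and together with celery's native distributed calls this enables large-scale concurrency.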
Stars: ✭ 38 (+171.43%)
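Fiction house: A full-featured, multi-platform (web, Android app, WeChat mini program) system for serialized novels and comics with a responsive layout, comprising a featured novels section, a light novels section, and a comics section. Features include novel/comic categories, search, rankings, completed works, ratings, online reading, bookshelves, and reading history, as well as novel downloads, novel bullet comments (danmaku), automatic collection, updating, and error correction of novels and comics, automatic sharing of novel content to Weibo, automatic email promotion, and automatic link submission to the Baidu search engine.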
Stars: ✭ 2,710 (+19257.14%)
Wereader: A full-featured crawler for WeChat Read (微信读书), wereader
Stars: ✭ 207 (+1378.57%)
Colly: Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+110864.29%)
Jssoup: JavaScript + BeautifulSoup = JSSoup
Stars: ✭ 203 (+1350%)
fernando-pessoa: Classifier of Fernando Pessoa's poems according to his heteronyms
Stars: ✭ 31 (+121.43%)
gospider: ⚡ Lightweight Golang spider framework
Stars: ✭ 183 (+1207.14%)
Querylist: 🕷️ The elegant, progressive PHP crawler framework!
Stars: ✭ 2,392 (+16985.71%)
Zhihuspider: Multi-threaded Zhihu user crawler, based on python3
Stars: ✭ 201 (+1335.71%)
main project: Projects including a nodejs-based web chat room and crawler, a vue music player, and a management system with a PHP back end
Stars: ✭ 49 (+250%)
Cangibrina: A fast and powerful dashboard (admin) finder
Stars: ✭ 200 (+1328.57%)
spider: python crawlers (amazon, confluence ...)
Stars: ✭ 21 (+50%)
easypoi: A simple, free, and efficient tool for collecting and analyzing Baidu Maps POI data.
Stars: ✭ 87 (+521.43%)
glyphhanger: Your web font utility belt. It can subset web fonts. It can find unicode-ranges for you automatically. It makes julienne fries.
Stars: ✭ 422 (+2914.29%)
Portia Dashboard: portia-dashboard is a visual web crawler based on scrapinghub/portia
Stars: ✭ 199 (+1321.43%)
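Fooproxy: A robust, efficient, score-based, targeted IP proxy pool + API service. You can plug in your own collectors at any time to crawl proxy IPs, and it builds a separate database of valid proxy IPs for each of your crawler's target websites. Supports MongoDB 4.0 and uses Python 3.7.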
Stars: ✭ 195 (+1292.86%)
sede: Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data
Stars: ✭ 83 (+492.86%)
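Zi5book: Site-wide crawler for Kindle e-books on book.zi5.me, organized by author and book title. Each book comes in two formats, mobi and epub, and the whole site is crawled in a distributed fashion.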
Stars: ✭ 191 (+1264.29%)