CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-70.94%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-95.51%)
flink-crawlerContinuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-96.83%)
CrawlabDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+454.29%)
Webstera reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (-75.96%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-88.71%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+926.09%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-81.7%)
Fictiondown小说下载|小说爬取|起点|笔趣阁|导出Markdown|导出txt|转换epub|广告过滤|自动校对
Stars: ✭ 362 (-76.09%)
GosintOSINT Swiss Army Knife
Stars: ✭ 401 (-73.51%)
GeziyorGeziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (-17.7%)
Xcrawler快速、简洁且强大的PHP爬虫框架
Stars: ✭ 344 (-77.28%)
Bilili🍻 bilibili video (including bangumi) and danmaku downloader | B站视频(含番剧)、弹幕下载器
Stars: ✭ 379 (-74.97%)
ToapiEvery web site provides APIs.
Stars: ✭ 3,209 (+111.96%)
XsrfprobeThe Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.
Stars: ✭ 532 (-64.86%)
Go jobs带你了解一下Golang的市场行情
Stars: ✭ 526 (-65.26%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (-64.6%)
DouyinAPI of DouYin for Humans used to Crawl Popular Videos and Musics
Stars: ✭ 580 (-61.69%)
Grab SiteThe archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (-55.09%)
Creeper🐾 Creeper - The Next Generation Crawler Framework (Go)
Stars: ✭ 762 (-49.67%)
Gopa AbandonedGOPA, a spider written in Go.(NOTE: this project moved to https://github.com/infinitbyte/gopa )
Stars: ✭ 98 (-93.53%)
Ttbot今日头条机器人,支持用户登陆、关注、取消关注、获取关注粉丝、发文、发悟空问答、点赞、评论、采集各种类型新闻讯息等,使用今日头条网页版API实现
Stars: ✭ 338 (-77.68%)
91porn Api🌭💦 91porn爬虫在线无限制API接口(永久有效,口令每日更新) 及 在线web预览
Stars: ✭ 341 (-77.48%)
Freshonions TorscraperFresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (-77.01%)
Zhihu Login知乎模拟登录,支持提取验证码和保存 Cookies
Stars: ✭ 340 (-77.54%)
Signature algorithm各种App、小程序、网站的请求签名或加密算法。 现已有:自如、小红书、蛋壳公寓、luckin coffee(瑞幸咖啡)、bangkokair(曼谷航空)
Stars: ✭ 380 (-74.9%)
Spider Flow新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (-75.89%)
Spiderpython crawler spider
Stars: ✭ 70 (-95.38%)
GospiderGospider - Fast web spider written in Go
Stars: ✭ 785 (-48.15%)
Zhihu Crawlerzhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目
Stars: ✭ 890 (-41.22%)
DotnetcrawlerDotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-93.39%)
Haipproxy💖 High available distributed ip proxy pool, powerd by Scrapy and Redis
Stars: ✭ 4,993 (+229.79%)
Awesome CrawlerA collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+216.58%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+219.48%)
NetdiscoveryNetDiscovery 是一款基于 Vert.x、RxJava 2 等框架实现的通用爬虫框架/中间件。
Stars: ✭ 573 (-62.15%)
Xxl CrawlerA distributed web crawler framework.(分布式爬虫框架XXL-CRAWLER)
Stars: ✭ 561 (-62.95%)
Douyinsdk抖音 SDK,数据采集,爬虫抓取不是梦
Stars: ✭ 99 (-93.46%)
Weixin Spider微信公众号爬虫,公众号历史文章,文章评论,文章阅读及在看数据,可视化web页面,可部署于Windows服务器。基于Python3之flask/mysql/redis/mitmproxy/pywin32等实现,高效微信爬虫,微信公众号爬虫,历史文章,文章评论,数据更新。
Stars: ✭ 287 (-81.04%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (-56.67%)
ScrapyrtHTTP API for Scrapy spiders
Stars: ✭ 637 (-57.93%)
CrawlerA high performance web crawler in Elixir.
Stars: ✭ 781 (-48.41%)
IcrawlerA multi-thread crawler framework with many builtin image crawlers provided.
Stars: ✭ 629 (-58.45%)
TorbotDark Web OSINT Tool
Stars: ✭ 821 (-45.77%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (-47.89%)
ScrapitScraping scripts for various websites.
Stars: ✭ 25 (-98.35%)
Lizard💐 Full Amazon Automatic Download
Stars: ✭ 41 (-97.29%)
PhotonIncredibly fast crawler designed for OSINT.
Stars: ✭ 8,332 (+450.33%)
MamanRust Web Crawler saving pages on Redis
Stars: ✭ 39 (-97.42%)
AvbookAV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database
Stars: ✭ 8,133 (+437.19%)
Awesome Python Primer自学入门 Python 优质中文资源索引,包含 书籍 / 文档 / 视频,适用于 爬虫 / Web / 数据分析 / 机器学习 方向
Stars: ✭ 57 (-96.24%)
BeanbunBeanbun 是用 PHP 编写的多进程网络爬虫框架,具有良好的开放性、高可扩展性,基于 Workerman。
Stars: ✭ 1,096 (-27.61%)
Gospidergolang实现的爬虫框架,使用者只需关心页面规则,提供web管理界面。基于colly开发。
Stars: ✭ 285 (-81.18%)
Sasila一个灵活、友好的爬虫框架
Stars: ✭ 286 (-81.11%)