Gf SecretsSecret and/ credential patterns used for gf.
Stars: ✭ 96 (-61.13%)
Zhihu Login知乎模拟登录,支持提取验证码和保存 Cookies
Stars: ✭ 340 (+37.65%)
JlitespiderA lite distributed Java spider framework :-)
Stars: ✭ 151 (-38.87%)
91porn Crawler🌭💦 91porn爬虫在线API接口(永久有效) 及 在线web预览
Stars: ✭ 329 (+33.2%)
ScrapoxyScrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
Stars: ✭ 1,322 (+435.22%)
Dom CrawlerThe DomCrawler component eases DOM navigation for HTML and XML documents.
Stars: ✭ 3,499 (+1316.6%)
KtspeechcrawlerAutomatically constructing corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (-62.75%)
SupercrawlerA web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Stars: ✭ 306 (+23.89%)
Dxy Covid 19 Crawler2019新型冠状病毒疫情实时爬虫及API | COVID-19/2019-nCoV Realtime Infection Crawler and API
Stars: ✭ 1,865 (+655.06%)
ToapiEvery web site provides APIs.
Stars: ✭ 3,209 (+1199.19%)
DiskoverFile system crawler, disk space usage, file search engine and file system analytics powered by Elasticsearch
Stars: ✭ 977 (+295.55%)
Laravel Socialite Social OAuth Authentication for Laravel 5. drivers: facebook, github, google, linkedin, weibo, qq, wechat and douban
Stars: ✭ 296 (+19.84%)
Awesome crawl腾讯新闻、知乎话题、微博粉丝,Tumblr爬虫、斗鱼弹幕、妹子图爬虫、分布式设计等
Stars: ✭ 246 (-0.4%)
GhcrawlerCrawl GitHub APIs and store the discovered orgs, repos, commits, ...
Stars: ✭ 293 (+18.62%)
Weixin Spider微信公众号爬虫,公众号历史文章,文章评论,文章阅读及在看数据,可视化web页面,可部署于Windows服务器。基于Python3之flask/mysql/redis/mitmproxy/pywin32等实现,高效微信爬虫,微信公众号爬虫,历史文章,文章评论,数据更新。
Stars: ✭ 287 (+16.19%)
Rendoradynamic server-side rendering using headless Chrome to effortlessly solve the SEO problem for modern javascript websites
Stars: ✭ 1,853 (+650.2%)
Gospidergolang实现的爬虫框架,使用者只需关心页面规则,提供web管理界面。基于colly开发。
Stars: ✭ 285 (+15.38%)
Acm StatisticsAn online tool (crawler) to analyze users performance in online judges (coding competition websites). Supported OJ: POJ, HDU, ZOJ, HYSBZ, CodeForces, UVA, ICPC Live Archive, FZU, SPOJ, Timus (URAL), LeetCode_CN, CSU, LibreOJ, 洛谷, 牛客OJ, Lutece (UESTC), AtCoder, AIZU, CodeChef, El Judge, BNUOJ, Codewars, UOJ, NBUT, 51Nod, DMOJ, VJudge
Stars: ✭ 83 (-66.4%)
Crawlertutorial爬蟲極簡教學(fetch, parse, search, multiprocessing, API)- PTT 為例
Stars: ✭ 282 (+14.17%)
Goribot[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (-23.08%)
DotnetspiderDotnetSpider, a .NET standard web crawling library. It is lightweight, efficient and fast high-level web crawling & scraping framework
Stars: ✭ 3,233 (+1208.91%)
Work crawlerDownload comics novels 小说漫画下载工具 小説漫画のダウンローダ 小說漫畫下載:腾讯漫画 大角虫漫画 有妖气 知音漫客 咪咕 SF漫画 哦漫画 看漫画 漫画柜 汗汗酷漫 動漫伊甸園 快看漫画 微博动漫 733动漫网 大古漫画网 漫画DB 無限動漫 動漫狂 卡推漫画 动漫之家 动漫屋 古风漫画网 36漫画网 亲亲漫画网 乙女漫画 comico webtoons 咚漫 ニコニコ静画 ComicWalker ヤングエースUP モアイ pixivコミック サイコミ;アルファポリス カクヨム ハーメルン 小説家になろう 起点中文网 八一中文网 顶点小说 落霞小说网 努努书坊 笔趣阁→epub.
Stars: ✭ 1,224 (+395.55%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+12.15%)
Httpcode.core简单、易用、高效 一个有态度的开源.Net Http请求框架!可以用制作爬虫,api请求等等。
Stars: ✭ 146 (-40.89%)
SwiftlinkpreviewIt makes a preview from an URL, grabbing all the information such as title, relevant texts and images.
Stars: ✭ 1,216 (+392.31%)
ArachniWeb Application Security Scanner Framework
Stars: ✭ 2,942 (+1091.09%)
PhotobrowserPhotoBrowser is a light weight photo browser, like the wechat, weibo image viewer.
Stars: ✭ 211 (-14.57%)
Pigeon💬 一个轻量化的留言板 / 记事本 / 社交系统 / 博客。人类的本质是……咕咕咕?
Stars: ✭ 262 (+6.07%)
PoopakPOOPAK - TOR Hidden Service Crawler
Stars: ✭ 78 (-68.42%)
Happy Spiders🔧 🔩 🔨 收集整理了爬虫相关的工具、模拟登陆技术、代理IP、scrapy模板代码等内容。
Stars: ✭ 261 (+5.67%)
JavpyEnjoy driving on a Javascriptive (originally Pythonic) way to Japanese AV!
Stars: ✭ 147 (-40.49%)
Tumblr crawlerThis is a Multi-thread crawler for Tumblr.
Stars: ✭ 258 (+4.45%)
AnticrawlersolutionIt covers the blockade principle of most anti-climbing strategies and corresponding solutions.👽👽👽👽(涵盖了大部分的反爬策略的封锁原理以及对应的解决方案。)
Stars: ✭ 77 (-68.83%)
lightnovel epub🍭 epub generator for (light)novels (轻) 小说 epub 生成器,支持站点:轻之国度、轻小说文库
Stars: ✭ 89 (-63.97%)
Flutter hrlweiboFlutter仿微博客户端, 包含首页、视频、发现、消息(仿微博聊界面)及个人中心模块
Stars: ✭ 2,336 (+845.75%)
octopusRecursive and multi-threaded broken link checker
Stars: ✭ 19 (-92.31%)
NcrawlerWeb Crawler written in C#
Stars: ✭ 34 (-86.23%)
eastmoneypython requests + Django+ nodejs koa+ mysql to crawl eastmoney fund and stock data,for data analysis and visualiaztion .
Stars: ✭ 56 (-77.33%)
GoscraperGolang pkg to quickly return a preview of a webpage (title/description/images)
Stars: ✭ 72 (-70.85%)
rankr🇰🇷 Realtime integrated information analysis service
Stars: ✭ 21 (-91.5%)
Annie👾 Fast and simple video download library and CLI tool written in Go
Stars: ✭ 16,369 (+6527.13%)
indieweb-searchSource code for the IndieWeb search engine.
Stars: ✭ 16 (-93.52%)
Jd AutobuyPython爬虫,京东自动登录,在线抢购商品
Stars: ✭ 1,174 (+375.3%)
SocialLibrary微博分享、微信分享、qq分享,微信支付、支付宝支付 qq登录、微信登录、支付宝登录,直接引用官方提供api 安全省心
Stars: ✭ 61 (-75.3%)
Spiderpython crawler spider
Stars: ✭ 70 (-71.66%)
MonkeykingMonkeyKing helps you to post messages to Chinese Social Networks.
Stars: ✭ 2,699 (+992.71%)
Skrape.itA Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (-6.48%)
ArachnidCrawl all unique internal links found on a given website, and extract SEO related information - supports javascript based sites
Stars: ✭ 224 (-9.31%)
WoidSimple news aggregator displaying top stories in real time
Stars: ✭ 204 (-17.41%)
N2h4네이버 뉴스 수집을 위한 도구
Stars: ✭ 177 (-28.34%)
News Pleasenews-please - an integrated web crawler and information extractor for news that just works.
Stars: ✭ 969 (+292.31%)