Dom CrawlerThe DomCrawler component eases DOM navigation for HTML and XML documents.
Stars: ✭ 3,499 (+612.63%)
JivesearchA search engine that doesn't track you.
Stars: ✭ 364 (-25.87%)
Zhihu Login知乎模拟登录,支持提取验证码和保存 Cookies
Stars: ✭ 340 (-30.75%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-43.58%)
Signature algorithm各种App、小程序、网站的请求签名或加密算法。 现已有:自如、小红书、蛋壳公寓、luckin coffee(瑞幸咖啡)、bangkokair(曼谷航空)
Stars: ✭ 380 (-22.61%)
ToapiEvery web site provides APIs.
Stars: ✭ 3,209 (+553.56%)
Comicbook本项目不再维护,详情可加群了解 https://t.me/onecomicbook
Stars: ✭ 429 (-12.63%)
Crawlertutorial爬蟲極簡教學(fetch, parse, search, multiprocessing, API)- PTT 為例
Stars: ✭ 282 (-42.57%)
InstagramcrawlerA non API python program to crawl public photos, posts or followers
Stars: ✭ 349 (-28.92%)
Ttbot今日头条机器人,支持用户登陆、关注、取消关注、获取关注粉丝、发文、发悟空问答、点赞、评论、采集各种类型新闻讯息等,使用今日头条网页版API实现
Stars: ✭ 338 (-31.16%)
Mmjpg👩 美女写真套图爬虫(一)
Stars: ✭ 398 (-18.94%)
91porn Crawler🌭💦 91porn爬虫在线API接口(永久有效) 及 在线web预览
Stars: ✭ 329 (-32.99%)
Weibo AnalystSocial media (Weibo) comments analyzing toolbox in Chinese 微博评论分析工具, 实现功能: 1.微博评论数据爬取; 2.分词与关键词提取; 3.词云与词频统计; 4.情感分析; 5.主题聚类
Stars: ✭ 430 (-12.42%)
ScyllaIntelligent proxy pool for Humans™ (Maintainer needed)
Stars: ✭ 3,409 (+594.3%)
Hquery.phpAn extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Stars: ✭ 295 (-39.92%)
Sasila一个灵活、友好的爬虫框架
Stars: ✭ 286 (-41.75%)
Fictiondown小说下载|小说爬取|起点|笔趣阁|导出Markdown|导出txt|转换epub|广告过滤|自动校对
Stars: ✭ 362 (-26.27%)
DotnetspiderDotnetSpider, a .NET standard web crawling library. It is lightweight, efficient and fast high-level web crawling & scraping framework
Stars: ✭ 3,233 (+558.45%)
DotcommonWhat do people have in their dotfiles?
Stars: ✭ 418 (-14.87%)
RcrawlerAn R web crawler and scraper
Stars: ✭ 274 (-44.2%)
ScavengerCrawler (Bot) searching for credential leaks on different paste sites.
Stars: ✭ 347 (-29.33%)
Xcrawler快速、简洁且强大的PHP爬虫框架
Stars: ✭ 344 (-29.94%)
Bt Btt磁力網站U3C3介紹以及域名更新
Stars: ✭ 261 (-46.84%)
GosintOSINT Swiss Army Knife
Stars: ✭ 401 (-18.33%)
91porn Api🌭💦 91porn爬虫在线无限制API接口(永久有效,口令每日更新) 及 在线web预览
Stars: ✭ 341 (-30.55%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+730.35%)
Bilili🍻 bilibili video (including bangumi) and danmaku downloader | B站视频(含番剧)、弹幕下载器
Stars: ✭ 379 (-22.81%)
Tsec台灣上市上櫃股票爬蟲 Taiwan Stock Exchange Crawler
Stars: ✭ 327 (-33.4%)
ScrapedinLinkedIn Scraper (currently working 2020)
Stars: ✭ 453 (-7.74%)
Iclr2019 OpenreviewdataScript that crawls meta data from ICLR OpenReview webpage. Tutorials on installing and using Selenium and ChromeDriver on Ubuntu.
Stars: ✭ 376 (-23.42%)
SupercrawlerA web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Stars: ✭ 306 (-37.68%)
Runoob Pdf爬取菜鸟教程网站并转PDF__python_crawer_by_chrome
Stars: ✭ 430 (-12.42%)
Go DorkThe fastest dork scanner written in Go.
Stars: ✭ 274 (-44.2%)
Spider Flow新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (-25.66%)
GhcrawlerCrawl GitHub APIs and store the discovered orgs, repos, commits, ...
Stars: ✭ 293 (-40.33%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (-5.5%)
Weixin Spider微信公众号爬虫,公众号历史文章,文章评论,文章阅读及在看数据,可视化web页面,可部署于Windows服务器。基于Python3之flask/mysql/redis/mitmproxy/pywin32等实现,高效微信爬虫,微信公众号爬虫,历史文章,文章评论,数据更新。
Stars: ✭ 287 (-41.55%)
Webstera reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (-25.87%)
Gospidergolang实现的爬虫框架,使用者只需关心页面规则,提供web管理界面。基于colly开发。
Stars: ✭ 285 (-41.96%)
Iclr2020 OpenreviewdataScript that crawls meta data from ICLR OpenReview webpage. Tutorials on installing and using Selenium and ChromeDriver on Ubuntu.
Stars: ✭ 426 (-13.24%)
Tsrtc台灣股票即時爬蟲。Taiwan Stock Exchange Real Time Crawler
Stars: ✭ 359 (-26.88%)
Hacker News Digest📰 A responsive interface of Hacker News with summaries and thumbnails.
Stars: ✭ 278 (-43.38%)
Freshonions TorscraperFresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (-29.12%)
ArachniWeb Application Security Scanner Framework
Stars: ✭ 2,942 (+499.19%)
OpensearchserverOpen-source Enterprise Grade Search Engine Software
Stars: ✭ 408 (-16.9%)
Vaultswiss army knife for hackers
Stars: ✭ 346 (-29.53%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+885.13%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-10.39%)