Dom Crawler: The DomCrawler component eases DOM navigation for HTML and XML documents.
Stars: ✭ 3,499 (+594.25%)
Atom Languageclient: Language Server Protocol support for Atom (the basis of Atom-IDE)
Stars: ✭ 385 (-23.61%)
Instagramcrawler: A non-API Python program to crawl public photos, posts or followers
Stars: ✭ 349 (-30.75%)
Opensearchserver: Open-source Enterprise Grade Search Engine Software
Stars: ✭ 408 (-19.05%)
Zhihu Login: Simulated Zhihu login, with support for captcha extraction and saving cookies
Stars: ✭ 340 (-32.54%)
Crawly: Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-12.7%)
Toapi: Every web site provides APIs.
Stars: ✭ 3,209 (+536.71%)
Fictiondown: Novel download | novel crawling | Qidian | Biquge | export to Markdown | export to txt | convert to epub | ad filtering | automatic proofreading
Stars: ✭ 362 (-28.17%)
Gospider: A crawler framework implemented in Go. Users only need to care about page rules, and a web management interface is provided. Built on colly.
Stars: ✭ 285 (-43.45%)
Vs Streamjsonrpc: The StreamJsonRpc library offers JSON-RPC 2.0 over any .NET Stream, WebSocket, or Pipe, with bonus support for request cancellation, client proxy generation, and more.
Stars: ✭ 421 (-16.47%)
Scavenger: Crawler (Bot) searching for credential leaks on different paste sites.
Stars: ✭ 347 (-31.15%)
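Ttbot: A Toutiao (今日头条) bot supporting user login, follow, unfollow, fetching followings and followers, posting articles, posting Wukong Q&A, liking, commenting, collecting all kinds of news, and more; implemented with the Toutiao web API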
Stars: ✭ 338 (-32.94%)
Gosint: OSINT Swiss Army Knife
Stars: ✭ 401 (-20.44%)
91porn Crawler: 🌭💦 Online API for the 91porn crawler (permanently valid) and online web preview
Stars: ✭ 329 (-34.72%)
Scrapple: A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (-7.94%)
Scylla: Intelligent proxy pool for Humans™ (Maintainer needed)
Stars: ✭ 3,409 (+576.39%)
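Signature algorithm: Request signing or encryption algorithms for various apps, mini programs, and websites. Currently covers: Ziroom (自如), Xiaohongshu (小红书), Danke Apartment (蛋壳公寓), luckin coffee, and bangkokair (Bangkok Airways)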
Stars: ✭ 380 (-24.6%)
Hquery.php: An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Stars: ✭ 295 (-41.47%)
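Weibo Analyst: A toolbox for analyzing Chinese-language social media (Weibo) comments. Features: 1. crawling Weibo comment data; 2. word segmentation and keyword extraction; 3. word clouds and word-frequency statistics; 4. sentiment analysis; 5. topic clustering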
Stars: ✭ 430 (-14.68%)
Joyrpc: A high-performance, highly extensible Java RPC framework.
Stars: ✭ 290 (-42.46%)
Hyperf: 🚀 A coroutine framework that focuses on hyperspeed and flexibility. Building microservice or middleware with ease.
Stars: ✭ 4,206 (+734.52%)
Webster: A reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (-27.78%)
Crawlertutorial: A minimal crawler tutorial (fetch, parse, search, multiprocessing, API), using PTT as the example
Stars: ✭ 282 (-44.05%)
Iclr2020 Openreviewdata: Script that crawls metadata from the ICLR OpenReview webpage. Tutorials on installing and using Selenium and ChromeDriver on Ubuntu.
Stars: ✭ 426 (-15.48%)
Tsrtc: Taiwan Stock Exchange Real Time Crawler
Stars: ✭ 359 (-28.77%)
Scrapedin: LinkedIn Scraper (currently working 2020)
Stars: ✭ 453 (-10.12%)
Freshonions Torscraper: Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (-30.95%)
Dotcommon: What do people have in their dotfiles?
Stars: ✭ 418 (-17.06%)
Vault: Swiss army knife for hackers
Stars: ✭ 346 (-31.35%)
Ferret: Declarative web scraping
Stars: ✭ 4,837 (+859.72%)
Xcrawler: A fast, concise, and powerful PHP crawler framework
Stars: ✭ 344 (-31.75%)
91porn Api: 🌭💦 Unrestricted online API for the 91porn crawler (permanently valid, passphrase updated daily) and online web preview
Stars: ✭ 341 (-32.34%)
Autoscraper: A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+708.93%)
Mmjpg: 👩 Crawler for photo sets of beautiful women (part 1)
Stars: ✭ 398 (-21.03%)
Tsec: Taiwan Stock Exchange Crawler (listed and OTC stocks)
Stars: ✭ 327 (-35.12%)
Awesome Crawler: A collection of awesome web crawlers and spiders in different languages
Stars: ✭ 4,793 (+850.99%)
Bilili: 🍻 bilibili video (including bangumi) and danmaku downloader
Stars: ✭ 379 (-24.8%)
Supercrawler: A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Stars: ✭ 306 (-39.29%)
Go Dork: The fastest dork scanner written in Go.
Stars: ✭ 274 (-45.63%)
Iclr2019 Openreviewdata: Script that crawls metadata from the ICLR OpenReview webpage. Tutorials on installing and using Selenium and ChromeDriver on Ubuntu.
Stars: ✭ 376 (-25.4%)
Ghcrawler: Crawl GitHub APIs and store the discovered orgs, repos, commits, ...
Stars: ✭ 293 (-41.87%)
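Weixin Spider: A crawler for WeChat official accounts that collects historical articles, article comments, and article read and "Wow" (在看) counts, with a visual web page; deployable on Windows servers. Implemented in Python 3 with flask/mysql/redis/mitmproxy/pywin32 and others. Efficient WeChat crawler, WeChat official account crawler, historical articles, article comments, data updates.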
Stars: ✭ 287 (-43.06%)
Spider Flow: A next-generation crawler platform that defines crawl flows graphically, so a crawler can be built without writing code.
Stars: ✭ 365 (-27.58%)
Sasila: A flexible, friendly crawler framework
Stars: ✭ 286 (-43.25%)
Runoob Pdf: Crawls the Runoob (菜鸟教程) tutorial site and converts it to PDF__python_crawer_by_chrome
Stars: ✭ 430 (-14.68%)
Jivesearch: A search engine that doesn't track you.
Stars: ✭ 364 (-27.78%)
Scan T: A new Python-based crawler with additional functions, including network fingerprint search
Stars: ✭ 504 (+0%)
News feed: 🐨 Real-time monitoring of news from 1,000 Chinese companies
Stars: ✭ 491 (-2.58%)
Web3swift: Elegant Web3js functionality in Swift. Native ABI parsing and smart contract interactions on Ethereum network.
Stars: ✭ 462 (-8.33%)
Comicbook: This project is no longer maintained; join the group for details: https://t.me/onecomicbook
Stars: ✭ 429 (-14.88%)
Json Rpc: 🔁 JSON-RPC 1/2 transport implementation. Supports Python 2/3 and PyPy.
Stars: ✭ 359 (-28.77%)