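Qqmusicspider: A Scrapy-based QQ Music spider that crawls song information, lyrics, and featured comments, and shares a music corpus of more than 490,000 entries from the top 6,400 mainland Chinese and Hong Kong/Taiwan artists on QQ Music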
Stars: ✭ 120 (-46.43%)
Photon: Incredibly fast crawler designed for OSINT.
Stars: ✭ 8,332 (+3619.64%)
Crawlab: Distributed web crawler admin platform for managing spiders, regardless of language or framework.
Stars: ✭ 8,392 (+3646.43%)
Htmlsql: htmlSQL is an experimental PHP library that lets you access HTML values with an SQL-like syntax.
Stars: ✭ 120 (-46.43%)
Lizard: 💐 Full Amazon Automatic Download
Stars: ✭ 41 (-81.7%)
Accelerated Mobile Pages: Automatically add Accelerated Mobile Pages (AMP Project) functionality to your WordPress site.
Stars: ✭ 167 (-25.45%)
Serp: Google Search SERP Scraper
Stars: ✭ 40 (-82.14%)
Awesome Puppeteer: A curated list of awesome puppeteer resources.
Stars: ✭ 1,728 (+671.43%)
Pychromeless: Python Lambda Chrome Automation (naming pending)
Stars: ✭ 219 (-2.23%)
Pixivcrawleriii: A Python 3 crawler for the Pixiv top rankings and for all artworks by any illustrator
Stars: ✭ 38 (-83.04%)
Tiebamanager: (Abandoned) Moderation tool for Baidu Tieba forums that automatically scans posts and handles rule-violating ones
Stars: ✭ 119 (-46.87%)
Dirhunt: Find web directories without bruteforce
Stars: ✭ 983 (+338.84%)
Gocrawl: Polite, slim and concurrent web crawler.
Stars: ✭ 1,962 (+775.89%)
Schannel Qt5: A GUI client for schannel powered by therecipe/qt and golang
Stars: ✭ 36 (-83.93%)
Jvppeteer: Headless Chrome for Java (Java crawler)
Stars: ✭ 193 (-13.84%)
Ncrawler: Web Crawler written in C#
Stars: ✭ 34 (-84.82%)
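Docs: Source code for "Data Collection: From Getting Started to Giving Up". Contents: introduction to crawlers, the job market, and crawler engineer interview questions; introduction to the HTTP protocol; using Requests; introduction to the XPath parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing a Docker cluster with Nomad; querying Docker logs with EFK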
Stars: ✭ 118 (-47.32%)
Nodespider: [DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-85.27%)
Moodle Downloader 2: A Moodle downloader that quickly downloads course content from Moodle (e.g. lecture PDFs)
Stars: ✭ 118 (-47.32%)
Pypatent: Search for and retrieve US Patent and Trademark Office Patent Data
Stars: ✭ 31 (-86.16%)
Media Scraper: Scrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok
Stars: ✭ 206 (-8.04%)
Decryptlogin: APIs for logging in to some websites using requests.
Stars: ✭ 1,861 (+730.8%)
Requests Html: Pythonic HTML Parsing for Humans™
Stars: ✭ 12,268 (+5376.79%)
Webedge: Bringing Edge to your Web Performance ✨💥
Stars: ✭ 21 (-90.62%)
Baiducrawler: Sample of using proxies to crawl Baidu search results.
Stars: ✭ 116 (-48.21%)
Seo Manager: SEO Manager package for Laravel (with localization)
Stars: ✭ 192 (-14.29%)
Email Extractor: The main functionality is to extract all the emails from one or several URLs
Stars: ✭ 81 (-63.84%)
Pypergrabber: Fetches PubMed article IDs (PMIDs) from an email inbox, then crawls PubMed, Google Scholar and Sci-Hub for the respective PDF files.
Stars: ✭ 14 (-93.75%)
Memex Explorer: Viewers for statistics and dashboarding of Domain Search Engine data
Stars: ✭ 115 (-48.66%)
Axegrinder: Crawl websites for accessibility issues from the command line.
Stars: ✭ 12 (-94.64%)
Downzemall: DownZemAll! is a download manager for Windows, macOS and Linux
Stars: ✭ 157 (-29.91%)
Ccrawl: Simple CORPORA list crawler
Stars: ✭ 11 (-95.09%)
Jianso movie: 🎬 Movie resource crawler and movie image scraping script; Flask|Nginx|wsgi
Stars: ✭ 114 (-49.11%)
Web Launch Checklist: 📋 A simple website launch checklist to keep track of the most important enrichment possibilities for a website.
Stars: ✭ 214 (-4.46%)
Maintenance: Site maintenance SEO PSR-15 middleware
Stars: ✭ 8 (-96.43%)
Pic Gather: [Closed] 🎨 Image collector that supports custom acquisition source configuration and is compatible with macOS and Windows.
Stars: ✭ 842 (+275.89%)
Instagram Scraper: Scrapes media, likes, followers, tags and all metadata. Inspired by instagram-php-scraper, bot
Stars: ✭ 2,209 (+886.16%)
Scrapit: Scraping scripts for various websites.
Stars: ✭ 25 (-88.84%)
Anime Dl: Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (-15.18%)
Querylist: 🕷️ The progressive PHP crawler framework! An elegant, progressive PHP scraping framework.
Stars: ✭ 2,392 (+967.86%)
Navi: 🧭 Declarative, asynchronous routing for React.
Stars: ✭ 2,069 (+823.66%)
Search: An Open Source Search Engine
Stars: ✭ 139 (-37.95%)
Gatsby Advanced Starter: A high performance skeleton starter for GatsbyJS that focuses on SEO/Social features/development environment.
Stars: ✭ 1,224 (+446.43%)
Work crawler: Download tool for comics and novels: 腾讯漫画 大角虫漫画 有妖气 知音漫客 咪咕 SF漫画 哦漫画 看漫画 漫画柜 汗汗酷漫 動漫伊甸園 快看漫画 微博动漫 733动漫网 大古漫画网 漫画DB 無限動漫 動漫狂 卡推漫画 动漫之家 动漫屋 古风漫画网 36漫画网 亲亲漫画网 乙女漫画 comico webtoons 咚漫 ニコニコ静画 ComicWalker ヤングエースUP モアイ pixivコミック サイコミ;アルファポリス カクヨム ハーメルン 小説家になろう 起点中文网 八一中文网 顶点小说 落霞小说网 努努书坊 笔趣阁 → epub.
Stars: ✭ 1,224 (+446.43%)
Vue Seo Prerender: Vue.js Tutorial: A Prerendered, SEO-Friendly Example
Stars: ✭ 139 (-37.95%)
Jekyll Seo Tag: A Jekyll plugin to add metadata tags for search engines and social networks to better index and display your site's content.
Stars: ✭ 1,226 (+447.32%)
Wombat: Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
Stars: ✭ 1,220 (+444.64%)