All Projects → juzeon → advanced-php-crawler

juzeon / advanced-php-crawler

Licence: other
新浪博客文章/wenku8轻小说文库爬虫,可抓取图片保存,一键制作电子书。kindle读书党的神器!

Programming Languages

PHP
23972 projects - #3 most used programming language

Projects that are alternatives of or similar to advanced-php-crawler

mpspider
公众号文章抓取&生成kindle电子书
Stars: ✭ 51 (+96.15%)
Mutual labels:  gitbook, kindle
Calibre
The official source code repository for the calibre ebook manager
Stars: ✭ 11,221 (+43057.69%)
Mutual labels:  kindle, calibre
calibre-kindle-comics
A calibre plugin that converts your comics into a readable format for kindle.
Stars: ✭ 32 (+23.08%)
Mutual labels:  kindle, calibre
Pwning Juice Shop
GitBook markdown content for the eBook "Pwning OWASP Juice Shop"
Stars: ✭ 110 (+323.08%)
Mutual labels:  gitbook, kindle
rolling-news
获取滚动新闻
Stars: ✭ 44 (+69.23%)
Mutual labels:  sina
how-to-build-the-gitbook-with-gitbook-cli
如何藉由 gitbook-cli 來建置自己的 GitBook。
Stars: ✭ 26 (+0%)
Mutual labels:  gitbook
doujinshi metadata plugins
the calibre metadata plugins for doujinshi
Stars: ✭ 55 (+111.54%)
Mutual labels:  calibre
projects
A website that showcases interesting projects, using Angular JS.
Stars: ✭ 106 (+307.69%)
Mutual labels:  gitbook
acala-wiki
Acala GitBook Wiki, https://wiki.acala.network
Stars: ✭ 22 (-15.38%)
Mutual labels:  gitbook
CalibreLibgenStore
A Libgen Fiction store plugin for Calibre
Stars: ✭ 117 (+350%)
Mutual labels:  calibre
docs-archive
Contao Documentation
Stars: ✭ 44 (+69.23%)
Mutual labels:  gitbook
node-calibre
📖 A Node wrapper for Calibre's command line tools.
Stars: ✭ 21 (-19.23%)
Mutual labels:  calibre
saika
✍️Effortless info-site, done right.
Stars: ✭ 19 (-26.92%)
Mutual labels:  gitbook
teslaapi.io
Unofficial Tesla API Documentation
Stars: ✭ 29 (+11.54%)
Mutual labels:  gitbook
BooGi
Generate GitBook-like modern docs/tutorial websites using Gatsby
Stars: ✭ 117 (+350%)
Mutual labels:  gitbook
anyfesto
Low cost Raspberry Pi /Linux based access point with audio, education and communications local content server. Inspired by the ideas of sharing with others. Anyfesto - a platform from which to speak.
Stars: ✭ 66 (+153.85%)
Mutual labels:  calibre
dao
Documentation for Liker Land and LikeCoin.
Stars: ✭ 20 (-23.08%)
Mutual labels:  gitbook
ebook-converter-bot
Convert ebooks with few clicks on Telegram!
Stars: ✭ 47 (+80.77%)
Mutual labels:  calibre
Kindle2Anki
Export Vocabulary Builder to Anki APKG
Stars: ✭ 84 (+223.08%)
Mutual labels:  kindle
mobi
python based software to unpack kindlegen generated ebooks
Stars: ✭ 37 (+42.31%)
Mutual labels:  kindle

advanced-php-crawler

新浪博客/FimFiction/wenku8轻小说文库 全能爬虫

这套PHP编写的小程序可以帮助喜欢在电子书阅读器上看新浪博客上文章的你,它可以根据已知的文章列表来爬行,亦可以根据已知的文章目录来爬行——你只需要将URL写入一个文本文件,接着调用程序即可。当然,你可以使用#符号作为注释,与bash类似。而且,它生成的是gitbook的标准格式,可以用gitbook/calibre工具自动生成多种格式(mobi/epub/pdf)的电子书。文章细节均已自动优化,爬行图片保存到本地,也生成封面,且使用MarkDown格式,只为带给你完美的阅读体验!

简明教程

淀粉月刊撰写的本程序简明教程:https://dfkan.com/1635.html

文件功能详解:

需要PHP5以上版本,Windows用户可安装phpstudy

wenku8.php

它用于抓取wenku8.net(轻小说文库)的全本小说,生成分卷章节,打包mobi/epub电子书。

输入wenku8.net的BookID,抓取并生成电子书。

使用方法:

php wenku8.php

命令示例:

php wenku8.php 1538

#即为网址 https://www.wenku8.net/book/1538.htm

sina-list.php

它用于爬行像这样子的博客文章目录:

新浪博客爬虫-列表爬虫

-可以集合已知文章目录(/s/articlelist*)里面的文章列表

使用方法:

php sina-list.php <网址文件>

参数解释:

<网址文件>:一行一个网址,请使用电脑版访问后复制

命令示例:

php sina-list.php urls.txt

网址文件示例:

http://blog.sina.com.cn/s/articlelist_123456wsla.html #我是注释

http://blog.sina.com.cn/s/articlelist_789456wsex.html

sina-article.php

它用于爬行像这样子的具体文章:

新浪博客爬虫-文章爬虫

-可以提取已知文章页面(/s/blog*)里面的文章

使用方法:

php sina-article.php <网址文件>

参数解释:

<网址文件>:一行一个网址,请使用电脑版访问后复制

命令示例:

php sina-article.php urls.txt

网址文件示例:

http://blog.sina.com.cn/s/blog_123456wsla.html

http://blog.sina.com.cn/s/blog_789456wsex.html

tool-rev.php

它用于把上面说的网址文件前前后后颠倒过来

新浪博客爬虫-网址文件反转工具

-将某个网址文件里面的url全部反转过来,可用于处理新旧文章顺序等

使用方法: php tool-rev.php <网址文件>

参数解释:

<网址文件>:一行一个网址,请使用电脑版访问后复制

命令示例:

php tool-rev.php urls.txt

网址文件示例:

http://blog.sina.com.cn/s/xxx_123456wsla.html

http://blog.sina.com.cn/s/xxx_789456wsex.html

fimfic.php

这个脚本专门用于处理FimFiction的故事,同时包括抓取图片、调用彩云小译API翻译正文为中英双语对照格式。

使用方法:php fimfic.php <Story网址>

命令示例:php fimfic.php https://www.fimfiction.net/story/318771/earth-without-us

附录

Gitbook安装全解

Gitbook+Calibre安装及使用

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].