douban-movieGet movie info from douban(豆瓣) and display in your terminal
Stars: ✭ 17 (-69.64%)
Videospider抓取豆瓣,bilibili等中的电视剧、电影、动漫演员等信息
Stars: ✭ 186 (+232.14%)
AbotCross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+3401.79%)
BaiduSpider项目已经移动至:https://github.com/BaiduSpider/BaiduSpider !! 一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
Stars: ✭ 29 (-48.21%)
iTop-CNiTop in chinese
Stars: ✭ 36 (-35.71%)
DeadPool该项目是一个使用celery作为主体框架的爬虫应用,能够灵活的添加爬虫任务,并且同时运行多站点的爬虫工作,所有组件都能够原生支持规模并发和分布式,加上celery原生的分布式调用,实现大规模并发。
Stars: ✭ 38 (-32.14%)
seenreqGenerate an object for testing if a request is sent, request is Mikeal's request.
Stars: ✭ 42 (-25%)
DataCLUEDataCLUE: 数据为中心的NLP基准和工具包
Stars: ✭ 133 (+137.5%)
weapp-poem诗词墨客 - 最全中华古诗词小程序
Stars: ✭ 409 (+630.36%)
where-is-douban250🐛 一个爬虫程序,整理了腾讯视频、爱奇艺、优酷、哔哩哔哩等视频网站中,能够观看的「豆瓣电影 Top250 榜单」影片。
Stars: ✭ 123 (+119.64%)
Free proxy pool对免费代理IP网站进行爬取,收集汇总为自己的代理池。关键是验证代理的有效性、匿名性、去重复
Stars: ✭ 66 (+17.86%)
gathertoolgathertool是golang脚本化开发库,目的是提高对应场景程序开发的效率;轻量级爬虫库,接口测试&压力测试库,DB操作库等。
Stars: ✭ 36 (-35.71%)
douban-openapi-serverA Douban API server that provides an unofficial APIs for media information gathering
Stars: ✭ 56 (+0%)
lwodfThe Chinese edition of Live Working or Die Fighting: How the Working Class Went Global (劳工的全球化), authored by Paul Mason, translated by the CNPolitics translation team.
Stars: ✭ 25 (-55.36%)
hsk-vocabulary🇨🇳Open source Chinese HSK vocabulary list with example sentences
Stars: ✭ 27 (-51.79%)
spiderpython 爬虫(amazon, confluence ...)
Stars: ✭ 21 (-62.5%)
hanUsing Tensorflow to train a model to detect miswritten Chinese characters.
Stars: ✭ 12 (-78.57%)
glyphhangerYour web font utility belt. It can subset web fonts. It can find unicode-ranges for you automatically. It makes julienne fries.
Stars: ✭ 422 (+653.57%)
sedeText-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data
Stars: ✭ 83 (+48.21%)
shuduShudu 為一個開源文字處理平台,目的是讓閱讀者能夠舒服的閱讀、編寫文案。
Stars: ✭ 25 (-55.36%)
break-the-ice-with-pythonThe repository is about 100+ python programming exercise problem discussed, explained, and solved in different ways
Stars: ✭ 2,165 (+3766.07%)
rasa bot整理:基于Rasa-NLU和Rasa-Core的任务型ChatBot
Stars: ✭ 51 (-8.93%)
dcard-spiderA spider on Dcard. Strong and speedy.
Stars: ✭ 91 (+62.5%)
Vanhiupun.github.io🏖️ Vanhiupun's Awesome Site ==> another theme for elegant writers with modern flat style and beautiful night/dark mode.
Stars: ✭ 57 (+1.79%)
fuzzychineseA small package to fuzzy match chinese words
Stars: ✭ 50 (-10.71%)
Soft-CHS用于存放一些自行汉化的小软件 1.madvr
Stars: ✭ 97 (+73.21%)
php-crawler🕷️ A simple crawler (spider) writen in php just for fun, with zero dependencies
Stars: ✭ 39 (-30.36%)
TikTokDownloader PyWebIO🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音|TikTok数据爬取工具,支持API调用,在线批量解析及下载。
Stars: ✭ 919 (+1541.07%)
crawlerdetectGolang module to detect bots and crawlers via the user agent
Stars: ✭ 22 (-60.71%)
robots.txt🤖 robots.txt as a service. Crawls robots.txt files, downloads and parses them to check rules through an API
Stars: ✭ 13 (-76.79%)
ime.vimA Vim input method engine
Stars: ✭ 74 (+32.14%)
ZSpider基于Electron爬虫程序
Stars: ✭ 37 (-33.93%)
PHP-ChinesePHP Chinese Conversion (中文繁簡轉換)
Stars: ✭ 37 (-33.93%)
word2vec-moviesBag of words meets bags of popcorn in Python 3 中文教程
Stars: ✭ 54 (-3.57%)
react-flashcardsA simple React + Firebase flashcard application
Stars: ✭ 29 (-48.21%)
pinyin data🐼 Easy to use and portable pronunciation data for Hanzi characters.
Stars: ✭ 13 (-76.79%)
TaobaoSpiderThis taobao spider has been archived
Stars: ✭ 28 (-50%)