KimMeen / Weibo Analyst
Licence: apache-2.0
Social media (Weibo) comments analyzing toolbox in Chinese 微博评论分析工具, 实现功能: 1.微博评论数据爬取; 2.分词与关键词提取; 3.词云与词频统计; 4.情感分析; 5.主题聚类
Stars: ✭ 430
Programming Languages
python
139335 projects - #7 most used programming language
Projects that are alternatives of or similar to Weibo Analyst
Decryptlogin
APIs for loginning some websites by using requests.
Stars: ✭ 1,861 (+332.79%)
Mutual labels: crawler, weibo
Sina Weibo Album Downloader
Multithreading download all HD photos / pictures from someone's Sina Weibo album.
Stars: ✭ 125 (-70.93%)
Mutual labels: crawler, weibo
Weibo Crawler
新浪微博爬虫,用python爬取新浪微博数据,并下载微博图片和微博视频
Stars: ✭ 1,019 (+136.98%)
Mutual labels: crawler, weibo
LinLP
使用Python进行自然语言处理相关实践,如新词发现,主题模型,隐马尔模型词性标注,Word2Vec,情感分析
Stars: ✭ 43 (-90%)
Mutual labels: sentiment-analysis, lda
Indonesian Nlp Resources
data resource untuk NLP bahasa indonesia
Stars: ✭ 143 (-66.74%)
Mutual labels: crawler, sentiment-analysis
Lxspider
爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》
Stars: ✭ 60 (-86.05%)
Mutual labels: crawler, weibo
amazon-reviews
Sentiment Analysis & Topic Modeling with Amazon Reviews
Stars: ✭ 26 (-93.95%)
Mutual labels: sentiment-analysis, lda
Weibo Topic Spider
微博超级话题爬虫,微博词频统计+情感分析+简单分类,新增肺炎超话爬取数据
Stars: ✭ 128 (-70.23%)
Mutual labels: crawler, weibo
Weibopicdownloader
免登录下载微博图片 爬虫 Download Weibo Images without Logging-in
Stars: ✭ 247 (-42.56%)
Mutual labels: crawler, weibo
WeiboCrawler
无cookie版微博爬虫,可以连续爬取一个或多个新浪微博用户信息、用户微博及其微博评论转发。
Stars: ✭ 45 (-89.53%)
Mutual labels: crawler, weibo
Weibo terminator workflow
Update Version of weibo_terminator, This is Workflow Version aim at Get Job Done!
Stars: ✭ 259 (-39.77%)
Mutual labels: crawler, sentiment-analysis
Deep Learning V2 Pytorch
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101
Stars: ✭ 4,457 (+936.51%)
Mutual labels: sentiment-analysis
About This Repository
Please follow the restrictions in Apache-2.0 Licence before you refer to this repository.
@Ming Jin ([email protected])
This repository is a simple NLP project for beginners and will be updated occasionally.
Environment: Python3.6
Pull requests are always welcome!
To-do list:
- Sentiment analysis can be done by using LSTM.
- Crawler can be improved based on some libraries like BeautifulSoup.
Introduction
这是一个微博评论分析工具,实现功能主要有:
- 微博评论数据爬取
- 分词与关键词提取
- 词云与词频统计
- 情感分析
- 主题聚类
正常状态下实现效果在: “ 案例:泰国大象踩踏伤人事件 ” 注意:案例中最后表格需要自己根据LDA结果进行统计
This is a Weibo comments processing toolbox, which has been implemented for:
- Weibo comments crawler that based on regular expression
- Tokenization, filtration and key words extraction
- Words cloud and visualization
- Sentiment analysis
- Topic clustering that based on LDA
Pre-Requirements Checklist
MySQL is required (e.g., You may use MySQL Workbench)
- importlib
- sys
- time
- requests
- lxml
- pymysql
- jieba
- PIL
- wordcloud
- snownlp
- logging
- configparser
- random
- codecs
Citing The Code
BibTex reference format:
@misc{WeiboAnalyst,
title={Weibo-Analyst: An Open-Source Python Library for Social Media Comments Analysis},
url={https://github.com/KimMeen/Weibo-Analyst},
author={Ming Jin},
year={2018}
}
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].