All Projects → KimMeen → Weibo Analyst

KimMeen / Weibo Analyst

Licence: apache-2.0
Social media (Weibo) comments analyzing toolbox in Chinese 微博评论分析工具, 实现功能: 1.微博评论数据爬取; 2.分词与关键词提取; 3.词云与词频统计; 4.情感分析; 5.主题聚类

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Weibo Analyst

Decryptlogin
APIs for loginning some websites by using requests.
Stars: ✭ 1,861 (+332.79%)
Mutual labels:  crawler, weibo
Weibo wordcloud
根据关键词抓取微博数据,再生成词云
Stars: ✭ 154 (-64.19%)
Mutual labels:  crawler, weibo
Sina Weibo Album Downloader
Multithreading download all HD photos / pictures from someone's Sina Weibo album.
Stars: ✭ 125 (-70.93%)
Mutual labels:  crawler, weibo
Weibo Crawler
新浪微博爬虫,用python爬取新浪微博数据,并下载微博图片和微博视频
Stars: ✭ 1,019 (+136.98%)
Mutual labels:  crawler, weibo
LinLP
使用Python进行自然语言处理相关实践,如新词发现,主题模型,隐马尔模型词性标注,Word2Vec,情感分析
Stars: ✭ 43 (-90%)
Mutual labels:  sentiment-analysis, lda
Weibo Album Crawler
新浪微博相册大图多线程爬虫。
Stars: ✭ 83 (-80.7%)
Mutual labels:  crawler, weibo
Indonesian Nlp Resources
data resource untuk NLP bahasa indonesia
Stars: ✭ 143 (-66.74%)
Mutual labels:  crawler, sentiment-analysis
Lxspider
爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》
Stars: ✭ 60 (-86.05%)
Mutual labels:  crawler, weibo
amazon-reviews
Sentiment Analysis & Topic Modeling with Amazon Reviews
Stars: ✭ 26 (-93.95%)
Mutual labels:  sentiment-analysis, lda
SentimentAnalysis
基于新浪微博数据的情感极性分析
Stars: ✭ 43 (-90%)
Mutual labels:  sentiment-analysis, weibo
Weibo Topic Spider
微博超级话题爬虫,微博词频统计+情感分析+简单分类,新增肺炎超话爬取数据
Stars: ✭ 128 (-70.23%)
Mutual labels:  crawler, weibo
weibo-scraper
Simple Weibo Scraper
Stars: ✭ 50 (-88.37%)
Mutual labels:  crawler, weibo
Weibopicdownloader
免登录下载微博图片 爬虫 Download Weibo Images without Logging-in
Stars: ✭ 247 (-42.56%)
Mutual labels:  crawler, weibo
WeiboCrawler
无cookie版微博爬虫,可以连续爬取一个或多个新浪微博用户信息、用户微博及其微博评论转发。
Stars: ✭ 45 (-89.53%)
Mutual labels:  crawler, weibo
Weibo terminator workflow
Update Version of weibo_terminator, This is Workflow Version aim at Get Job Done!
Stars: ✭ 259 (-39.77%)
Mutual labels:  crawler, sentiment-analysis
Netease Music Cracker
🎵 将可下载的网易云音乐的缓存文件转换为 MP3 文件
Stars: ✭ 373 (-13.26%)
Mutual labels:  crawler
Newpipeextractor
Core part of NewPipe
Stars: ✭ 400 (-6.98%)
Mutual labels:  crawler
Spider Flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (-15.12%)
Mutual labels:  crawler
Weibospider
新浪微博爬虫,用python爬取新浪微博数据
Stars: ✭ 4,861 (+1030.47%)
Mutual labels:  weibo
Deep Learning V2 Pytorch
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101
Stars: ✭ 4,457 (+936.51%)
Mutual labels:  sentiment-analysis

About This Repository

Please follow the restrictions in Apache-2.0 Licence before you refer to this repository.

@Ming Jin ([email protected])

This repository is a simple NLP project for beginners and will be updated occasionally.

Environment: Python3.6

Pull requests are always welcome!

To-do list:

  1. Sentiment analysis can be done by using LSTM.
  2. Crawler can be improved based on some libraries like BeautifulSoup.

Introduction

这是一个微博评论分析工具,实现功能主要有:

  1. 微博评论数据爬取
  2. 分词与关键词提取
  3. 词云与词频统计
  4. 情感分析
  5. 主题聚类

正常状态下实现效果在: “ 案例:泰国大象踩踏伤人事件 ” 注意:案例中最后表格需要自己根据LDA结果进行统计

This is a Weibo comments processing toolbox, which has been implemented for:

  1. Weibo comments crawler that based on regular expression
  2. Tokenization, filtration and key words extraction
  3. Words cloud and visualization
  4. Sentiment analysis
  5. Topic clustering that based on LDA

Pre-Requirements Checklist

MySQL is required (e.g., You may use MySQL Workbench)

  1. importlib
  2. sys
  3. time
  4. requests
  5. lxml
  6. pymysql
  7. jieba
  8. PIL
  9. wordcloud
  10. snownlp
  11. logging
  12. configparser
  13. random
  14. codecs

Citing The Code

BibTex reference format:

@misc{WeiboAnalyst,
title={Weibo-Analyst: An Open-Source Python Library for Social Media Comments Analysis},
url={https://github.com/KimMeen/Weibo-Analyst},
author={Ming Jin},
year={2018}
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].