
CarryChang / Real_Time_DataMining_Software

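License: Apache-2.0
Real-time review mining software for Ctrip and Zhenguo Minsu (榛果民宿) homestays. It is a comprehensive demo covering real-time data collection, data cleaning, structured storage, UGC topic extraction, sentiment analysis, and visualization of the post-structured results. The project mines opinions from the UGC of online homestay platforms, combining data mining with NLP processing, and handles data collection, topic extraction, sentiment analysis, and related tasks. Its main goal is to overcome inconsistencies between users' ratings and their comments: it evaluates satisfaction with online homestays on Ctrip and Meituan in real time, visualizes additional data, and mines and visualizes online UGC along multiple dimensions. A demo video is available via the link.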

Programming Languages

python

Projects that are alternatives of or similar to Real Time DataMining Software

sentiment.datalogue
Sentiment analysis challenge for Datalogue recruiting
Stars: ✭ 14 (-67.44%)
Mutual labels:  sentiment-analysis
GroupDocs.Classification-for-.NET
GroupDocs.Classification-for-.NET samples and showcase (text and documents classification and sentiment analysis)
Stars: ✭ 38 (-11.63%)
Mutual labels:  sentiment-analysis
sentimentAnalysisLab
This lab is about how to add the AI and ML cloud service feature to your web application with React and the Amplify Framework.
Stars: ✭ 78 (+81.4%)
Mutual labels:  sentiment-analysis
sentiment-analysis-webapp
[No longer maintained] A web app for sentiment analysis of short Chinese texts
Stars: ✭ 33 (-23.26%)
Mutual labels:  sentiment-analysis
wink-nlp
Developer friendly Natural Language Processing ✨
Stars: ✭ 312 (+625.58%)
Mutual labels:  sentiment-analysis
arabic-sentiment-analysis
Sentiment Analysis in Arabic tweets
Stars: ✭ 64 (+48.84%)
Mutual labels:  sentiment-analysis
LSTM-sentiment-analysis
LSTM sentiment analysis. Please see my other repo for the SVM and Naive algorithm
Stars: ✭ 19 (-55.81%)
Mutual labels:  sentiment-analysis
pandas twitter
Analyzing Trump's tweets using Python (Pandas + Twitter workshop)
Stars: ✭ 81 (+88.37%)
Mutual labels:  sentiment-analysis
sentiment-analysis-using-python
Large Data Analysis Course Project
Stars: ✭ 23 (-46.51%)
Mutual labels:  sentiment-analysis
TLA
A comprehensive tool for linguistic analysis of communities
Stars: ✭ 47 (+9.3%)
Mutual labels:  sentiment-analysis
stansent
No description or website provided.
Stars: ✭ 16 (-62.79%)
Mutual labels:  sentiment-analysis
node-alchemy
An Alchemy API library for Node.JS
Stars: ✭ 54 (+25.58%)
Mutual labels:  sentiment-analysis
tf-sentiment-docker
A docker image for sentiment analysis on tensorflow
Stars: ✭ 15 (-65.12%)
Mutual labels:  sentiment-analysis
sentiment-analysis2
Sentiment analysis in keras and mxnet
Stars: ✭ 37 (-13.95%)
Mutual labels:  sentiment-analysis
Emotion and Polarity SO
An emotion classifier of text containing technical content from the SE domain
Stars: ✭ 74 (+72.09%)
Mutual labels:  sentiment-analysis
pysentimiento
A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks
Stars: ✭ 274 (+537.21%)
Mutual labels:  sentiment-analysis
sarcasm-detection-for-sentiment-analysis
Sarcasm Detection for Sentiment Analysis
Stars: ✭ 21 (-51.16%)
Mutual labels:  sentiment-analysis
soroka
Find out whether people are saying good or bad things about you or your company online! Our AI-powered "Soroka" (magpie) will bring the news to you on its tail.
Stars: ✭ 16 (-62.79%)
Mutual labels:  sentiment-analysis
twitter-sentiment-analysis
Streaming tweets with spark, language detection & sentiment analysis, dashboard with Kibana
Stars: ✭ 100 (+132.56%)
Mutual labels:  sentiment-analysis
awesome-text-classification
Text classification meets word embeddings.
Stars: ✭ 27 (-37.21%)
Mutual labels:  sentiment-analysis


Real_Time_DataMining_Software


Video demo

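A tool for real-time text mining that takes up no extra storage: collected data is saved directly to local txt files. There is no need to page through listings by hand; entering the link of a listing is enough to analyze that homestay, including visualization of its structured data and sentiment analysis of its unstructured UGC. It is a comprehensive demo covering real-time data collection, data cleaning, structured storage, UGC topic extraction, sentiment analysis, and visualization of the post-structured results. The project mines opinions from the UGC of online homestay platforms, combining data mining with NLP processing, and handles data collection, full-sentence segmentation, topic extraction, sentiment analysis, and related tasks. Its main goal is to overcome inconsistencies between users' ratings and their comments: it evaluates satisfaction with online homestays on Ctrip and Meituan in real time, visualizes additional data, and mines and visualizes online UGC along multiple dimensions. Compared with the scores customers give directly, machine-learning-based sentiment analysis uncovers more detailed customer opinions and a more reasonable corresponding score.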

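The software consists of data collection (txt_analysis/spiders), cleaning, and visualization (txt_analysis/picturing) components. Run python3 RealTime_UGC_Analysis_GUI.py to open the GUI, and install any missing libraries as prompted.
The main features are modules for collecting raw reviews from Meituan/Ctrip online homestay UGC, topic classification, real-time data cleaning, text sentiment analysis, and visualization of the post-structured results.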
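  1. Uses Request to simulate browsing and collect Meituan/Ctrip homestay UGC automatically in real time; the extracted information includes the homestay address and its online reviews.
  2. Sentiment analysis of individual UGC entries: a classification model is trained on data labelled with the cleaned user ratings and then used to classify the segmented UGC.
  3. Topic-level UGC sentiment analysis on the extracted homestay topics, for example environment.
  4. Extracts UGC tags and visualizes them quantitatively.
  5. UGC sentiment analysis results for a single homestay.
  6. Share of each customer rating in the UGC.
  7. Analysis of repeat bookings (second booking or later) for the homestay.
  8. Shows the homestay's stays across different months through data cleaning and structured display.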
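
The structured statistics in the last three items (rating share, repeat bookings, stays per month) can be sketched with the standard library alone. This is only an illustration under stated assumptions: the record fields (user, rating, stay_date), the sample data, and the function names are hypothetical and are not taken from the project's code or its txt storage format.

```python
from collections import Counter
from datetime import date

# Hypothetical cleaned review records; the field names are assumptions,
# not the schema used by the project.
reviews = [
    {"user": "u1", "rating": 5, "stay_date": date(2020, 1, 12)},
    {"user": "u2", "rating": 3, "stay_date": date(2020, 1, 20)},
    {"user": "u1", "rating": 4, "stay_date": date(2020, 3, 2)},
    {"user": "u3", "rating": 5, "stay_date": date(2020, 3, 15)},
]


def rating_share(records):
    """Return the fraction of reviews that gave each rating value."""
    counts = Counter(r["rating"] for r in records)
    total = sum(counts.values())
    return {rating: n / total for rating, n in sorted(counts.items())}


def repeat_guests(records):
    """Return the users who reviewed (and so presumably booked) twice or more."""
    counts = Counter(r["user"] for r in records)
    return sorted(user for user, n in counts.items() if n >= 2)


def stays_per_month(records):
    """Count stays per calendar month, keyed as YYYY-MM."""
    counts = Counter(r["stay_date"].strftime("%Y-%m") for r in records)
    return dict(sorted(counts.items()))


if __name__ == "__main__":
    print(rating_share(reviews))
    print(repeat_guests(reviews))
    print(stays_per_month(reviews))
```

The resulting dictionaries map directly onto a pie chart (rating share) and a bar chart (stays per month) in whatever plotting library the visualization step uses.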

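This software reflects some of my own practice in putting NLP techniques to work. A single full-sentence UGC entry often covers several topics, so the sentence is split at punctuation marks, guided by part-of-speech tagging, into several clauses, and each clause is then classified by topic using a topic dictionary. The work is a comprehensive demo covering real-time data collection, data cleaning, structured storage, UGC topic extraction, sentiment analysis, and visualization of the post-structured results. Compared with the scores customers give directly, machine-learning-based sentiment analysis uncovers more detailed customer opinions and a more reasonable corresponding score.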
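
The clause splitting and dictionary-based topic classification described above might look roughly like the sketch below. It is a simplification, not the project's implementation: it splits on punctuation only and omits the part-of-speech tagging step, and the topic names, keywords, and function names are all invented for illustration.

```python
import re

# Hypothetical topic dictionary: topic name -> keywords that signal it.
# The project's real dictionary is not shown in this README.
TOPIC_DICT = {
    "environment": ["环境", "安静", "周边"],
    "cleanliness": ["干净", "卫生", "整洁"],
    "service": ["房东", "服务", "热情"],
}

# Full-width and half-width punctuation treated as clause boundaries.
CLAUSE_DELIMITERS = re.compile(r"[,。!?;、,.!?;]+")


def split_clauses(review):
    """Split a full review into non-empty clauses at punctuation marks."""
    return [c.strip() for c in CLAUSE_DELIMITERS.split(review) if c.strip()]


def classify_topics(clause, topic_dict=TOPIC_DICT):
    """Return every topic whose keyword list has a match in the clause."""
    return [
        topic
        for topic, keywords in topic_dict.items()
        if any(keyword in clause for keyword in keywords)
    ]


def topics_by_clause(review):
    """Pair each clause of a review with the topics detected in it."""
    return [(clause, classify_topics(clause)) for clause in split_clauses(review)]


if __name__ == "__main__":
    sample = "环境很好,房间不太干净,房东很热情。"
    for clause, topics in topics_by_clause(sample):
        print(clause, topics)
```

Splitting first means each clause can be passed to the sentiment classifier on its own, so one review can yield a positive result for environment and a negative one for cleanliness.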
