All Projects → jiangxiaolin → LinLP

jiangxiaolin / LinLP

Licence: other
使用Python进行自然语言处理相关实践,如新词发现,主题模型,隐马尔模型词性标注,Word2Vec,情感分析

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to LinLP

Weibo Analyst
Social media (Weibo) comments analyzing toolbox in Chinese 微博评论分析工具, 实现功能: 1.微博评论数据爬取; 2.分词与关键词提取; 3.词云与词频统计; 4.情感分析; 5.主题聚类
Stars: ✭ 430 (+900%)
Mutual labels:  sentiment-analysis, lda
xinlp
把李航老师《统计学习方法》的后几章的算法都用java实现了一遍,实现盒子与球的EM算法,扩展到去GMM训练,后来实现了HMM分词(实现了HMM分词的参数训练)和CRF分词(借用CRF++训练的参数模型),最后利用tensorFlow把BiLSTM+CRF实现了,然后为lucene包装了一个XinAnalyzer
Stars: ✭ 21 (-51.16%)
Mutual labels:  hmm, lda
amazon-reviews
Sentiment Analysis & Topic Modeling with Amazon Reviews
Stars: ✭ 26 (-39.53%)
Mutual labels:  sentiment-analysis, lda
CLUEmotionAnalysis2020
CLUE Emotion Analysis Dataset 细粒度情感分析数据集
Stars: ✭ 3 (-93.02%)
Mutual labels:  sentiment-analysis
sentence-classification-pytorch
Sentiment analysis with variable length sequences in pytorch
Stars: ✭ 34 (-20.93%)
Mutual labels:  sentiment-analysis
XED
XED multilingual emotion datasets
Stars: ✭ 34 (-20.93%)
Mutual labels:  sentiment-analysis
restaurant-finder-featureReviews
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (-51.16%)
Mutual labels:  sentiment-analysis
fawkes
🚀🚀 Fetch, parse, categorize, summarize user reviews 🚀🚀
Stars: ✭ 83 (+93.02%)
Mutual labels:  sentiment-analysis
fsauor2018
基于LSTM网络与自注意力机制对中文评论进行细粒度情感分析
Stars: ✭ 36 (-16.28%)
Mutual labels:  sentiment-analysis
sentiment-analysis-of-tweets-in-russian
Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.
Stars: ✭ 51 (+18.6%)
Mutual labels:  sentiment-analysis
nlpserver
NLP Web Service
Stars: ✭ 76 (+76.74%)
Mutual labels:  sentiment-analysis
KGE-LDA
Knowledge Graph Embedding LDA. AAAI 2017
Stars: ✭ 35 (-18.6%)
Mutual labels:  lda
Steward
A stock portfolio manager that provides neural net based short-term predictions for stocks and natural language processing based analysis on community sentiments.
Stars: ✭ 25 (-41.86%)
Mutual labels:  sentiment-analysis
Aspect-Based-Sentiment-Analysis
A python program that implements Aspect Based Sentiment Analysis classification system for SemEval 2016 Dataset.
Stars: ✭ 57 (+32.56%)
Mutual labels:  sentiment-analysis
stocktwits-sentiment
Stocktwits market sentiment analysis in Python with Keras and TensorFlow.
Stars: ✭ 23 (-46.51%)
Mutual labels:  sentiment-analysis
lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-37.21%)
Mutual labels:  lda
CIP
Basic exercises of chinese information processing
Stars: ✭ 32 (-25.58%)
Mutual labels:  hmm
ntua-slp-semeval2018
Deep-learning models of NTUA-SLP team submitted in SemEval 2018 tasks 1, 2 and 3.
Stars: ✭ 79 (+83.72%)
Mutual labels:  sentiment-analysis
reddit-opinion-mining
Sentiment analysis and opinion mining of Reddit data.
Stars: ✭ 15 (-65.12%)
Mutual labels:  sentiment-analysis
InterGCN-ABSA
[COLING 2020] Jointly Learning Aspect-Focused and Inter-Aspect Relations with Graph Convolutional Networks for Aspect Sentiment Analysis
Stars: ✭ 41 (-4.65%)
Mutual labels:  sentiment-analysis

使用Python进行自然语言处理实践

代码环境: python --version 3.5.2
tensorflow
keras

本项目旨在归纳总结自然语言处理工程师常见实践内容,每个模块尽量包含理论(经典算法和最新paper)和实践部分。

一、常见自然语言处理任务

1、文本向量化:

1.1 tfidf

1.2 word2vec
(文件夹)word2vec:word2vec使用(done)

1.3 glove
1.4 elmo
1.5 bert
1.6 xlnet

2、文本分类问题

2.1(文件夹)sentiment:情感分析实践-目前使用word2vec和svm实现(done)
2.2 BiLSTM
2.3 HAN

3、自然语言推理问题
相似文本匹配、问答匹配)

4、语言生成问题

闲聊、摘要

5、任务型对话 对话状态跟踪、对话策略生成、用户模拟器

6、知识图谱 基于neo4j构建问答系统 transe

7、信息抽取 命名实体识别

8、自然语言基础功能: 8.1 分词

8.2 词性标注 (文件夹)HMMPostag:hmm词性标注(done)

8.3 句法分析

8.4 新词发现 (文件夹)NewWordFinder:新词发现功能(done)

8.5 指代消解

9、文本聚类

(文件夹)LDATopicModel:lda主题模型(done)

备注: (文件夹)resource:部分项目公共资源目录

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].