All Projects → sismetanin → sentiment-analysis-of-tweets-in-russian

sismetanin / sentiment-analysis-of-tweets-in-russian

Licence: MIT license
Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to sentiment-analysis-of-tweets-in-russian

word2vec-tsne
Google News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.
Stars: ✭ 59 (+15.69%)
Mutual labels:  word2vec, word-embeddings, embeddings, machinelearning, computational-linguistics, nlp-machine-learning
SentimentAnalysis
Sentiment Analysis: Deep Bi-LSTM+attention model
Stars: ✭ 32 (-37.25%)
Mutual labels:  sentiment-analysis, word-embeddings, embeddings, computational-linguistics, nlp-machine-learning
NTUA-slp-nlp
💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (-62.75%)
Mutual labels:  sentiment-analysis, word2vec, word-embeddings, nlp-machine-learning
datastories-semeval2017-task6
Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (-60.78%)
Mutual labels:  word-embeddings, embeddings, computational-linguistics, nlp-machine-learning
Datastories Semeval2017 Task4
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Stars: ✭ 184 (+260.78%)
Mutual labels:  sentiment-analysis, word-embeddings, embeddings, nlp-machine-learning
Dict2vec
Dict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (+78.43%)
Mutual labels:  word2vec, word-embeddings, embeddings
Magnitude
A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+2633.33%)
Mutual labels:  word2vec, word-embeddings, embeddings
Dna2vec
dna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (+129.41%)
Mutual labels:  word2vec, word-embeddings, embeddings
Fasttext.js
FastText for Node.js
Stars: ✭ 127 (+149.02%)
Mutual labels:  word2vec, word-embeddings, machinelearning
Persian-Sentiment-Analyzer
Persian sentiment analysis ( آناکاوی سهش های فارسی | تحلیل احساسات فارسی )
Stars: ✭ 30 (-41.18%)
Mutual labels:  sentiment-analysis, word2vec, embeddings
overview-and-benchmark-of-traditional-and-deep-learning-models-in-text-classification
NLP tutorial
Stars: ✭ 41 (-19.61%)
Mutual labels:  tweets, sentiment-analysis, word-embeddings
lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-47.06%)
Mutual labels:  word2vec, word-embeddings, embeddings
ar-embeddings
Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec
Stars: ✭ 83 (+62.75%)
Mutual labels:  sentiment-analysis, embeddings
brand-sentiment-analysis
Scripts utilizing Heartex platform to build brand sentiment analysis from the news
Stars: ✭ 21 (-58.82%)
Mutual labels:  sentiment-analysis, nlp-machine-learning
word2vec-on-wikipedia
A pipeline for training word embeddings using word2vec on wikipedia corpus.
Stars: ✭ 68 (+33.33%)
Mutual labels:  word2vec, word-embeddings
Arabic-Word-Embeddings-Word2vec
Arabic Word Embeddings Word2vec
Stars: ✭ 26 (-49.02%)
Mutual labels:  word2vec, word-embeddings
PersianNER
Named-Entity Recognition in Persian Language
Stars: ✭ 48 (-5.88%)
Mutual labels:  word-embeddings, embeddings
word-benchmarks
Benchmarks for intrinsic word embeddings evaluation.
Stars: ✭ 45 (-11.76%)
Mutual labels:  word2vec, word-embeddings
sarcasm-detection-for-sentiment-analysis
Sarcasm Detection for Sentiment Analysis
Stars: ✭ 21 (-58.82%)
Mutual labels:  sentiment-analysis, word2vec
RadiologyReportEmbedding
Intelligent Word Embeddings of Free-Text Radiology Reports
Stars: ✭ 22 (-56.86%)
Mutual labels:  embeddings, nlp-machine-learning

Sentiment Analysis of Tweets in Russian using Convolutional Neural Network with Word2Vec Word Embeddings

This repository contains the source code for binary sentiment analysis of tweets in Russian using CNN. Word2Vec was used in order to set up pre-trained word embeddings for one-layer convolutional neural network, which was constructed using Keras with TensorFlow backend. Experiments showed F-measure score up to 78.14%.

MSA The CNN architecture was inspired by approaches described in [3] and [4].

Citation:

@INPROCEEDINGS{Smetanin-SA-2019,
  author={Sergey Smetanin and Michail Komarov},
  booktitle={2019 IEEE 21st Conference on Business Informatics (CBI)},
  title={Sentiment Analysis of Product Reviews in Russian using Convolutional Neural Networks},
  year={2019},
  volume={01},
  number={},
  pages={482-486},
  doi={10.1109/CBI.2019.00062},
  ISSN={2378-1963},
  month={July}
}

Data

The corpus of short texts in Russian based on Twitter messages is available at http://study.mokoron.com/ (and also described in [1]). The files positive.csv and negative.csv contain positively labelled and negatively labelled tweets, respectively.

Trained Models

To download trained CNN and Word2Vec models for Russian, please follow the link https://yadi.sk/d/Xohf3dTuVTakDA.

References

  1. Y. Rubtsova, "Constructing a Corpus for Sentiment Classification Training", Software & Systems, vol. 109, no. 1, pp. 72-78, 2015.
  2. T. Mikolov, I. Sutskeve, K. Chen, G. Corrado and J. Dean, "Distributed Representations of Words and Phrases and their Compositionality", Advances in Neural Information Processing Systems, vol. 26, pp. 3111-3119, 2013.
  3. Y. Zhang and B. Wallace, "A Sensitivity Analysis of (and Practitioners' Guide to) Convolutional Neural Networks for Sentence Classification", arXiv preprint arXiv:1510.03820, 2015.
  4. M. Cliche, "BB_twtr at SemEval-2017 Task 4: Twitter Sentiment Analysis with CNNs and LSTMs", Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 573-580, 2017.

Documentation and How to report bugs

License

See LICENSE.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].