All Projects → bureaucratic-labs → Dostoevsky

bureaucratic-labs / Dostoevsky

Licence: mit
Sentiment analysis library for russian language

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Dostoevsky

Dialogue Understanding
This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
Stars: ✭ 77 (-59.69%)
Mutual labels:  natural-language-processing, sentiment-analysis
Nlp Papers
Papers and Book to look at when starting NLP 📚
Stars: ✭ 111 (-41.88%)
Mutual labels:  natural-language-processing, sentiment-analysis
Turkish Bert Nlp Pipeline
Bert-base NLP pipeline for Turkish, Ner, Sentiment Analysis, Question Answering etc.
Stars: ✭ 85 (-55.5%)
Mutual labels:  natural-language-processing, sentiment-analysis
Text Analytics With Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+492.67%)
Mutual labels:  natural-language-processing, sentiment-analysis
Char Cnn Text Classification Pytorch
Character-level Convolutional Neural Networks for text classification in PyTorch
Stars: ✭ 147 (-23.04%)
Mutual labels:  natural-language-processing, sentiment-analysis
Absa Pytorch
Aspect Based Sentiment Analysis, PyTorch Implementations. 基于方面的情感分析,使用PyTorch实现。
Stars: ✭ 1,181 (+518.32%)
Mutual labels:  natural-language-processing, sentiment-analysis
Pynlp
A pythonic wrapper for Stanford CoreNLP.
Stars: ✭ 103 (-46.07%)
Mutual labels:  natural-language-processing, sentiment-analysis
Stocksight
Stock market analyzer and predictor using Elasticsearch, Twitter, News headlines and Python natural language processing and sentiment analysis
Stars: ✭ 1,037 (+442.93%)
Mutual labels:  natural-language-processing, sentiment-analysis
Googlelanguager
R client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
Stars: ✭ 145 (-24.08%)
Mutual labels:  natural-language-processing, sentiment-analysis
Absapapers
Worth-reading papers and related awesome resources on aspect-based sentiment analysis (ABSA). 值得一读的方面级情感分析论文与相关资源集合
Stars: ✭ 142 (-25.65%)
Mutual labels:  natural-language-processing, sentiment-analysis
Repo 2017
Python codes in Machine Learning, NLP, Deep Learning and Reinforcement Learning with Keras and Theano
Stars: ✭ 1,123 (+487.96%)
Mutual labels:  natural-language-processing, sentiment-analysis
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+1218.32%)
Mutual labels:  natural-language-processing, sentiment-analysis
Textblob Ar
Arabic support for textblob
Stars: ✭ 60 (-68.59%)
Mutual labels:  natural-language-processing, sentiment-analysis
Senta
Baidu's open-source Sentiment Analysis System.
Stars: ✭ 1,187 (+521.47%)
Mutual labels:  natural-language-processing, sentiment-analysis
Pattern
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
Stars: ✭ 8,112 (+4147.12%)
Mutual labels:  natural-language-processing, sentiment-analysis
Pytreebank
😡😇 Stanford Sentiment Treebank loader in Python
Stars: ✭ 93 (-51.31%)
Mutual labels:  natural-language-processing, sentiment-analysis
Nlp With Ruby
Curated List: Practical Natural Language Processing done in Ruby
Stars: ✭ 907 (+374.87%)
Mutual labels:  natural-language-processing, sentiment-analysis
Ml Classify Text Js
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Stars: ✭ 38 (-80.1%)
Mutual labels:  natural-language-processing, sentiment-analysis
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-30.37%)
Mutual labels:  natural-language-processing, sentiment-analysis
Nlp bahasa resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-17.28%)
Mutual labels:  natural-language-processing, sentiment-analysis

Dostoevsky Test & Lint

Sentiment analysis library for russian language

Install

Please note that Dostoevsky supports only Python 3.6+ on both Linux and Windows

$ pip install dostoevsky

Social network model [FastText]

This model was trained on RuSentiment dataset and achieves up to ~0.71 F1 score.

Usage

First of all, you'll need to download binary model:

$ python -m dostoevsky download fasttext-social-network-model

Then you can use sentiment analyzer:

from dostoevsky.tokenization import RegexTokenizer
from dostoevsky.models import FastTextSocialNetworkModel

tokenizer = RegexTokenizer()
tokens = tokenizer.split('всё очень плохо')  # [('всё', None), ('очень', None), ('плохо', None)]

model = FastTextSocialNetworkModel(tokenizer=tokenizer)

messages = [
    'привет',
    'я люблю тебя!!',
    'малолетние дебилы'
]

results = model.predict(messages, k=2)

for message, sentiment in zip(messages, results):
    # привет -> {'speech': 1.0000100135803223, 'skip': 0.0020607432816177607}
    # люблю тебя!! -> {'positive': 0.9886782765388489, 'skip': 0.005394937004894018}
    # малолетние дебилы -> {'negative': 0.9525841474533081, 'neutral': 0.13661839067935944}]
    print(message, '->', sentiment)

If you use the library in a research project, please include the following citation for the RuSentiment data:

@inproceedings{rogers-etal-2018-rusentiment,
    title = "{R}u{S}entiment: An Enriched Sentiment Analysis Dataset for Social Media in {R}ussian",
    author = "Rogers, Anna  and
      Romanov, Alexey  and
      Rumshisky, Anna  and
      Volkova, Svitlana  and
      Gronas, Mikhail  and
      Gribov, Alex",
    booktitle = "Proceedings of the 27th International Conference on Computational Linguistics",
    month = aug,
    year = "2018",
    address = "Santa Fe, New Mexico, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/C18-1064",
    pages = "755--763",
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].