All Projects → sukrutrao → Fast-Dawid-Skene

sukrutrao / Fast-Dawid-Skene

Licence: MIT license
Code for the algorithms in the paper: Vaibhav B Sinha, Sukrut Rao, Vineeth N Balasubramanian. Fast Dawid-Skene: A Fast Vote Aggregation Scheme for Sentiment Classification. KDD WISDOM 2018

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Fast-Dawid-Skene

machine-learning
Python machine learning applications in image processing, recommender system, matrix completion, netflix problem and algorithm implementations including Co-clustering, Funk SVD, SVD++, Non-negative Matrix Factorization, Koren Neighborhood Model, Koren Integrated Model, Dawid-Skene, Platt-Burges, Expectation Maximization, Factor Analysis, ISTA, F…
Stars: ✭ 91 (+160%)
Mutual labels:  expectation-maximization
sentistrength id
Sentiment Strength Detection in Bahasa Indonesia
Stars: ✭ 32 (-8.57%)
Mutual labels:  sentiment-classification
tutorial-quest
As you delve deep into the Dungeons to take on the biggest baddest bosses, DON'T GO ALONE!! Take THE Open Source Raid Guild with you!
Stars: ✭ 35 (+0%)
Mutual labels:  crowdsourcing
bert-movie-reviews-sentiment-classifier
Build a Movie Reviews Sentiment Classifier with Google's BERT Language Model
Stars: ✭ 12 (-65.71%)
Mutual labels:  sentiment-classification
domain-attention
codes for paper "Domain Attention Model for Multi-Domain Sentiment Classification"
Stars: ✭ 22 (-37.14%)
Mutual labels:  sentiment-classification
AlpacaTag
AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging (ACL 2019 Demo)
Stars: ✭ 126 (+260%)
Mutual labels:  crowdsourcing
Dataset-Sentimen-Analisis-Bahasa-Indonesia
Repositori ini merupakan kumpulan dataset terkait analisis sentimen Berbahasa Indonesia. Apabila Anda menggunakan dataset-dataset yang ada pada repositori ini untuk penelitian, maka cantumkanlah/kutiplah jurnal artikel terkait dataset tersebut. Dataset yang tersedia telah diimplementasikan dalam beberapa penelitian dan hasilnya telah dipublikasi…
Stars: ✭ 38 (+8.57%)
Mutual labels:  sentiment-classification
CrowdTruth-core
CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems
Stars: ✭ 45 (+28.57%)
Mutual labels:  crowdsourcing
youpin-web
YouPin on the web
Stars: ✭ 13 (-62.86%)
Mutual labels:  crowdsourcing
opendata
Open data of Cofacts collaborative fact-checking database
Stars: ✭ 35 (+0%)
Mutual labels:  crowdsourcing
em-explanation
Notebooks explaining the intuition behind the Expectation Maximisation algorithm
Stars: ✭ 32 (-8.57%)
Mutual labels:  expectation-maximization
HSSC
Code for "A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification" (IJCAI 2018)
Stars: ✭ 23 (-34.29%)
Mutual labels:  sentiment-classification
nsmc-zeppelin-notebook
Movie review dataset Word2Vec & sentiment classification Zeppelin notebook
Stars: ✭ 26 (-25.71%)
Mutual labels:  sentiment-classification
Deep-learning-And-Paper
【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、语音合成实验代码等
Stars: ✭ 62 (+77.14%)
Mutual labels:  sentiment-classification
models-by-example
By-hand code for models and algorithms. An update to the 'Miscellaneous-R-Code' repo.
Stars: ✭ 43 (+22.86%)
Mutual labels:  expectation-maximization
COVID-19-Tweet-Classification-using-Roberta-and-Bert-Simple-Transformers
Rank 1 / 216
Stars: ✭ 24 (-31.43%)
Mutual labels:  sentiment-classification
pysenti
Chinese Sentiment Classification Tool. 情感极性分类,基于知网、清华、BosonNLP情感词典,易扩展,基准方法,开箱即用。
Stars: ✭ 31 (-11.43%)
Mutual labels:  sentiment-classification
SentimentAnalysis
Sentiment Analysis: Deep Bi-LSTM+attention model
Stars: ✭ 32 (-8.57%)
Mutual labels:  sentiment-classification
NTUA-slp-nlp
💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (-45.71%)
Mutual labels:  sentiment-classification
bikeways4everybody
Crowdsourcing bike routes in Boston. Making pretty maps from it.
Stars: ✭ 30 (-14.29%)
Mutual labels:  crowdsourcing

Fast Dawid-Skene

GitHub

Paper | arXiv | Code | Slides | Supplementary Results

Implementation of the Fast Dawid-Skene and Hybrid algorithms described in the paper:

Vaibhav B Sinha, Sukrut Rao, Vineeth N Balasubramanian. Fast Dawid-Skene: A Fast Vote Aggregation Scheme for Sentiment Classification. In Workshop on Issues of Sentiment Discovery and Opinion Mining (WISDOM) at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2018, August 2018.

Implementations of the Dawid-Skene (Dawid and Skene, 1979) and Majority Voting algorithms are also provided.

These algorithms can be used to aggregate crowd-sourced labels to estimate the true labels. Given the labels of a data point from many annotators out of a set of classes, the algorithms output the most likely correct class for the data point.

Setup

Prerequisites

The prerequisites are:

  • Python 2.7 or 3.4-3.6
  • pip

Setting up dependencies

All other dependencies can be installed using pip, as

$ pip install -r requirements.txt

If tests are to be run, use instead,

$ pip install -r requirements-dev.txt

Preparing the data

A description of the data format and the procedure to add a new dataset is given here. A toy dataset is also provided, and can be found here.

Running the program

To run the program, use

$ python scripts/fast_dawid_skene.py [OPTIONS]

To view a list of available options along with descriptions, use

$ python scripts/fast_dawid_skene.py --help

Example Run

To run on the toy dataset, with two annotators per question, using the FDS algorithm to obtain predictions, use

$ python scripts/fast_dawid_skene.py --dataset toy --k 2 --mode aggregate --algorithm FDS --print_result

To run using all available annotations for every question, using the FDS algorithm to obtain predictions, use

$ python scripts/fast_dawid_skene.py --dataset toy --mode aggregate --algorithm FDS --print_result

Running tests

Tests can be run using pytest, as,

$ py.test

License

This code is provided under the MIT License.

Some parts of the code in this file are derived from this implementation, and the original license and copyright notice can be found at the top of the file.

Citation

If the Fast Dawid-Skene / Hybrid algorithms are useful for your research, please cite our paper [2] (References).

Acknowledgements

Parts of the code for the implementation of the algorithms use or derive from code in this implementation.

References

  1. A. P. Dawid and A. M. Skene. 1979. Maximum Likelihood Estimation of Observer Error-Rates Using the EM Algorithm. J. Royal Stat. Soc. Series C 28, 1 (1979), 20–28.
  2. Vaibhav B Sinha, Sukrut Rao, Vineeth N Balasubramanian. Fast Dawid-Skene: A Fast Vote Aggregation Scheme for Sentiment Classification. In Workshop on Issues of Sentiment Discovery and Opinion Mining (WISDOM) at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2018, August 2018.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].