sukrutrao / Fast-Dawid-Skene

Licence: MIT license

Code for the algorithms in the paper: Vaibhav B Sinha, Sukrut Rao, Vineeth N Balasubramanian. Fast Dawid-Skene: A Fast Vote Aggregation Scheme for Sentiment Classification. KDD WISDOM 2018

Programming Languages

python

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Fast-Dawid-Skene

machine-learning

Python machine learning applications in image processing, recommender system, matrix completion, netflix problem and algorithm implementations including Co-clustering, Funk SVD, SVD++, Non-negative Matrix Factorization, Koren Neighborhood Model, Koren Integrated Model, Dawid-Skene, Platt-Burges, Expectation Maximization, Factor Analysis, ISTA, F…

Stars: ✭ 91 (+160%)

Mutual labels: expectation-maximization

sentistrength id

Sentiment Strength Detection in Bahasa Indonesia

Stars: ✭ 32 (-8.57%)

Mutual labels: sentiment-classification

tutorial-quest

As you delve deep into the Dungeons to take on the biggest baddest bosses, DON'T GO ALONE!! Take THE Open Source Raid Guild with you!

Stars: ✭ 35 (+0%)

Mutual labels: crowdsourcing

bert-movie-reviews-sentiment-classifier

Build a Movie Reviews Sentiment Classifier with Google's BERT Language Model

Stars: ✭ 12 (-65.71%)

Mutual labels: sentiment-classification

domain-attention

codes for paper "Domain Attention Model for Multi-Domain Sentiment Classification"

Stars: ✭ 22 (-37.14%)

Mutual labels: sentiment-classification

AlpacaTag

AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging (ACL 2019 Demo)

Stars: ✭ 126 (+260%)

Mutual labels: crowdsourcing

Dataset-Sentimen-Analisis-Bahasa-Indonesia

Repositori ini merupakan kumpulan dataset terkait analisis sentimen Berbahasa Indonesia. Apabila Anda menggunakan dataset-dataset yang ada pada repositori ini untuk penelitian, maka cantumkanlah/kutiplah jurnal artikel terkait dataset tersebut. Dataset yang tersedia telah diimplementasikan dalam beberapa penelitian dan hasilnya telah dipublikasi…

Stars: ✭ 38 (+8.57%)

Mutual labels: sentiment-classification

CrowdTruth-core

CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems

Stars: ✭ 45 (+28.57%)

Mutual labels: crowdsourcing

youpin-web

YouPin on the web

Stars: ✭ 13 (-62.86%)

Mutual labels: crowdsourcing

opendata

Open data of Cofacts collaborative fact-checking database

Stars: ✭ 35 (+0%)

Mutual labels: crowdsourcing

em-explanation

Notebooks explaining the intuition behind the Expectation Maximisation algorithm

Stars: ✭ 32 (-8.57%)

Mutual labels: expectation-maximization

HSSC

Code for "A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification" (IJCAI 2018)

Stars: ✭ 23 (-34.29%)

Mutual labels: sentiment-classification

nsmc-zeppelin-notebook

Movie review dataset Word2Vec & sentiment classification Zeppelin notebook

Stars: ✭ 26 (-25.71%)

Mutual labels: sentiment-classification

Deep-learning-And-Paper

【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、语音合成实验代码等

Stars: ✭ 62 (+77.14%)

Mutual labels: sentiment-classification

models-by-example

By-hand code for models and algorithms. An update to the 'Miscellaneous-R-Code' repo.

Stars: ✭ 43 (+22.86%)

Mutual labels: expectation-maximization

COVID-19-Tweet-Classification-using-Roberta-and-Bert-Simple-Transformers

Rank 1 / 216

Stars: ✭ 24 (-31.43%)

Mutual labels: sentiment-classification

pysenti

Chinese Sentiment Classification Tool. 情感极性分类，基于知网、清华、BosonNLP情感词典，易扩展，基准方法，开箱即用。

Stars: ✭ 31 (-11.43%)

Mutual labels: sentiment-classification

SentimentAnalysis

Sentiment Analysis: Deep Bi-LSTM+attention model

Stars: ✭ 32 (-8.57%)

Mutual labels: sentiment-classification

NTUA-slp-nlp

💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA

Stars: ✭ 19 (-45.71%)

Mutual labels: sentiment-classification

bikeways4everybody

Crowdsourcing bike routes in Boston. Making pretty maps from it.

Stars: ✭ 30 (-14.29%)

Mutual labels: crowdsourcing

View All Similar Projects ➔

Fast Dawid-Skene

Paper | arXiv | Code | Slides | Supplementary Results

Implementation of the Fast Dawid-Skene and Hybrid algorithms described in the paper:

Vaibhav B Sinha, Sukrut Rao, Vineeth N Balasubramanian. Fast Dawid-Skene: A Fast Vote Aggregation Scheme for Sentiment Classification. In Workshop on Issues of Sentiment Discovery and Opinion Mining (WISDOM) at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2018, August 2018.

Implementations of the Dawid-Skene (Dawid and Skene, 1979) and Majority Voting algorithms are also provided.

These algorithms can be used to aggregate crowd-sourced labels to estimate the true labels. Given the labels of a data point from many annotators out of a set of classes, the algorithms output the most likely correct class for the data point.

Setup

Prerequisites

The prerequisites are:

Python 2.7 or 3.4-3.6
pip

Setting up dependencies

All other dependencies can be installed using pip, as

$ pip install -r requirements.txt

If tests are to be run, use instead,

$ pip install -r requirements-dev.txt

Preparing the data

A description of the data format and the procedure to add a new dataset is given here. A toy dataset is also provided, and can be found here.

Running the program

To run the program, use

$ python scripts/fast_dawid_skene.py [OPTIONS]

To view a list of available options along with descriptions, use

$ python scripts/fast_dawid_skene.py --help

Example Run

To run on the toy dataset, with two annotators per question, using the FDS algorithm to obtain predictions, use

$ python scripts/fast_dawid_skene.py --dataset toy --k 2 --mode aggregate --algorithm FDS --print_result

To run using all available annotations for every question, using the FDS algorithm to obtain predictions, use

$ python scripts/fast_dawid_skene.py --dataset toy --mode aggregate --algorithm FDS --print_result

Running tests

Tests can be run using pytest, as,

$ py.test

License

This code is provided under the MIT License.

Some parts of the code in this file are derived from this implementation, and the original license and copyright notice can be found at the top of the file.

Citation

If the Fast Dawid-Skene / Hybrid algorithms are useful for your research, please cite our paper [2] (References).

Acknowledgements

Parts of the code for the implementation of the algorithms use or derive from code in this implementation.

References

A. P. Dawid and A. M. Skene. 1979. Maximum Likelihood Estimation of Observer Error-Rates Using the EM Algorithm. J. Royal Stat. Soc. Series C 28, 1 (1979), 20–28.
Vaibhav B Sinha, Sukrut Rao, Vineeth N Balasubramanian. Fast Dawid-Skene: A Fast Vote Aggregation Scheme for Sentiment Classification. In Workshop on Issues of Sentiment Discovery and Opinion Mining (WISDOM) at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2018, August 2018.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

sukrutrao / Fast-Dawid-Skene

Programming Languages

Labels

Projects that are alternatives of or similar to Fast-Dawid-Skene

Fast Dawid-Skene

Setup

Prerequisites

Setting up dependencies

Preparing the data

Running the program

Example Run

Running tests

License

Citation

Acknowledgements

References