siat-nlp / Mams For Absa

Licence: apache-2.0
A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.

Programming language: Python

MAMS-for-ABSA

This repository contains the data and code for the paper "A Challenge Dataset and Effective Models for Aspect-Based Sentiment Analysis" (EMNLP-IJCNLP 2019) [paper].

MAMS

MAMS is a challenge dataset for aspect-based sentiment analysis (ABSA) in which each sentence contains at least two aspects with different sentiment polarities. The dataset comes in two versions: one for aspect-term sentiment analysis (ATSA) and one for aspect-category sentiment analysis (ACSA).
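The defining property of a MAMS sentence can be written as a small check. The sketch below is illustrative only: the Aspect class, the function name, and the example sentence are hypothetical and are not part of this repository or of the dataset files.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Aspect:
    target: str    # aspect term (ATSA) or aspect category (ACSA)
    polarity: str  # e.g. "positive", "negative", "neutral"


def is_multi_aspect_multi_sentiment(aspects: List[Aspect]) -> bool:
    """True if there are at least two aspects and they do not all share one polarity."""
    return len(aspects) >= 2 and len({a.polarity for a in aspects}) >= 2


# Illustrative sentence, not taken from the dataset:
# "The food was great but the service was slow."
example = [Aspect("food", "positive"), Aspect("service", "negative")]
print(is_multi_aspect_multi_sentiment(example))
```

A sentence with a single aspect, or with several aspects that all carry the same polarity, fails this check.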

Requirements

pytorch==1.1.0
spacy==2.1.8
pytorch-pretrained-bert==0.6.2
adabound==0.0.5
pyyaml==5.1.2
numpy==1.17.2
scikit-learn==0.21.3
scipy==1.3.1

Quick Start

Download the pretrained GloVe vectors (http://nlp.stanford.edu/data/wordvecs/glove.840B.300d.zip), unzip the archive, and place glove.840B.300d.txt in the ./data folder. Then edit config.py to select the task, model, and hyper-parameters. When mode is set to term, base_path should point to an ATSA dataset; when mode is set to category, base_path should point to an ACSA dataset.
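The pairing between mode and base_path is easy to get wrong, so a quick sanity check before preprocessing may help. The following is a minimal sketch under stated assumptions: the dictionary layout, the key names other than mode and base_path, the directory name ./data/MAMS-ATSA, and the rule that the dataset folder name contains "ATSA" or "ACSA" are all hypothetical. The real config.py may be organized differently.

```python
import os

# Hypothetical settings; the real config.py may use different names and layout.
config = {
    "mode": "term",                   # "term" (ATSA) or "category" (ACSA)
    "base_path": "./data/MAMS-ATSA",  # assumed directory name
}

# Which dataset version each mode expects.
EXPECTED_TASK = {"term": "ATSA", "category": "ACSA"}


def check_config(cfg):
    """Return a list of problems; an empty list means the checks passed."""
    problems = []
    mode = cfg.get("mode")
    if mode not in EXPECTED_TASK:
        problems.append(f"mode must be 'term' or 'category', got {mode!r}")
        return problems
    # Assumption: the dataset folder name mentions its task (ATSA or ACSA).
    folder = os.path.basename(os.path.normpath(cfg.get("base_path", "")))
    task = EXPECTED_TASK[mode]
    if task not in folder.upper():
        problems.append(f"mode {mode!r} expects an {task} dataset, but base_path is {folder!r}")
    return problems


print(check_config(config))
```

If the folder names in your copy of the data do not mention the task, the name-based test should be replaced with whatever distinguishes the two versions locally.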

Preprocessing

python preprocess.py

Train

python train.py

Test

python test.py
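The three steps above are meant to run in order, and a failed step should stop the later ones. One way to chain them is sketched below. The helper names pipeline_commands and run_pipeline are hypothetical; only the script names come from this README.

```python
import subprocess
import sys

# Script names as listed in this README, in the order they should run.
STEPS = ["preprocess.py", "train.py", "test.py"]


def pipeline_commands(python=sys.executable):
    """Build one command per step, using the given Python interpreter."""
    return [[python, script] for script in STEPS]


def run_pipeline(commands, runner=subprocess.run):
    """Run each command in order; check=True raises on the first non-zero exit."""
    for cmd in commands:
        runner(cmd, check=True)
```

Calling run_pipeline(pipeline_commands()) from the repository root runs the steps with the current interpreter. The runner parameter exists only so the sequence can be exercised without launching the real scripts.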

Acknowledgement

Our experiments use the pretrained BERT model provided by Hugging Face (https://github.com/huggingface/pytorch-transformers).
