All Projects → hate-alert → HateALERT-EVALITA

hate-alert / HateALERT-EVALITA

Licence: MIT license
Code for replicating results of team 'hateminers' at EVALITA-2018 for AMI task

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to HateALERT-EVALITA

KAREN
KAREN: Unifying Hatespeech Detection and Benchmarking
Stars: ✭ 18 (+38.46%)
Mutual labels:  tfidf, hatespeech
overview-and-benchmark-of-traditional-and-deep-learning-models-in-text-classification
NLP tutorial
Stars: ✭ 41 (+215.38%)
Mutual labels:  tfidf, glove-embeddings
HatefulUsersTwitter
Code for the paper "Characterizing and Detecting Hateful Users on Twitter"
Stars: ✭ 69 (+430.77%)
Mutual labels:  hate-speech
vita
Vita - Genetic Programming Framework
Stars: ✭ 24 (+84.62%)
Mutual labels:  classification
cnn-rnn-classifier
A practical example on how to combine both a CNN and a RNN to classify images.
Stars: ✭ 47 (+261.54%)
Mutual labels:  classification
toxicity
The world's largest social media toxicity dataset.
Stars: ✭ 135 (+938.46%)
Mutual labels:  hate-speech
scoruby
Ruby Scoring API for PMML
Stars: ✭ 69 (+430.77%)
Mutual labels:  classification
Orange3
🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+24146.15%)
Mutual labels:  classification
Conformer
Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition
Stars: ✭ 345 (+2553.85%)
Mutual labels:  classification
dzetsaka
dzetsaka : classification plugin for Qgis
Stars: ✭ 61 (+369.23%)
Mutual labels:  classification
SafeAI
Reusable, Easy-to-use Uncertainty module package built with Tensorflow, Keras
Stars: ✭ 13 (+0%)
Mutual labels:  classification
EC-GAN
EC-GAN: Low-Sample Classification using Semi-Supervised Algorithms and GANs (AAAI 2021)
Stars: ✭ 29 (+123.08%)
Mutual labels:  classification
Emotion-recognition-from-tweets
A comprehensive approach on recognizing emotion (sentiment) from a certain tweet. Supervised machine learning.
Stars: ✭ 17 (+30.77%)
Mutual labels:  tfidf
Relation-Classification
Relation Classification - SEMEVAL 2010 task 8 dataset
Stars: ✭ 46 (+253.85%)
Mutual labels:  classification
Color-Identification-using-Machine-Learning
This project explores colors in various images and then enables the user to query the images based on a given color.
Stars: ✭ 93 (+615.38%)
Mutual labels:  classification
FUTURE
A private, free, open-source search engine built on a P2P network
Stars: ✭ 19 (+46.15%)
Mutual labels:  glove-embeddings
Awesome-Tensorflow2
基于Tensorflow2开发的优秀扩展包及项目
Stars: ✭ 45 (+246.15%)
Mutual labels:  classification
awesome-text-classification
Text classification meets word embeddings.
Stars: ✭ 27 (+107.69%)
Mutual labels:  classification
classy
Super simple text classifier using Naive Bayes. Plug-and-play, no dependencies
Stars: ✭ 12 (-7.69%)
Mutual labels:  classification
embeddings
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
Stars: ✭ 27 (+107.69%)
Mutual labels:  classification

HitCount PWC

AMI-EVALITA2018

Code for replicating results of team 'hateminers' ( now hate-alert) at EVALITA-2018 for 'Automatic Misogyny Identification' task. We have explained the model details in our paper ''Hateminers : Detecting Hate speech against Women''. We came First for the English Subtask A and Fifth for the English Subtask B.

Please cite our paper in any published work that uses our model.

Saha, P., Mathew, B., Goyal, P. & Mukherjee, A., (2018). Hateminers: Detecting Hate speech against Women. arXiv preprint arXiv:1812.06700.

@article{saha2018hateminers,
  title={Hateminers: Detecting Hate speech against Women},
  author={Saha, Punyajoy and Mathew, Binny and Goyal, Pawan and Mukherjee, Animesh},
  journal={arXiv preprint arXiv:1812.06700},
  year={2018}
}

Directory structure

root
|
|-----AMI_data (data needs to be kept here) 
|
|-----Classifier
	|
	|-----taskA(models and feature selection methods for task A- misogynous or not)
	|
	|-----taskB1(models and features selection methods for task B1- misogyny category )
	|
	|-----taskB2(models and features selection methods for task B2- misogyny category )
		

Predicitons

In order to make the predicition feature vector consisting tfidf-vectors , glove embeddings , google-universal-encoding should be prepared.

clf_task1=joblib.load('taskA/'+model_for_taskA)
clf_task2=joblib.load('taskB1/'+model_for_taskB1)
clf_task3=joblib.load('taskB2/'+model_for_taskB2)
select_task1=joblib.load('taskA/'+select_featue_for_taskA)
select_task2=joblib.load('taskB1/'+select_featue_for_taskB1)
select_task3=joblib.load('taskB2/'+select_featue_for_taskB2)

Assuming x to be a feature vector.Models of Task B are only used when Task A model predicts the given text as misogynous.

x=x.reshape(1,-1)
temp=x
temp=select_task1.transform(temp)
predict1=clf_task1.predict(temp)

if(int(predict1[0])!=0):
	temp1=x
	temp1=select_task2.transform(temp1)
	predict2=clf_task2.predict(temp1)
	temp2=x
	temp2=select_task3.transform(temp2)
	predict3=clf_task3.predict(temp2)
else:
	list_2.append(0)
	list_3.append(0)

More details can be found in here

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].