unitaryai / Detoxify

Licence: apache-2.0
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers.

Projects that are alternatives of or similar to Detoxify

Kaggle Sentiment Popcorn
Code for the Kaggle competition "Bag of Words Meets Bags of Popcorn"
Stars: ✭ 45 (-76.06%)
Mutual labels:  kaggle-competition
Kaggle Competitions
There are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (-54.26%)
Mutual labels:  kaggle-competition
Open Solution Toxic Comments
Open solution to the Toxic Comment Classification Challenge
Stars: ✭ 154 (-18.09%)
Mutual labels:  kaggle-competition
Qh finsight
The first transfer-learning competition in China: FInSight team's entry in the Ping An Qianhai Credit "Haoxin Cup" Transfer Learning Big Data Algorithm Competition (ranked 3rd among algorithm solutions)
Stars: ✭ 55 (-70.74%)
Mutual labels:  kaggle-competition
Unet Tgs
Applying UNET Model on TGS Salt Identification Challenge hosted on Kaggle
Stars: ✭ 81 (-56.91%)
Mutual labels:  kaggle-competition
Segmentation
Tensorflow implementation : U-net and FCN with global convolution
Stars: ✭ 101 (-46.28%)
Mutual labels:  kaggle-competition
Kaggle Web Traffic Time Series Forecasting
Solution to Kaggle - Web Traffic Time Series Forecasting
Stars: ✭ 29 (-84.57%)
Mutual labels:  kaggle-competition
Kaggle dstl submission
Code for a winning model (3 out of 419) in a Dstl Satellite Imagery Feature Detection challenge
Stars: ✭ 159 (-15.43%)
Mutual labels:  kaggle-competition
Tgs Saltnet
Kaggle | 21st place solution for TGS Salt Identification Challenge
Stars: ✭ 81 (-56.91%)
Mutual labels:  kaggle-competition
Data science bowl 2018
My 5th place (out of 816 teams) solution to The 2018 Data Science Bowl organized by Booz Allen Hamilton
Stars: ✭ 147 (-21.81%)
Mutual labels:  kaggle-competition
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+525%)
Mutual labels:  kaggle-competition
Kaggle Notebooks
Sample notebooks for Kaggle competitions
Stars: ✭ 77 (-59.04%)
Mutual labels:  kaggle-competition
Kaggle Airbnb Recruiting New User Bookings
2nd Place Solution in Kaggle Airbnb New User Bookings competition
Stars: ✭ 118 (-37.23%)
Mutual labels:  kaggle-competition
Wheat
Wheat Detection challenge on Kaggle
Stars: ✭ 54 (-71.28%)
Mutual labels:  kaggle-competition
Machine Learning Workflow With Python
This is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (-16.49%)
Mutual labels:  kaggle-competition
Ml competition platform
Kaggle-like machine learning competition platform
Stars: ✭ 42 (-77.66%)
Mutual labels:  kaggle-competition
Deep Learning Boot Camp
A community run, 5-day PyTorch Deep Learning Bootcamp
Stars: ✭ 1,270 (+575.53%)
Mutual labels:  kaggle-competition
Cikm 2019 Analyticup
1st Solution for 2019-CIKM-Analyticup, Efficient and Novel Item Retrieval for Large-scale Online Shopping Recommendation
Stars: ✭ 173 (-7.98%)
Mutual labels:  kaggle-competition
Severstal Steel Defect Detection
Kaggle Segmentation Challenge
Stars: ✭ 160 (-14.89%)
Mutual labels:  kaggle-competition
Machine Learning And Data Science
This is a repository which contains all my work related Machine Learning, AI and Data Science. This includes my graduate projects, machine learning competition codes, algorithm implementations and reading material.
Stars: ✭ 137 (-27.13%)
Mutual labels:  kaggle-competition

🙊 Detoxify

Toxic Comment Classification with ⚡ Pytorch Lightning and 🤗 Transformers

News & Updates

15-02-2021: Detoxify featured in Scientific American!

14-01-2021: Lightweight models

  • Added smaller models trained with ALBERT for the original and unbiased models! These can be accessed in the same way with detoxify, using original-small and unbiased-small as inputs (see the sketch below). The original-small model achieved a mean AUC score of 0.98281 (0.98856 before) and the unbiased-small model achieved a final score of 0.93362 (0.94734 before).
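For example, the lightweight variants load by name just like the full models (a minimal sketch based on the usage shown in the Quick prediction section below):

from detoxify import Detoxify

# the ALBERT-based lightweight models are selected by name, same API as the full models
small_results = Detoxify('original-small').predict('example text')
print(small_results)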

Description

Trained models & code to predict toxic comments on 3 Jigsaw challenges: Toxic comment classification, Unintended Bias in Toxic comments, Multilingual toxic comment classification.

Built by Laura Hanu at Unitary, where we are working to stop harmful content online by interpreting visual content in context.

Dependencies:

  • For inference:
    • 🤗 Transformers
    • ⚡ Pytorch lightning
  • For training, you will also need:
    • Kaggle API (to download data)
Challenge | Year | Goal | Original Data Source | Detoxify Model Name | Top Kaggle Leaderboard Score | Detoxify Score
Toxic Comment Classification Challenge | 2018 | Build a multi-headed model capable of detecting different types of toxicity such as threats, obscenity, insults, and identity-based hate. | Wikipedia Comments | original | 0.98856 | 0.98636
Jigsaw Unintended Bias in Toxicity Classification | 2019 | Build a model that recognizes toxicity and minimizes unintended bias with respect to mentions of identities, using a dataset labeled for identity mentions and a metric designed to measure unintended bias. | Civil Comments | unbiased | 0.94734 | 0.93639
Jigsaw Multilingual Toxic Comment Classification | 2020 | Build effective multilingual models. | Wikipedia Comments + Civil Comments | multilingual | 0.9536 | 0.91655*

*This score is not directly comparable since it was obtained on the provided validation set rather than the test set. To be updated when the test labels are made available.

It is also worth noting that the top leaderboard scores were achieved with model ensembles, whereas the purpose of this library is to be user-friendly and straightforward to use.

Limitations and ethical considerations

If words associated with swearing, insults, or profanity are present in a comment, it is likely to be classified as toxic regardless of the tone or intent of the author, e.g. humorous or self-deprecating. This could introduce biases against already vulnerable minority groups.

The intended use of this library is for research purposes, for fine-tuning on carefully constructed datasets that reflect real-world demographics, and/or to help content moderators flag harmful content more quickly.

Some useful resources about the risk of different biases in toxicity or hate speech detection are:

Quick prediction

The multilingual model has been trained on 7 different languages, so it should only be tested on: English, French, Spanish, Italian, Portuguese, Turkish, or Russian.

# install detoxify
pip install detoxify

from detoxify import Detoxify

# each model takes in either a string or a list of strings
results = Detoxify('original').predict('example text')

results = Detoxify('unbiased').predict(['example text 1', 'example text 2'])

input_text = ['example text', 'exemple de texte', 'texto de ejemplo', 'testo di esempio', 'texto de exemplo', 'örnek metin', 'пример текста']
results = Detoxify('multilingual').predict(input_text)

# to specify the device the model will be allocated on (defaults to cpu); accepts any torch.device input
model = Detoxify('original', device='cuda')

# optional: display results nicely (requires pip install pandas)
import pandas as pd

print(pd.DataFrame(results, index=input_text).round(5))

For more details check the Prediction section.

Labels

All challenges have a toxicity label. The toxicity labels represent the aggregate ratings of up to 10 annotators, according to the following schema:

  • Very Toxic (a very hateful, aggressive, or disrespectful comment that is very likely to make you leave a discussion or give up on sharing your perspective)
  • Toxic (a rude, disrespectful, or unreasonable comment that is somewhat likely to make you leave a discussion or give up on sharing your perspective)
  • Hard to Say
  • Not Toxic

More information about the labelling schema can be found here.

Toxic Comment Classification Challenge

This challenge includes the following labels (illustrated in the sketch after this list):

  • toxic
  • severe_toxic
  • obscene
  • threat
  • insult
  • identity_hate
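A minimal sketch of what a prediction for the original model looks like with these labels, assuming the prediction dict is keyed by the challenge label names above (exact key names may vary between detoxify versions):

from detoxify import Detoxify

# hypothetical illustration: print the score for each challenge label, if present
results = Detoxify('original').predict('example text')
for label in ['toxic', 'severe_toxic', 'obscene', 'threat', 'insult', 'identity_hate']:
    print(label, results.get(label))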

Jigsaw Unintended Bias in Toxicity Classification

This challenge has 2 types of labels: the main toxicity labels and some additional identity labels that represent the identities mentioned in the comments.

Only identities with more than 500 examples in the test set (combined public and private) are included during training as additional labels and in the evaluation calculation.

  • toxicity
  • severe_toxicity
  • obscene
  • threat
  • insult
  • identity_attack
  • sexual_explicit

Identity labels used:

  • male
  • female
  • homosexual_gay_or_lesbian
  • christian
  • jewish
  • muslim
  • black
  • white
  • psychiatric_or_mental_illness

A complete list of all the identity labels available can be found here.

Jigsaw Multilingual Toxic Comment Classification

Since this challenge combines the data from the previous 2 challenges, it includes all of the labels above; however, the final evaluation is only on:

  • toxicity

How to run

First, install dependencies

# clone project   

git clone https://github.com/unitaryai/detoxify

# create virtual env

python3 -m venv toxic-env
source toxic-env/bin/activate

# install project   

pip install -e detoxify
cd detoxify

# for training
pip install -r requirements.txt

Prediction

Trained models summary:

Model name | Transformer type | Data from
original | bert-base-uncased | Toxic Comment Classification Challenge
unbiased | roberta-base | Unintended Bias in Toxicity Classification
multilingual | xlm-roberta-base | Multilingual Toxic Comment Classification

For a quick prediction, you can run the example script on a comment directly or on a txt file containing a list of comments.

# load model via torch.hub

python run_prediction.py --input 'example' --model_name original

# load model from checkpoint path

python run_prediction.py --input 'example' --from_ckpt_path model_path

# save results to a .csv file

python run_prediction.py --input test_set.txt --model_name original --save_to results.csv

# to see usage

python run_prediction.py --help

Checkpoints can be downloaded from the latest release or via the Pytorch hub API with the following names:

  • toxic_bert
  • unbiased_toxic_roberta
  • multilingual_toxic_xlm_r

model = torch.hub.load('unitaryai/detoxify', 'toxic_bert')
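If the hub checkpoint is used directly rather than through the Detoxify class, it still needs a tokenizer. A minimal sketch, assuming the hub entry returns a Hugging Face sequence-classification model and that the matching tokenizer is published as unitary/toxic-bert (both assumptions, not confirmed by this README):

import torch
from transformers import AutoTokenizer

# assumption: the hub entry returns a Hugging Face sequence-classification model
model = torch.hub.load('unitaryai/detoxify', 'toxic_bert')
# assumption: the matching tokenizer is available as unitary/toxic-bert
tokenizer = AutoTokenizer.from_pretrained('unitary/toxic-bert')

inputs = tokenizer(['example text'], return_tensors='pt', padding=True, truncation=True)
with torch.no_grad():
    scores = torch.sigmoid(model(**inputs).logits)  # one probability per label
print(scores)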

Importing detoxify in python:

from detoxify import Detoxify

results = Detoxify('original').predict('some text')

results = Detoxify('unbiased').predict(['example text 1','example text 2'])

input_text = ['example text', 'exemple de texte', 'texto de ejemplo', 'testo di esempio', 'texto de exemplo', 'örnek metin', 'пример текста']
results = Detoxify('multilingual').predict(input_text)

# to display results nicely
import pandas as pd

print(pd.DataFrame(results, index=input_text).round(5))

Training

If you do not already have a Kaggle account:

  • you need to create one to be able to download the data

  • go to My Account and click on Create New API Token - this will download a kaggle.json file

  • make sure this file is located in ~/.kaggle (see the sketch after this list)
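A minimal sketch for moving the token into place, assuming kaggle.json was downloaded to the current directory (the shell equivalent is just mkdir/mv/chmod):

import shutil
from pathlib import Path

# move the downloaded API token to the location the Kaggle CLI expects
kaggle_dir = Path.home() / '.kaggle'
kaggle_dir.mkdir(exist_ok=True)
shutil.move('kaggle.json', kaggle_dir / 'kaggle.json')
(kaggle_dir / 'kaggle.json').chmod(0o600)  # keep the token private; the CLI warns otherwise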

# create data directory

mkdir jigsaw_data
cd jigsaw_data

# download data

kaggle competitions download -c jigsaw-toxic-comment-classification-challenge

kaggle competitions download -c jigsaw-unintended-bias-in-toxicity-classification

kaggle competitions download -c jigsaw-multilingual-toxic-comment-classification

Start Training

Toxic Comment Classification Challenge

python create_val_set.py

python train.py --config configs/Toxic_comment_classification_BERT.json

Unintended Bias in Toxicity Challenge

python train.py --config configs/Unintended_bias_toxic_comment_classification_RoBERTa.json

Multilingual Toxic Comment Classification

This is trained in 2 stages. First, train on all available data, and second, train only on the translated versions of the first challenge.

The translated data can be downloaded from Kaggle in French, Spanish, Italian, Portuguese, Turkish, and Russian (the languages available in the test set).

# stage 1

python train.py --config configs/Multilingual_toxic_comment_classification_XLMR.json

# stage 2

python train.py --config configs/Multilingual_toxic_comment_classification_XLMR_stage2.json --resume path_to_saved_checkpoint_stage1

Monitor progress with tensorboard

tensorboard --logdir=./saved

Model Evaluation

Toxic Comment Classification Challenge

This challenge is evaluated on the mean AUC score of all the labels.

python evaluate.py --checkpoint saved/lightning_logs/checkpoints/example_checkpoint.pth --test_csv test.csv
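For reference, this metric is just the ROC AUC computed per label and averaged over the six labels; a minimal sketch with scikit-learn, where y_true and y_pred are hypothetical (n_samples, n_labels) arrays:

import numpy as np
from sklearn.metrics import roc_auc_score

def mean_label_auc(y_true, y_pred):
    # average the per-column ROC AUC over all toxicity labels
    return np.mean([roc_auc_score(y_true[:, i], y_pred[:, i]) for i in range(y_true.shape[1])])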

Unintended Bias in Toxicity Challenge

This challenge is evaluated on a novel bias metric that combines different AUC scores to balance overall performance with measures of unintended bias. More information on this metric can be found here.

python evaluate.py --checkpoint saved/lightning_logs/checkpoints/example_checkpoint.pth --test_csv test.csv

# to get the final bias metric
python model_eval/compute_bias_metric.py
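For reference, a minimal sketch of the competition's final bias metric (a weighted combination of the overall AUC and generalized means of three per-identity bias AUCs), following the public metric definition rather than the repo's model_eval/compute_bias_metric.py; all variable names are illustrative:

import numpy as np
from sklearn.metrics import roc_auc_score

POWER = -5     # power used in the generalized mean, per the competition definition
WEIGHT = 0.25  # the overall AUC and the three bias AUC terms are weighted equally

def power_mean(values, p=POWER):
    return np.power(np.mean(np.power(values, p)), 1.0 / p)

def final_bias_score(y_true, y_pred, subgroup_masks):
    # y_true: binary toxicity labels, y_pred: predicted scores,
    # subgroup_masks: dict of identity name -> boolean array marking comments that mention it
    overall_auc = roc_auc_score(y_true, y_pred)
    subgroup_aucs, bpsn_aucs, bnsp_aucs = [], [], []
    for mask in subgroup_masks.values():
        # subgroup AUC: only comments mentioning the identity
        subgroup_aucs.append(roc_auc_score(y_true[mask], y_pred[mask]))
        # BPSN AUC: background positives + subgroup negatives
        bpsn = (~mask & (y_true == 1)) | (mask & (y_true == 0))
        bpsn_aucs.append(roc_auc_score(y_true[bpsn], y_pred[bpsn]))
        # BNSP AUC: background negatives + subgroup positives
        bnsp = (~mask & (y_true == 0)) | (mask & (y_true == 1))
        bnsp_aucs.append(roc_auc_score(y_true[bnsp], y_pred[bnsp]))
    return (WEIGHT * overall_auc
            + WEIGHT * power_mean(subgroup_aucs)
            + WEIGHT * power_mean(bpsn_aucs)
            + WEIGHT * power_mean(bnsp_aucs))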

Multilingual Toxic Comment Classification

This challenge is evaluated on the AUC score of the main toxic label.

python evaluate.py --checkpoint saved/lightning_logs/checkpoints/example_checkpoint.pth --test_csv test.csv

Citation

@misc{Detoxify,
  title={Detoxify},
  author={Hanu, Laura and {Unitary team}},
  howpublished={Github. https://github.com/unitaryai/detoxify},
  year={2020}
}