Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → t-davidson → Hate Speech And Offensive Language

t-davidson / Hate Speech And Offensive Language

Licence: mit

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Labels

jupyter-notebook machine-learning nlp natural-language-processing dataset twitter classifier

Projects that are alternatives of or similar to Hate Speech And Offensive Language

Protest Detection Violence Estimation

Implementation of the model used in the paper Protest Activity Detection and Perceived Violence Estimation from Social Media Images (ACM Multimedia 2017)

Stars: ✭ 114 (-79.01%)

Mutual labels: jupyter-notebook, dataset, twitter

Anlp19

Course repo for Applied Natural Language Processing (Spring 2019)

Stars: ✭ 402 (-25.97%)

Mutual labels: jupyter-notebook, natural-language-processing

Comma2k19

A driving dataset for the development and validation of fused pose estimators and mapping algorithms

Stars: ✭ 391 (-27.99%)

Mutual labels: jupyter-notebook, dataset

Practical Nlp

Official Repository for 'Practical Natural Language Processing' by O'Reilly Media

Stars: ✭ 452 (-16.76%)

Mutual labels: jupyter-notebook, natural-language-processing

Nlp Python Deep Learning

NLP in Python with Deep Learning

Stars: ✭ 374 (-31.12%)

Mutual labels: jupyter-notebook, natural-language-processing

Vpgnet

VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition (ICCV 2017)

Stars: ✭ 382 (-29.65%)

Mutual labels: jupyter-notebook, dataset

Practical Pytorch

Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained

Stars: ✭ 4,329 (+697.24%)

Mutual labels: jupyter-notebook, natural-language-processing

Medmnist

[ISBI'21] MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis

Stars: ✭ 338 (-37.75%)

Mutual labels: jupyter-notebook, dataset

Nlp.js

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more

Stars: ✭ 4,670 (+760.04%)

Mutual labels: natural-language-processing, classifier

Ml Mipt

Open Machine Learning course at MIPT

Stars: ✭ 480 (-11.6%)

Mutual labels: jupyter-notebook, natural-language-processing

Doccano

Open source annotation tool for machine learning practitioners.

Stars: ✭ 5,600 (+931.31%)

Mutual labels: dataset, natural-language-processing

Data Science

Collection of useful data science topics along with code and articles

Stars: ✭ 315 (-41.99%)

Mutual labels: jupyter-notebook, natural-language-processing

Question generation

Neural question generation using transformers

Stars: ✭ 356 (-34.44%)

Mutual labels: jupyter-notebook, natural-language-processing

Transformers Tutorials

Github repo with tutorials to fine tune transformers for diff NLP tasks

Stars: ✭ 384 (-29.28%)

Mutual labels: jupyter-notebook, natural-language-processing

Nlp Papers With Arxiv

Statistics and accepted paper list of NLP conferences with arXiv link

Stars: ✭ 345 (-36.46%)

Mutual labels: jupyter-notebook, natural-language-processing

Code search

Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"

Stars: ✭ 436 (-19.71%)

Mutual labels: jupyter-notebook, natural-language-processing

Nlp Notebooks

A collection of notebooks for Natural Language Processing from NLP Town

Stars: ✭ 513 (-5.52%)

Mutual labels: jupyter-notebook, natural-language-processing

Whylogs

Profile and monitor your ML data pipeline end-to-end

Stars: ✭ 328 (-39.59%)

Mutual labels: jupyter-notebook, dataset

Dsprites Dataset

Dataset to assess the disentanglement properties of unsupervised learning methods

Stars: ✭ 340 (-37.38%)

Mutual labels: jupyter-notebook, dataset

Courses

Quiz & Assignment of Coursera

Stars: ✭ 454 (-16.39%)

Mutual labels: jupyter-notebook, natural-language-processing

View All Similar Projects ➔

Automated Hate Speech Detection and the Problem of Offensive Language

Repository for Thomas Davidson, Dana Warmsley, Michael Macy, and Ingmar Weber. 2017. "Automated Hate Speech Detection and the Problem of Offensive Language." ICWSM. You read the paper here.

NOTE: This repository is no longer actively maintained. Please do not post issues regarding the compatibility of the existing code with new versions of Python or the packages used.

2019 NEWS

We have a new paper on racial bias in this dataset and others, you can read it here

WARNING: The data, lexicons, and notebooks all contain content that is racist, sexist, homophobic, and offensive in many other ways.

You can find our labeled data in the data directory. We have included them as a pickle file (Python 2.7) and as a CSV. You will also find a notebook in the src directory containing Python 2.7 code to replicate our analyses in the paper and a lexicon in the lexicons directory that we generated to try to more accurately classify hate speech. The classifier directory contains a script, instructions, and the necessary files to run our classifier on new data, a test case is provided.

Please cite our paper in any published work that uses any of these resources.

@inproceedings{hateoffensive,
  title = {Automated Hate Speech Detection and the Problem of Offensive Language},
  author = {Davidson, Thomas and Warmsley, Dana and Macy, Michael and Weber, Ingmar}, 
  booktitle = {Proceedings of the 11th International AAAI Conference on Web and Social Media},
  series = {ICWSM '17},
  year = {2017},
  location = {Montreal, Canada},
  pages = {512-515}
  }

Contact We would also appreciate it if you could fill out this short form if you are interested in using our data so we can keep track of how these data are used and get in contact with researchers working on similar problems.

If you have any questions please contact trd54 at cornell dot edu.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 543

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗