All Projects → ewulczyn → Wiki Detox

ewulczyn / Wiki Detox

Licence: other
See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse

Projects that are alternatives of or similar to Wiki Detox

Objectdetection
Some experiments with object detection in PyTorch
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Hands On Recommendation Systems With Python
Hands-On Recommendation Systems with Python published by Packt
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Dstl unet
Dstl Satellite Imagery Feature Detection
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Speechcmdrecognition
A neural attention model for speech command recognition
Stars: ✭ 116 (-0.85%)
Mutual labels:  jupyter-notebook
Cvnd Image Captioning Project
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Dynamic neural manifold
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
How To Build Own Text Summarizer Using Deep Learning
In this notebook, we will build an abstractive based text summarizer using deep learning from the scratch in python using keras
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Python Machine Learning Zh
Python机器学习,机器学习入门首选。
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Perfil Politico
A platform for profiling public figures in Brazilian politics
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Hyperopt Doc Zh
Github开源项目hyperopt系列的中文文档,以及学习教程等
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Hands On Data Analysis With Pandas
Materials for following along with Hands-On Data Analysis with Pandas.
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Notebooks
Curated Notebooks from STScI
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Docker For Data Science Tutorial
Materials for "Docker for Data Science" tutorial presented at PyCon 2018 in Cleveland, OH
Stars: ✭ 118 (+0.85%)
Mutual labels:  jupyter-notebook
Ruijin round1
瑞金医院MMC人工智能辅助构建知识图谱大赛初赛
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Ml Fraud Detection
Credit card fraud detection through logistic regression, k-means, and deep learning.
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Blog
Source code for my personal blog
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Dl cshse ami
Материалы курса "Глубинное обучение", ФКН ВШЭ, бакалаврская программа ПМИ
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
The Building Data Genome Project
A collection of non-residential buildings for performance analysis and algorithm benchmarking
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Python Mathematics Handbook
A set of notebooks for an introduction to Python for Mathematicians.
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook
Crypto portfolio analysis
A Jupyter notebook I use to analyze my crypto portfolio.
Stars: ✭ 117 (+0%)
Mutual labels:  jupyter-notebook

Wikipedia Detox

The repository is part of the Wikipedia Detox Research project. See the getting started guide to build your own models and run your own experiments.

This repository hold the codebase associated with the paper Ex Machina: Personal Attacks Seen at Scale by Ellery Wulczyn, Nithum Thain, Lucas Dixon, published in Feb 2017 and presented at WWW-2017.

More recent development is now happening in the repositories of https://conversationai.github.io/

Setup using python virtual env

Assumes you have python/pip installed and setup.

Setup your ptyhon virtual env (assumes python 3.5)

# Setup a new python virtual env for this project; only needs to be done once
# per setup
virtualenv -p python3.5 tmp/env
source tmp/env/bin/activate
pip3 install -r requirements.txt

Test it works:

# Enter you python virtual environment
source tmp/env/bin/activate
echo '
import tensorflow as tf
hello = tf.constant("Hello, TensorFlow!")
sess = tf.Session()
print(sess.run(hello))
' | python

Which should output:

b'Hello, TensorFlow!'

Setup datasets and train models from Figshare data

Assumes you have setup your python virtual environment.

# Enter the python virtual env
source tmp/env/bin/activate
# Create the local datasets and models directories.
mkdir -p tmp/datasets && mkdir -p tmp/models
# Download datasets and train models
python src/modeling/get_prod_models.py --task recipient_attack \
  --data_dir tmp/datasets --model_dir ${PWD}/tmp/models
python src/modeling/get_prod_models.py --task attack \
  --data_dir tmp/datasets --model_dir ${PWD}/tmp/models
python src/modeling/get_prod_models.py --task aggression \
  --data_dir tmp/datasets --model_dir tmp/models
python src/modeling/get_prod_models.py --task aggression \
  --data_dir tmp/datasets --model_dir tmp/models
ln -s ./tmp/models ./models

Start a jupyter notebook

# Enter the python virtual env
source tmp/env/bin/activate
# Start jupyter
jupyter notebook
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].