All Projects → uclanlp → Reducingbias

uclanlp / Reducingbias

Projects that are alternatives of or similar to Reducingbias

Wiki generator live
live code
Stars: ✭ 53 (-1.85%)
Mutual labels:  jupyter-notebook
Challengehub Baselines
ChallengeHub开源的各大比赛baseline集合
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Style Transfer Colab
Google Colab Notebook for Image and Video Style Transfer Using TensorFlow
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Info490 Sp17
Advanced Data Science, University of Illinois Spring 2017
Stars: ✭ 53 (-1.85%)
Mutual labels:  jupyter-notebook
Mastering Python Data Analysis
Mastering-Python-Data-Analysis
Stars: ✭ 53 (-1.85%)
Mutual labels:  jupyter-notebook
Facenet Face Recognition
This is the research product of the thesis manifold Learning of Latent Space Vectors in GAN for Image Synthesis. This has an application to the research, name a facial recognition system. The application was developed by consulting the FaceNet model.
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Stock Market Prediction Using Natural Language Processing
We used Machine learning techniques to evaluate past data pertaining to the stock market and world affairs of the corresponding time period, in order to make predictions in stock trends. We built a model that will be able to buy and sell stock based on profitable prediction, without any human interactions. The model uses Natural Language Processing (NLP) to make smart “decisions” based on current affairs, article, etc. With NLP and the basic rule of probability, our goal is to increases the accuracy of the stock predictions.
Stars: ✭ 53 (-1.85%)
Mutual labels:  jupyter-notebook
Cometts
Comet Time Series Toolset for working with a time-series of remote sensing imagery and user defined polygons
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Pyspark Setup Guide
A guide for setting up Spark + PySpark under Ubuntu linux
Stars: ✭ 53 (-1.85%)
Mutual labels:  jupyter-notebook
Danet
Deep Attractor Network (DANet) for single-channel speech separation
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Tutoriais De Am
Algoritmos de aprendizado de máquina criados manualmente para maior compreensão das suas funcionalidades
Stars: ✭ 53 (-1.85%)
Mutual labels:  jupyter-notebook
Blog of baojie
Some articles written by Bao Jie
Stars: ✭ 53 (-1.85%)
Mutual labels:  jupyter-notebook
Tianchi ship 2019
天池智慧海洋 2019 https://tianchi.aliyun.com/competition/entrance/231768/introduction?spm=5176.12281949.1003.1.493e5cfde2Jbke
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Mypresentations
this is my presentaion area .个人演讲稿展示区,主要展示一些平时的个人演讲稿或者心得之类的,
Stars: ✭ 53 (-1.85%)
Mutual labels:  jupyter-notebook
Tulingxueyuan
python笔记
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Commitgen
Code and data for the paper "A Neural Architecture for Generating Natural Language Descriptions from Source Code Changes"
Stars: ✭ 53 (-1.85%)
Mutual labels:  jupyter-notebook
Trdesign
trRosetta for protein design
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Ct lung segmentation
Robust segmentation of lung and airway in CT scans
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Ga in python
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook
Bts Pytorch
PyTorch implementation of BTS Depth Estimator
Stars: ✭ 54 (+0%)
Mutual labels:  jupyter-notebook

Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints

Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, Kai-Wei Chang. EMNLP 2017

Please nevigate the code through this jupyter notebook

For details, please refer to this paper

  • Abstract

Language is increasingly being used to define rich visual recognition problems with supporting image collections sourced from the web. Structured prediction models are used in these tasks to take advantage of correlations between co-occurring labels and visual input but risk inadvertently encoding social biases found in web corpora. For example, in the following image, it is possible to predict the place is the kitchen, because it is the common place for the activity cooking. However, in subfigure 4, the model predicts the agent as a woman even though it is a man, which is caused by the inappropriate correlations between the activity cooking and the female gender.

bias
Structure prediction can help the model to build the correlations between different parts. However it will also cause some bias problem.

In our work, we study data and models associated with multilabel object classification (MLC) and visual semantic role labeling (vSRL). We find that (a) datasets for these tasks contain significant gender bias and (b) models trained on these datasets further amplify existing bias. For example, the activity cooking is over 33% more likely to involve females than males in a training set, and a trained model further amplifies the disparity to 68% at test time. We propose to inject corpus-level constraints for calibrating existing structured prediction models and design an algorithm based on Lagrangian relaxation for collective inference. Our method results in almost no performance loss for the underlying recognition task but decreases the magnitude of bias amplification by 47.5% and 40.5% for multilabel classification and visual semantic role labeling, respectively.

  • Source Code

We provide our calibration function in file "fairCRF_gender_ratio.ipynb". It is based on the Lagrangian Relaxation algorithm. You need to provide your own inference algorithm and also the algorithm you used to get the accuracy performance. The function also needs you to provide your own constraints. We give detailed description about the parameters in the jupyter notebook and we also provide the running example for both vSRL and MLC tasks.

To run the vSRL task, you need to have caffe installed in your machine. If you just want to run with the sampled data, be sure to download the .prototxt files from the data/imSitu/ folder and put them to the folder ("crf_path" in our case) in the same level where caffe is installed. All the other files are also provided under data/imSitu/. Remember to modify all the path in the config.ini file with absolute path.

  • Data

(Update 11/12/2018) For the sampled potentials for imSitu can be found here: dev and test.

We provide all the potential scores for MS-COCO dataset in data/COCO folder. For complete imSitu potentials, download at here.

  • Reference

    Please cite
@InProceedings{zhao-EtAl:2017:EMNLP20173,
 author    = {Zhao, Jieyu  and  Wang, Tianlu  and  Yatskar, Mark  and  Ordonez, Vicente  and  Chang, Kai-Wei},
 title     = {Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints},
 booktitle = {Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing},
 year      = {2017},
 pages     = {2941--2951},
 url       = {https://www.aclweb.org/anthology/D17-1319}
}
  • Note

    The accuracy performance on the MLC tasks is improved. The updated results are:
Performance (%)
MLC: Development Set MLC: Test Set
CRF: 45.31 CRF: 45.46
CRF+RBA 45.24 CRF+RBA 45.41
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].