Licence: MIT License
Generative Forests in Python

Projects that are alternatives of or similar to GeFs

arboreto
A scalable python-based framework for gene regulatory network inference using tree-based ensemble regressors.
Stars: ✭ 33 (+43.48%)
Mutual labels:  random-forest
Robust-Semantic-Segmentation
Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation (ICCV2021)
Stars: ✭ 25 (+8.7%)
Mutual labels:  robustness
EurekaTrees
Visualizes the Random Forest debug string from the MLLib in Spark using D3.js
Stars: ✭ 37 (+60.87%)
Mutual labels:  random-forest
yggdrasil-decision-forests
A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models.
Stars: ✭ 156 (+578.26%)
Mutual labels:  random-forest
RayS
RayS: A Ray Searching Method for Hard-label Adversarial Attack (KDD2020)
Stars: ✭ 43 (+86.96%)
Mutual labels:  robustness
missRanger
R package "missRanger" for fast imputation of missing values by random forests.
Stars: ✭ 42 (+82.61%)
Mutual labels:  random-forest
cycle-confusion
Code and models for ICCV2021 paper "Robust Object Detection via Instance-Level Temporal Cycle Confusion".
Stars: ✭ 67 (+191.3%)
Mutual labels:  robustness
AdaptiveRandomForest
Repository for the AdaptiveRandomForest algorithm implemented in MOA 2016-04
Stars: ✭ 28 (+21.74%)
Mutual labels:  random-forest
forestError
A Unified Framework for Random Forest Prediction Error Estimation
Stars: ✭ 23 (+0%)
Mutual labels:  random-forest
DiagnoseRE
Source code and dataset for the CCKS201 paper "On Robustness and Bias Analysis of BERT-based Relation Extraction"
Stars: ✭ 23 (+0%)
Mutual labels:  robustness
s-attack
[CVPR 2022] S-attack library. Official implementation of two papers "Vehicle trajectory prediction works, but not everywhere" and "Are socially-aware trajectory prediction models really socially-aware?".
Stars: ✭ 51 (+121.74%)
Mutual labels:  robustness
Amazon-Fine-Food-Review
Machine learning algorithms such as KNN, Naive Bayes, Logistic Regression, SVM, Decision Trees, Random Forest, k-means and Truncated SVD on the Amazon Fine Food Reviews dataset
Stars: ✭ 28 (+21.74%)
Mutual labels:  random-forest
belay
Robust error-handling for Kotlin and Android
Stars: ✭ 35 (+52.17%)
Mutual labels:  robustness
shortcut-perspective
Figures & code from the paper "Shortcut Learning in Deep Neural Networks" (Nature Machine Intelligence 2020)
Stars: ✭ 67 (+191.3%)
Mutual labels:  robustness
Adversarial-Distributional-Training
Adversarial Distributional Training (NeurIPS 2020)
Stars: ✭ 52 (+126.09%)
Mutual labels:  robustness
robustness-vit
Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).
Stars: ✭ 78 (+239.13%)
Mutual labels:  robustness
MLDay18
Material from "Random Forests and Gradient Boosting Machines in R" presented at Machine Learning Day '18
Stars: ✭ 15 (-34.78%)
Mutual labels:  random-forest
Breast-Cancer-Scikitlearn
simple tutorial on Machine Learning with Scikitlearn
Stars: ✭ 33 (+43.48%)
Mutual labels:  random-forest
adversarial-vision-challenge
NIPS Adversarial Vision Challenge
Stars: ✭ 39 (+69.57%)
Mutual labels:  robustness
urb-studies-predicting-gentrification
This repo is intended to support replication and exploration of the analysis undertaken for our Urban Studies article "Understanding urban gentrification through Machine Learning: Predicting neighbourhood change in London".
Stars: ✭ 35 (+52.17%)
Mutual labels:  random-forest

GeFs - Generative Forests in Python

Generative Forests are a class of Probabilistic Circuits (PCs) that subsumes Random Forests. They maintain the discriminative structure learning and overall predictive performance of Random Forests, while extending them to a full generative model over the joint p(X, y). This enhances Random Forests with principled methods for

  • Outlier detection
  • Robust classification
  • Inference with missing values

For an in-depth overview of Generative Forests (GeFs), please check our paper Joints in Random Forests (NeurIPS 2020).

This repository reproduces the experiments provided in the papers Joints in Random Forests and Towards Robust Classification with Deep Generative Forests. See the experiments folder for the experimental set-up.

Installation

To install GeFs, it suffices to run pip install . at the root directory of this repository. This project was developed for Python 3 and most likely will not run in Python 2.

Requirements

The required packages are installed with pip, or are available in requirements.txt if you prefer not to install this package via pip. We list the requirements here for the sake of completeness.

  • numba>=0.49
  • numpy
  • pandas
  • scipy>=1.5
  • sklearn
  • tqdm

Usage

We learn the structure of a GeF as in a regular Random Forest. For ease of use, we keep signatures similar to those of the scikit-learn implementation. Once the structure is learned, we convert it to a GeF with the topc() method, as in the following snippet.

from gefs import RandomForest
from prep import get_data, train_test_split

data, ncat = get_data(name)  # Preprocess the data. Here `name` is a string for the dataset of choice (see the data repository).
# ncat is the number of categories of each variable in the data
X_train, X_test, y_train, y_test, data_train, data_test = train_test_split(data, ncat)
rf = RandomForest(n_estimators=30, ncat=ncat)  # Train a Random Forest
rf.fit(X_train, y_train)
gef = rf.topc()  # Convert to a GeF

Currently, topc() fits a GeF by extending the leaves either with a fully-factorised distribution (default) or with another PC learned via LearnSPN. This behaviour is controlled by the learnspn parameter, which gives the minimum number of samples required to run LearnSPN. For instance, rf.topc(learnspn=30) would run LearnSPN for every leaf in the Random Forest with more than 30 samples.
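As a sketch of the thresholding rule above (plain Python with hypothetical leaf sample counts; the inequality follows the description in the text, not inspected library code):

```python
# Hypothetical number of training samples reaching each leaf.
leaf_counts = [12, 45, 30, 88]

# With rf.topc(learnspn=30), LearnSPN would run on leaves with more
# than 30 samples; the rest get a fully-factorised distribution.
use_learnspn = [count > 30 for count in leaf_counts]
print(use_learnspn)  # [False, True, False, True]
```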

Classification is performed either by averaging the prediction of each tree (classify_avg method) or by defining a mixture over them (classify method).

y_pred_avg = gef.classify_avg(X_test)
y_pred_mixture = gef.classify(X_test)

Note that, since GeFs are generative models, we can predict any categorical variable in the data, not just the class variable. Therefore, we need to pass the index of the variable we want to predict via the classcol parameter. In the datasets provided here, the class variable is always the last one, hence data.shape[1]-1.
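A small sketch of this indexing convention on toy data (the gefs call appears only as a comment, with the classcol signature as described above):

```python
import numpy as np

# Toy dataset: 5 observations, 4 variables, class variable in the last column.
data = np.arange(20).reshape(5, 4)
classcol = data.shape[1] - 1  # index of the class variable

# With a trained GeF, one would then call, e.g.:
# y_pred = gef.classify(X_test, classcol=classcol)
print(classcol)  # 3
```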

Computing Robustness Values

Robustness values can be computed with the compute_rob_class function.

from gefs import compute_rob_class
pred, rob = compute_rob_class(gef.root, X_test, data.shape[1]-1, int(ncat[-1]))

The function returns the prediction and the robustness value of each instance in X_test. Note that compute_rob_class requires the index and the number of categories of the target variable as third and fourth parameters.
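Robustness values can be used, for instance, to rank test instances by how uncertain the model is about them. A minimal numpy sketch with hypothetical robustness values (illustrative numbers, not actual gefs output):

```python
import numpy as np

# Hypothetical robustness values, one per test instance, as returned
# by compute_rob_class alongside the predictions.
rob = np.array([0.45, 0.05, 0.30, 0.12])

# Indices of the least robust (most uncertain) predictions first.
order = np.argsort(rob)
print(order.tolist())  # [1, 3, 2, 0]
```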

Computing log-densities

The log-density of each sample can be computed with the log_likelihood function.

logs = gef.log_likelihood(data_test)

If data_test is a matrix of n observations and m variables, logs will be an array of size n, containing log(p(x)) for each observation x in data_test.
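Since a low log-density means a sample is unlikely under the model, the returned array can be used directly for outlier detection (one of the features listed above). A small numpy sketch with hypothetical log-densities (the threshold is illustrative, not part of the gefs API):

```python
import numpy as np

# Hypothetical log-densities for three test observations.
logs = np.array([-2.3, -0.7, -5.1])

avg_ll = logs.mean()      # average log-likelihood over the test set
densities = np.exp(logs)  # back to p(x) if densities are needed
outliers = logs < -4.0    # flag unusually low-density samples
print(outliers.tolist())  # [False, False, True]
```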

References

If you find GeFs useful, please consider citing us in your work:

@article{correia2020joints,
  title={Joints in Random Forests},
  author={Correia, A. H. C. and Peharz, Robert and de Campos, C. P.},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}
}

@article{correia2020towards,
  title={Towards Robust Classification with Deep Generative Forests},
  author={Correia, A. H. C. and Peharz, R. and de Campos, C. P.},
  journal={ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning},
  year={2020}
}