
dongyp13 / Adversarial-Distributional-Training

License: MIT License
Adversarial Distributional Training (NeurIPS 2020)

Programming Languages

Python

Projects that are alternatives of or similar to Adversarial-Distributional-Training

Adversarial-Patch-Training
Code for the paper: Adversarial Training Against Location-Optimized Adversarial Patches. ECCV-W 2020.
Stars: ✭ 30 (-42.31%)
Mutual labels:  adversarial-machine-learning, adversarial-training, adversarial-defense
FeatureScatter
Feature Scattering Adversarial Training
Stars: ✭ 64 (+23.08%)
Mutual labels:  adversarial-machine-learning, adversarial-training
Robust-Semantic-Segmentation
Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation (ICCV2021)
Stars: ✭ 25 (-51.92%)
Mutual labels:  robustness, adversarial-training
robust-local-lipschitz
A Closer Look at Accuracy vs. Robustness
Stars: ✭ 75 (+44.23%)
Mutual labels:  robustness, adversarial-machine-learning
advrank
Adversarial Ranking Attack and Defense, ECCV, 2020.
Stars: ✭ 19 (-63.46%)
Mutual labels:  adversarial-machine-learning, adversarial-defense
Denoised-Smoothing-TF
Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.
Stars: ✭ 19 (-63.46%)
Mutual labels:  robustness, adversarial-defense
AWP
Codes for NeurIPS 2020 paper "Adversarial Weight Perturbation Helps Robust Generalization"
Stars: ✭ 114 (+119.23%)
Mutual labels:  adversarial-training, neurips-2020
pFedMe
Personalized Federated Learning with Moreau Envelopes (pFedMe) using Pytorch (NeurIPS 2020)
Stars: ✭ 196 (+276.92%)
Mutual labels:  neurips-2020
perceptual-advex
Code and data for the ICLR 2021 paper "Perceptual Adversarial Robustness: Defense Against Unseen Threat Models".
Stars: ✭ 44 (-15.38%)
Mutual labels:  robustness
translearn
Code implementation of the paper "With Great Training Comes Great Vulnerability: Practical Attacks against Transfer Learning", at USENIX Security 2018
Stars: ✭ 18 (-65.38%)
Mutual labels:  adversarial-machine-learning
s-attack
[CVPR 2022] S-attack library. Official implementation of two papers "Vehicle trajectory prediction works, but not everywhere" and "Are socially-aware trajectory prediction models really socially-aware?".
Stars: ✭ 51 (-1.92%)
Mutual labels:  robustness
adversarial-code-generation
Source code for the ICLR 2021 work "Generating Adversarial Computer Programs using Optimized Obfuscations"
Stars: ✭ 16 (-69.23%)
Mutual labels:  adversarial-machine-learning
shortcut-perspective
Figures & code from the paper "Shortcut Learning in Deep Neural Networks" (Nature Machine Intelligence 2020)
Stars: ✭ 67 (+28.85%)
Mutual labels:  robustness
tulip
Scaleable input gradient regularization
Stars: ✭ 19 (-63.46%)
Mutual labels:  adversarial-machine-learning
belay
Robust error-handling for Kotlin and Android
Stars: ✭ 35 (-32.69%)
Mutual labels:  robustness
eeg-gcnn
Resources for the paper titled "EEG-GCNN: Augmenting Electroencephalogram-based Neurological Disease Diagnosis using a Domain-guided Graph Convolutional Neural Network". Accepted for publication (with an oral spotlight!) at ML4H Workshop, NeurIPS 2020.
Stars: ✭ 50 (-3.85%)
Mutual labels:  neurips-2020
adan
Language-Adversarial Training for Cross-Lingual Text Classification (TACL)
Stars: ✭ 60 (+15.38%)
Mutual labels:  adversarial-training
RayS
RayS: A Ray Searching Method for Hard-label Adversarial Attack (KDD2020)
Stars: ✭ 43 (-17.31%)
Mutual labels:  robustness
robustness-vit
Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).
Stars: ✭ 78 (+50%)
Mutual labels:  robustness
cycle-confusion
Code and models for ICCV2021 paper "Robust Object Detection via Instance-Level Temporal Cycle Confusion".
Stars: ✭ 67 (+28.85%)
Mutual labels:  robustness

Adversarial Distributional Training

This repository contains the code for adversarial distributional training (ADT), introduced in the following paper:

Adversarial Distributional Training for Robust Deep Learning (NeurIPS 2020)

Yinpeng Dong*, Zhijie Deng*, Tianyu Pang, Hang Su, and Jun Zhu (* indicates equal contribution)

Citation

If you find our methods useful, please consider citing:

@inproceedings{dong2020adversarial,
  title={Adversarial Distributional Training for Robust Deep Learning},
  author={Dong, Yinpeng and Deng, Zhijie and Pang, Tianyu and Su, Hang and Zhu, Jun},
  booktitle={Advances in Neural Information Processing Systems},
  year={2020}
}

Introduction

Adversarial distributional training (ADT) is a new framework for training robust deep learning models. It is formulated as a minimax optimization problem, in which the inner maximization aims to learn an adversarial distribution for each natural input to characterize potential adversarial examples, and the outer minimization aims to optimize the DNN parameters under the worst-case adversarial distributions.

In the paper, we propose three different approaches to parameterize the adversarial distributions, as illustrated below.

Figure 1: An illustration of three different ADT methods, including (a) ADTEXP; (b) ADTEXP-AM; (c) ADTIMP-AM.
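
For intuition, the sketch below shows one training step of this minimax procedure. It is a simplified illustration only: the Gaussian parameterization, the entropy weight, and all function and hyperparameter names here are assumptions made for clarity, not the exact implementation in adt_exp.py.

import torch
import torch.nn.functional as F

def adt_train_step(model, optimizer, x, y, epsilon=8/255, inner_steps=7, inner_lr=0.1, lam=0.01):
    """One ADT-style minimax step (illustrative sketch, not the official code)."""
    # Inner maximization: fit a per-example Gaussian over perturbations.
    mu = torch.zeros_like(x, requires_grad=True)
    log_std = torch.full_like(x, -3.0, requires_grad=True)
    inner_opt = torch.optim.Adam([mu, log_std], lr=inner_lr)
    for _ in range(inner_steps):
        noise = torch.randn_like(x)                                  # reparameterization trick
        delta = torch.tanh(mu + noise * log_std.exp()) * epsilon     # keep the perturbation in the l_inf ball
        adv_loss = F.cross_entropy(model(torch.clamp(x + delta, 0.0, 1.0)), y)
        entropy = log_std.sum(dim=(1, 2, 3)).mean()                  # entropy surrogate: favor spread-out distributions
        inner_opt.zero_grad()
        (-(adv_loss + lam * entropy)).backward()                     # gradient ascent on the regularized objective
        inner_opt.step()
    # Outer minimization: update the model on a sample from the learned distribution.
    with torch.no_grad():
        delta = torch.tanh(mu + torch.randn_like(x) * log_std.exp()) * epsilon
    loss = F.cross_entropy(model(torch.clamp(x + delta, 0.0, 1.0)), y)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()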

Prerequisites

  • Python (3.6.8)
  • PyTorch (1.3.0)
  • torchvision (0.4.1)
  • numpy
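
If these are not already installed, they can be set up with pip, for example (the version pins follow the list above; newer versions may also work but are not what the code was tested with):

pip install torch==1.3.0 torchvision==0.4.1 numpy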

Training

We propose three different ADT methods. The training command for each method is given below.

Training ADTEXP

python adt_exp.py --model-dir adt-exp --dataset cifar10 (or cifar100/svhn)

Training ADTEXP-AM

python adt_expam.py --model-dir adt-expam --dataset cifar10 (or cifar100/svhn)

Training ADTIMP-AM

python adt_impam.py --model-dir adt-impam --dataset cifar10 (or cifar100/svhn)

The checkpoints will be saved in each model folder.
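
For example, to train all three variants on CIFAR-100, the commands above can be run in sequence (the --model-dir names here are only suggestions):

python adt_exp.py --model-dir adt-exp-cifar100 --dataset cifar100
python adt_expam.py --model-dir adt-expam-cifar100 --dataset cifar100
python adt_impam.py --model-dir adt-impam-cifar100 --dataset cifar100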

Evaluation

Evaluation under White-box Attacks

  • For FGSM attack, run
python evaluate_attacks.py --model-path ${MODEL-PATH} --attack-method FGSM --dataset cifar10 (or cifar100/svhn)
  • For PGD attack, run
python evaluate_attacks.py --model-path ${MODEL-PATH} --attack-method PGD --num-steps 20 (or 100) --dataset cifar10 (or cifar100/svhn)
  • For MIM attack, run
python evaluate_attacks.py --model-path ${MODEL-PATH} --attack-method MIM --num-steps 20 --dataset cifar10 (or cifar100/svhn)
  • For C&W attack, run
python evaluate_attacks.py --model-path ${MODEL-PATH} --attack-method CW --num-steps 30 --dataset cifar10 (or cifar100/svhn)
  • For FeaAttack, run
python feature_attack.py --model-path ${MODEL-PATH} --dataset cifar10 (or cifar100/svhn)
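
To run the full white-box suite against a single checkpoint, the commands above can simply be chained; for example, on CIFAR-10 (${MODEL-PATH} is a placeholder for your checkpoint path):

python evaluate_attacks.py --model-path ${MODEL-PATH} --attack-method FGSM --dataset cifar10
python evaluate_attacks.py --model-path ${MODEL-PATH} --attack-method PGD --num-steps 20 --dataset cifar10
python evaluate_attacks.py --model-path ${MODEL-PATH} --attack-method MIM --num-steps 20 --dataset cifar10
python evaluate_attacks.py --model-path ${MODEL-PATH} --attack-method CW --num-steps 30 --dataset cifar10
python feature_attack.py --model-path ${MODEL-PATH} --dataset cifar10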

Evaluation under Transfer-based Black-box Attacks

First change the --white-box-attack argument in evaluate_attacks.py to False. Then run

python evaluate_attacks.py --source-model-path ${SOURCE-MODEL-PATH} --target-model-path ${TARGET-MODEL-PATH} --attack-method PGD (or MIM)

Evaluation under SPSA

python spsa.py --model-path ${MODEL-PATH} --samples_per_draw 256 (or 512/1024/2048)
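
As background, SPSA estimates gradients purely from model queries using random finite differences, which is why it needs no gradient access and why larger --samples_per_draw values give less noisy estimates. A minimal sketch of the estimator follows; the function and argument names are illustrative, not the interface of spsa.py.

import torch

def spsa_gradient(loss_fn, x, delta=0.01, samples=256):
    """Estimate d(loss)/d(x) from queries only (simplified SPSA sketch)."""
    grad = torch.zeros_like(x)
    for _ in range(samples // 2):
        v = torch.empty_like(x).bernoulli_(0.5) * 2 - 1           # random +/-1 (Rademacher) direction
        diff = loss_fn(x + delta * v) - loss_fn(x - delta * v)    # two-sided finite difference
        grad += diff / (2 * delta) * v                            # for +/-1 entries, multiplying by v equals dividing by v
    return grad / (samples // 2)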

Pretrained Models

We have provided the pre-trained models on CIFAR-10, whose performance is reported in Table 1. They can be downloaded at

Contact

Yinpeng Dong: [email protected]

Zhijie Deng: [email protected]
