All Projects → sdv-dev → Ctgan

sdv-dev / Ctgan

Licence: mit
Conditional GAN for generating synthetic tabular data.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Ctgan

Tgan
Generative adversarial training for generating synthetic tabular data.
Stars: ✭ 173 (-41.75%)
Mutual labels:  generative-adversarial-network, tabular-data
Sdv
Synthetic Data Generation for tabular, relational and time series data.
Stars: ✭ 360 (+21.21%)
Mutual labels:  generative-adversarial-network, data-generation
SDGym
Benchmarking synthetic data generation methods.
Stars: ✭ 177 (-40.4%)
Mutual labels:  tabular-data, generative-adversarial-network
Data Augmentation Review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to github repos, papers and others.
Stars: ✭ 785 (+164.31%)
Mutual labels:  generative-adversarial-network, data-generation
DeepEcho
Synthetic Data Generation for mixed-type, multivariate time series.
Stars: ✭ 44 (-85.19%)
Mutual labels:  generative-adversarial-network, data-generation
EmotionalConversionStarGAN
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
Stars: ✭ 92 (-69.02%)
Mutual labels:  generative-adversarial-network
Generative models tutorial with demo
Generative Models Tutorial with Demo: Bayesian Classifier Sampling, Variational Auto Encoder (VAE), Generative Adversial Networks (GANs), Popular GANs Architectures, Auto-Regressive Models, Important Generative Model Papers, Courses, etc..
Stars: ✭ 276 (-7.07%)
Mutual labels:  generative-adversarial-network
Autofillr
A browser extension that fills registration forms with randomly but consistently generated fake data.
Stars: ✭ 17 (-94.28%)
Mutual labels:  data-generation
HyperGBM
A full pipeline AutoML tool for tabular data
Stars: ✭ 172 (-42.09%)
Mutual labels:  tabular-data
Pytorch Srgan
A modern PyTorch implementation of SRGAN
Stars: ✭ 289 (-2.69%)
Mutual labels:  generative-adversarial-network
Makegirlsmoe web
Create Anime Characters with MakeGirlsMoe
Stars: ✭ 3,144 (+958.59%)
Mutual labels:  generative-adversarial-network
Tf 3dgan
Tensorflow implementation of 3D Generative Adversarial Network.
Stars: ✭ 263 (-11.45%)
Mutual labels:  generative-adversarial-network
TadGAN
Code for the paper "TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks"
Stars: ✭ 67 (-77.44%)
Mutual labels:  generative-adversarial-network
Alae
[CVPR2020] Adversarial Latent Autoencoders
Stars: ✭ 3,178 (+970.03%)
Mutual labels:  generative-adversarial-network
RecycleGAN
The simplest implementation toward the idea of Re-cycle GAN
Stars: ✭ 68 (-77.1%)
Mutual labels:  generative-adversarial-network
Swiftdatatables
A Swift Data Table package, display grid-like data sets in a nicely formatted table for iOS. Subclassing UICollectionView that allows ordering, and searching with extensible options.
Stars: ✭ 287 (-3.37%)
Mutual labels:  tabular-data
DLSS
Deep Learning Super Sampling with Deep Convolutional Generative Adversarial Networks.
Stars: ✭ 88 (-70.37%)
Mutual labels:  generative-adversarial-network
Textbox
TextBox is an open-source library for building text generation system.
Stars: ✭ 257 (-13.47%)
Mutual labels:  generative-adversarial-network
Dcgan
The Simplest DCGAN Implementation
Stars: ✭ 286 (-3.7%)
Mutual labels:  generative-adversarial-network
Swiftytexttable
A lightweight library for generating text tables.
Stars: ✭ 252 (-15.15%)
Mutual labels:  tabular-data

DAI-Lab An Open Source Project from the Data to AI Lab, at MIT

Development Status PyPI Shield Travis CI Shield Downloads Coverage Status

Overview

CTGAN is a collection of Deep Learning based Synthetic Data Generators for single table data, which are able to learn from real data and generate synthetic clones with high fidelity.

Currently, this library implements the CTGAN and TVAE models proposed in the Modeling Tabular data using Conditional GAN paper. For more information about these models, please check out the respective user guides:

Install

Requirements

CTGAN has been developed and tested on Python 3.6, 3.7 and 3.8

Install from PyPI

The recommended way to installing CTGAN is using pip:

pip install ctgan

This will pull and install the latest stable release from PyPI.

Install with conda

CTGAN can also be installed using conda:

conda install -c sdv-dev -c pytorch -c conda-forge ctgan

This will pull and install the latest stable release from Anaconda.

Usage Example

⚠️ WARNING: If you're just getting started with synthetic data, we recommend using the SDV library which provides user-friendly APIs for interacting with CTGAN. To learn more about using CTGAN through SDV, check out the user guide here.

To get started with CTGAN, you should prepare your data as either a numpy.ndarray or a pandas.DataFrame object with two types of columns:

  • Continuous Columns: can contain any numerical value.
  • Discrete Columns: contain a finite number values, whether these are string values or not.

In this example we load the Adult Census Dataset which is a built-in demo dataset. We then model it using the CTGANSynthesizer and generate a synthetic copy of it.

from ctgan import CTGANSynthesizer
from ctgan import load_demo

data = load_demo()

# Names of the columns that are discrete
discrete_columns = [
    'workclass',
    'education',
    'marital-status',
    'occupation',
    'relationship',
    'race',
    'sex',
    'native-country',
    'income'
]

ctgan = CTGANSynthesizer(epochs=10)
ctgan.fit(data, discrete_columns)

# Synthetic copy
samples = ctgan.sample(1000)

Join our community

  1. Please have a look at the Contributing Guide to see how you can contribute to the project.
  2. If you have any doubts, feature requests or detect an error, please open an issue on github or join our Slack Workspace.
  3. Also, do not forget to check the project documentation site!

Citing TGAN

If you use CTGAN, please cite the following work:

  • Lei Xu, Maria Skoularidou, Alfredo Cuesta-Infante, Kalyan Veeramachaneni. Modeling Tabular data using Conditional GAN. NeurIPS, 2019.
@inproceedings{xu2019modeling,
  title={Modeling Tabular data using Conditional GAN},
  author={Xu, Lei and Skoularidou, Maria and Cuesta-Infante, Alfredo and Veeramachaneni, Kalyan},
  booktitle={Advances in Neural Information Processing Systems},
  year={2019}
}

Related Projects

Please note that these libraries are external contributions and are not maintained nor supervised by the MIT DAI-Lab team.

R interface for CTGAN

A wrapper around CTGAN has been implemented by Kevin Kuo @kevinykuo, bringing the functionalities of CTGAN to R users.

More details can be found in the corresponding repository: https://github.com/kasaai/ctgan

CTGAN Server CLI

A package to easily deploy CTGAN onto a remote server. This package is developed by Timothy Pillow @oregonpillow.

More details can be found in the corresponding repository: https://github.com/oregonpillow/ctgan-server-cli

The Synthetic Data Vault

This repository is part of The Synthetic Data Vault Project

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].