All Projects → sdv-dev → Copulas

sdv-dev / Copulas

Licence: mit
A library to model multivariate data using copulas.

Projects that are alternatives of or similar to Copulas

Awesome Ai Ml Dl
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Stars: ✭ 831 (+457.72%)
Mutual labels:  jupyter-notebook, data-generation
Sdv
Synthetic Data Generation for tabular, relational and time series data.
Stars: ✭ 360 (+141.61%)
Mutual labels:  jupyter-notebook, data-generation
Example Seldon
Example for end-to-end machine learning on Kubernetes using Kubeflow and Seldon Core
Stars: ✭ 154 (+3.36%)
Mutual labels:  jupyter-notebook
Stocks
Programs for stock prediction and evaluation
Stars: ✭ 155 (+4.03%)
Mutual labels:  jupyter-notebook
Neural Style Transfer
Keras Implementation of Neural Style Transfer from the paper "A Neural Algorithm of Artistic Style" (http://arxiv.org/abs/1508.06576) in Keras 2.0+
Stars: ✭ 2,000 (+1242.28%)
Mutual labels:  jupyter-notebook
Tensorflow Multi Dimensional Lstm
Multi dimensional LSTM as described in Alex Graves' Paper https://arxiv.org/pdf/0705.2011.pdf
Stars: ✭ 154 (+3.36%)
Mutual labels:  jupyter-notebook
Pyportfolioopt
Financial portfolio optimisation in python, including classical efficient frontier, Black-Litterman, Hierarchical Risk Parity
Stars: ✭ 2,502 (+1579.19%)
Mutual labels:  jupyter-notebook
Automatic Generation Of Text Summaries
使用两种方法(抽取式Textrank和概要式seq2seq)自动提取文本摘要
Stars: ✭ 155 (+4.03%)
Mutual labels:  jupyter-notebook
Surgery Robot Detection Segmentation
Object detection and segmentation for a surgery robot using Mask-RCNN on Python 3, Keras, and TensorFlow..
Stars: ✭ 155 (+4.03%)
Mutual labels:  jupyter-notebook
Binderhub
Run your code in the cloud, with technology so advanced, it feels like magic!
Stars: ✭ 2,050 (+1275.84%)
Mutual labels:  jupyter-notebook
Your First Kaggle Submission
How to perform an exploratory data analysis on the Kaggle Titanic dataset and make a submission to the leaderboard.
Stars: ✭ 155 (+4.03%)
Mutual labels:  jupyter-notebook
Jupyter Server Proxy
Jupyter notebook server extension to proxy web services.
Stars: ✭ 153 (+2.68%)
Mutual labels:  jupyter-notebook
Cnnvis Pytorch
visualization of CNN in PyTorch
Stars: ✭ 154 (+3.36%)
Mutual labels:  jupyter-notebook
Jupyter Vim Binding
Jupyter meets Vim. Vimmer will fall in love.
Stars: ✭ 1,965 (+1218.79%)
Mutual labels:  jupyter-notebook
Deep Viz Keras
Implementations of some popular Saliency Maps in Keras
Stars: ✭ 154 (+3.36%)
Mutual labels:  jupyter-notebook
Pytorchmedicalai
This is the hands-on deep learning tutorial series for the 2018/2019 Medical AI course by DeepOncology AI.
Stars: ✭ 155 (+4.03%)
Mutual labels:  jupyter-notebook
Data Science Stack Cookiecutter
🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
Stars: ✭ 153 (+2.68%)
Mutual labels:  jupyter-notebook
Stock Market Prediction Challenge
Following repo is the solution to Stock Market Prediction using Neural Networks and Sentiment Analysis
Stars: ✭ 154 (+3.36%)
Mutual labels:  jupyter-notebook
Deepreinforcementlearning
A replica of the AlphaZero methodology for deep reinforcement learning in Python
Stars: ✭ 1,898 (+1173.83%)
Mutual labels:  jupyter-notebook
Mgwr
Multiscale Geographically Weighted Regression (MGWR)
Stars: ✭ 155 (+4.03%)
Mutual labels:  jupyter-notebook

DAI-Lab An Open Source Project from the Data to AI Lab, at MIT

Development Status PyPi Shield Downloads Tests Coverage Status

Copulas

Overview

Copulas is a Python library for modeling multivariate distributions and sampling from them using copula functions. Given a table containing numerical data, we can use Copulas to learn the distribution and later on generate new synthetic rows following the same statistical properties.

Some of the features provided by this library include:

  • A variety of distributions for modeling univariate data.
  • Multiple Archimedean copulas for modeling bivariate data.
  • Gaussian and Vine copulas for modeling multivariate data.
  • Automatic selection of univariate distributions and bivariate copulas.

Supported Distributions

Univariate

  • Beta
  • Gamma
  • Gaussian
  • Gaussian KDE
  • Log-Laplace
  • Student T
  • Truncated Gaussian
  • Uniform

Archimedean Copulas (Bivariate)

  • Clayton
  • Frank
  • Gumbel

Multivariate

  • Gaussian Copula
  • D-Vine
  • C-Vine
  • R-Vine

Install

Requirements

Copulas is part of the SDV project and is automatically installed alongside it. For details about this process please visit the SDV Installation Guide

Optionally, Copulas can also be installed as a standalone library using the following commands:

Using pip:

pip install copulas

Using conda:

conda install -c sdv-dev -c conda-forge copulas

For more installation options please visit the Copulas installation Guide

Quickstart

In this short quickstart, we show how to model a multivariate dataset and then generate synthetic data that resembles it.

import warnings
warnings.filterwarnings('ignore')

from copulas.datasets import sample_trivariate_xyz
from copulas.multivariate import GaussianMultivariate
from copulas.visualization import compare_3d

# Load a dataset with 3 columns that are not independent
real_data = sample_trivariate_xyz()

# Fit a gaussian copula to the data
copula = GaussianMultivariate()
copula.fit(real_data)

# Sample synthetic data
synthetic_data = copula.sample(len(real_data))

# Plot the real and the synthetic data to compare
compare_3d(real_data, synthetic_data)

The output will be a figure with two plots, showing what both the real and the synthetic data that you just generated look like:

Quickstart

What's next?

For more details about Copulas and all its possibilities and features, please check the documentation site.

There you can learn more about how to contribute to Copulas in order to help us developing new features or cool ideas.

Credits

Copulas is an open source project from the Data to AI Lab at MIT which has been built and maintained over the years by the following team:

The Synthetic Data Vault

This repository is part of The Synthetic Data Vault Project

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].