DataCanvasIO / Deeptables

Licence: apache-2.0
DeepTables: Deep-learning Toolkit for Tabular data

Programming Languages

python

Projects that are alternatives to or similar to DeepTables

Deepmatch
A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.
Stars: ✭ 1,051 (+407.73%)
Mutual labels:  factorization-machines
Flurs
🌊 FluRS: A Python library for streaming recommendation algorithms
Stars: ✭ 97 (-53.14%)
Mutual labels:  factorization-machines
Tableprint
Pretty console printing 📋 of tabular data in python 🐍
Stars: ✭ 153 (-26.09%)
Mutual labels:  tabular-data
Ctr model zoo
some ctr model, implemented by PyTorch, such as Factorization Machines, Field-aware Factorization Machines, DeepFM, xDeepFM, Deep Interest Network
Stars: ✭ 55 (-73.43%)
Mutual labels:  factorization-machines
Tsv Utils
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (+486.96%)
Mutual labels:  tabular-data
Tableqa
AI Tool for querying natural language on tabular data.
Stars: ✭ 109 (-47.34%)
Mutual labels:  tabular-data
Dataframes.jl
In-memory tabular data in Julia
Stars: ✭ 951 (+359.42%)
Mutual labels:  tabular-data
Mirador
Tool for visual exploration of complex data.
Stars: ✭ 186 (-10.14%)
Mutual labels:  tabular-data
Fwumious wabbit
Fwumious Wabbit, fast on-line machine learning toolkit written in Rust
Stars: ✭ 96 (-53.62%)
Mutual labels:  factorization-machines
Rsparse
Fast and accurate machine learning on sparse matrices - matrix factorizations, regression, classification, top-N recommendations.
Stars: ✭ 145 (-29.95%)
Mutual labels:  factorization-machines
Rankfm
Factorization Machines for Recommendation and Ranking Problems with Implicit Feedback Data
Stars: ✭ 71 (-65.7%)
Mutual labels:  factorization-machines
Multimodal Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
Stars: ✭ 78 (-62.32%)
Mutual labels:  tabular-data
Fmg
KDD17_FMG
Stars: ✭ 116 (-43.96%)
Mutual labels:  factorization-machines
Faster Than Csv
Faster CSV on Python 3
Stars: ✭ 52 (-74.88%)
Mutual labels:  tabular-data
Csvreader
csvreader library / gem - read tabular data in the comma-separated values (csv) format the right way (uses best practices out-of-the-box with zero-configuration)
Stars: ✭ 169 (-18.36%)
Mutual labels:  tabular-data
Attentional Neural Factorization Machine
Attention,Factorization Machine, Deep Learning, Recommender System
Stars: ✭ 39 (-81.16%)
Mutual labels:  factorization-machines
Sno
Distributed version-control for geospatial and tabular data
Stars: ✭ 100 (-51.69%)
Mutual labels:  tabular-data
Tad
A desktop application for viewing and analyzing tabular data
Stars: ✭ 2,275 (+999.03%)
Mutual labels:  tabular-data
Tgan
Generative adversarial training for generating synthetic tabular data.
Stars: ✭ 173 (-16.43%)
Mutual labels:  tabular-data
Tui.grid
πŸžπŸ”‘ The Powerful Component to Display and Edit Data. Experience the Ultimate Data Transformer!
Stars: ✭ 1,859 (+798.07%)
Mutual labels:  tabular-data

DeepTables

DeepTables: Deep-learning Toolkit for Tabular data

DeepTables (DT) is an easy-to-use toolkit that enables deep learning to unleash great power on tabular data.

Overview

MLPs (also known as fully connected neural networks) have been shown to be inefficient at learning distribution representations. The "add" operations of the perceptron layer have been shown to perform poorly at exploring multiplicative feature interactions. In most cases, manual feature engineering is necessary, and this work requires extensive domain knowledge and can be very cumbersome. How to learn feature interactions efficiently in neural networks has therefore become a central problem.

Various models have been proposed for CTR prediction and have continued to outperform existing state-of-the-art approaches in recent years. Well-known examples include FM, DeepFM, Wide&Deep, DCN, PNN, etc. These models can also perform well on tabular data when used appropriately.
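To make "multiplicative feature interactions" concrete, here is a minimal NumPy sketch of the second-order term of a factorization machine (FM), the building block shared by several of the models listed above. It illustrates the general FM formula only and is not DeepTables' implementation. The function names `fm_pairwise_naive` and `fm_pairwise_fast` and the random data are hypothetical, chosen for illustration.

```python
import numpy as np


def fm_pairwise_naive(x, v):
    """Sum of <v_i, v_j> * x_i * x_j over all feature pairs i < j."""
    n = x.shape[0]
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            total += float(np.dot(v[i], v[j])) * x[i] * x[j]
    return total


def fm_pairwise_fast(x, v):
    """Same quantity, computed in O(n * k) instead of O(n^2 * k).

    Uses the identity
    0.5 * sum_f ((sum_i v[i, f] * x[i])^2 - sum_i v[i, f]^2 * x[i]^2).
    """
    xv = x @ v                    # shape (k,): weighted sum of latent vectors
    x2v2 = (x ** 2) @ (v ** 2)    # shape (k,): sum of squared terms
    return 0.5 * float(np.sum(xv ** 2 - x2v2))


# Made-up example: one sample with 6 numeric features and
# one 3-dimensional latent vector per feature.
rng = np.random.default_rng(0)
x = rng.normal(size=6)
v = rng.normal(size=(6, 3))

naive = fm_pairwise_naive(x, v)
fast = fm_pairwise_fast(x, v)
print(naive, fast)  # the two values agree up to floating-point error
```

Each product `x[i] * x[j]` is an explicit multiplicative interaction, which a plain stack of additive perceptron layers has to approximate indirectly.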

DT aims to utilize the latest research findings to provide users with an end-to-end toolkit for tabular data.

DT has been designed with these key goals in mind:

  • Easy to use, even for non-experts.
  • Provide good performance out of the box.
  • Flexible architecture that is easy for users to extend.

Tutorials

Please refer to the official docs at https://deeptables.readthedocs.io/en/latest/.

Installation

pip install deeptables

GPU Setup (Optional)

pip install deeptables[gpu]

Verify the install:

python -c "from deeptables.utils.quicktest import test; test()"

Optional dependencies

The following libraries are not hard dependencies and are not automatically installed when you install DeepTables. To use all the functionality of DT, these optional dependencies must be installed.

pip install shap
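Because optional dependencies are not installed automatically, it can help to check for them before relying on a feature that needs them. The sketch below uses only the Python standard library. The helper name `is_installed` is hypothetical and is not part of DeepTables.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module can be found by the import system."""
    return importlib.util.find_spec(module_name) is not None


# `shap` is the optional dependency named above; the check itself is generic.
if is_installed("shap"):
    print("shap is available")
else:
    print("shap is missing; install it with: pip install shap")
```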

Example:

A simple binary classification example

import numpy as np
from deeptables.models import deeptable, deepnets
from deeptables.datasets import dsutils
from sklearn.model_selection import train_test_split

# Load the data and hold out 20% of the rows for testing
df = dsutils.load_bank()
df_train, df_test = train_test_split(df, test_size=0.2, random_state=42)

# Separate the target column 'y' from the features
y = df_train.pop('y')
y_test = df_test.pop('y')

# Train a DeepFM model
config = deeptable.ModelConfig(nets=deepnets.DeepFM)
dt = deeptable.DeepTable(config=config)
model, history = dt.fit(df_train, y, epochs=10)

# Evaluate on the held-out set
result = dt.evaluate(df_test, y_test, batch_size=512, verbose=0)
print(result)

# Score the test set
preds = dt.predict(df_test)
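Predictions from the scoring step can also be compared with the held-out labels using scikit-learn metrics. The sketch below uses made-up stand-in arrays in place of `y_test` and `preds`, so it runs without training a model. It assumes the predictions are class labels of the same kind as the targets, which is worth confirming against the official docs for your setup.

```python
import numpy as np
from sklearn.metrics import accuracy_score, confusion_matrix

# Stand-in arrays (made-up values) playing the role of `y_test` and `preds`.
y_true = np.array(["no", "no", "yes", "no", "yes", "yes"])
y_pred = np.array(["no", "yes", "yes", "no", "no", "yes"])

# Fraction of rows where the predicted label matches the true label
acc = accuracy_score(y_true, y_pred)

# Rows are true labels, columns are predicted labels, in the order given
cm = confusion_matrix(y_true, y_pred, labels=["no", "yes"])

print(f"accuracy: {acc:.3f}")
print(cm)
```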

A solution using DeepTables that won 1st place in the Kaggle Categorical Feature Encoding Challenge II

DataCanvas

DeepTables is an open source project created by DataCanvas.
