All Projects → entron → Entity Embedding Rossmann

entron / Entity Embedding Rossmann

Projects that are alternatives of or similar to Entity Embedding Rossmann

Competition Baseline
数据科学竞赛知识、代码、思路
Stars: ✭ 2,553 (+236.81%)
Mutual labels:  kaggle, jupyter-notebook
Kaggle airbus ship detection
Kaggle airbus ship detection challenge 21st solution
Stars: ✭ 238 (-68.6%)
Mutual labels:  kaggle, jupyter-notebook
Deeptoxic
top 1% solution to toxic comment classification challenge on Kaggle.
Stars: ✭ 180 (-76.25%)
Mutual labels:  kaggle, jupyter-notebook
Benchmarks
Comparison tools
Stars: ✭ 139 (-81.66%)
Mutual labels:  kaggle, jupyter-notebook
Deeplearning
深度学习入门教程, 优秀文章, Deep Learning Tutorial
Stars: ✭ 6,783 (+794.85%)
Mutual labels:  kaggle, jupyter-notebook
Data Analysis
主要是爬虫与数据分析项目总结,外加建模与机器学习,模型的评估。
Stars: ✭ 142 (-81.27%)
Mutual labels:  kaggle, jupyter-notebook
Machine Learning
从零基础开始机器学习之旅
Stars: ✭ 209 (-72.43%)
Mutual labels:  kaggle, jupyter-notebook
Dogbreed gluon
kaggle Dog Breed Identification
Stars: ✭ 116 (-84.7%)
Mutual labels:  kaggle, jupyter-notebook
Data Science Projects
DataScience projects for learning : Kaggle challenges, Object Recognition, Parsing, etc.
Stars: ✭ 361 (-52.37%)
Mutual labels:  kaggle, jupyter-notebook
Amazon Forest Computer Vision
Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks
Stars: ✭ 346 (-54.35%)
Mutual labels:  kaggle, jupyter-notebook
Kaggle Web Traffic
1st place solution
Stars: ✭ 1,641 (+116.49%)
Mutual labels:  kaggle, jupyter-notebook
Dogs vs cats
猫狗大战
Stars: ✭ 570 (-24.8%)
Mutual labels:  kaggle, jupyter-notebook
Ml Dl Scripts
The repository provides usefull python scripts for ML and data analysis
Stars: ✭ 119 (-84.3%)
Mutual labels:  kaggle, jupyter-notebook
Machine Learning Workflow With Python
This is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (-79.29%)
Mutual labels:  kaggle, jupyter-notebook
Ml Fraud Detection
Credit card fraud detection through logistic regression, k-means, and deep learning.
Stars: ✭ 117 (-84.56%)
Mutual labels:  kaggle, jupyter-notebook
Girls In Ai
免费学代码系列:小白python入门、数据分析data analyst、机器学习machine learning、深度学习deep learning、kaggle实战
Stars: ✭ 2,309 (+204.62%)
Mutual labels:  kaggle, jupyter-notebook
Kaggle Houseprices
Kaggle Kernel for House Prices competition https://www.kaggle.com/massquantity/all-you-need-is-pca-lb-0-11421-top-4
Stars: ✭ 113 (-85.09%)
Mutual labels:  kaggle, jupyter-notebook
Ds bowl 2018
Kaggle Data Science Bowl 2018
Stars: ✭ 116 (-84.7%)
Mutual labels:  kaggle, jupyter-notebook
Pytorch Kaggle Starter
Pytorch starter kit for Kaggle competitions
Stars: ✭ 268 (-64.64%)
Mutual labels:  kaggle, jupyter-notebook
Tutorials
CatBoost tutorials repository
Stars: ✭ 563 (-25.73%)
Mutual labels:  kaggle, jupyter-notebook

This is the code used in the paper "Entity Embeddings of Categorical Variables". If you want to get the original version of the code used for the Kaggle competition, please use the Kaggle branch.

To run the code one needs first download and unzip the train.csv and store.csv files on Kaggle and put them in this folder.

If you use Anaconda you can install the dependecies like the following example:

conda create --name ee python=3.7 pip
conda activate ee
pip install scikit-learn xgboost tensorflow keras jupyter matplotlib

Please refer to Keras for more details regarding how to install keras.

Next, run the following scripts to extract the csv files and prepare the features:

python3 extract_csv_files.py
python3 prepare_features.py

To run the models:

python3 train_test_model.py

You can anaylize the embeddings with plot_embeddings.ipynb. For example, the following are the learned embeeding of German States printed in 2D and the map of Germany side by side. Considering the algorithm knows nothing about German geography the remarkable resemblance between the two demonstrates the power of the algorithm for abductive reasoning. I expect entity embedding will be a very useful tool to study the relationship of genome, proteins, drugs, diseases and I would love to see its applications in biology and medicine one day.

Visualizaiton of Entity Embedding of German States in 2D Map of Germany
EE_German_States Karte-Deutschland-Fun-Facts-Deutsch
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].