ChenglongChen / Tensorflow Xnn

License: MIT
TensorFlow implementation of a DeepFM variant that won 4th place in the Mercari Price Suggestion Challenge on Kaggle.

Programming Languages

python

Projects that are alternatives of or similar to Tensorflow Xnn

Kaggle
Kaggle Kernels (Python, R, Jupyter Notebooks)
Stars: ✭ 26 (-90.11%)
Mutual labels:  kaggle-competition
spark-fm
A parallel implementation of factorization machines based on Spark
Stars: ✭ 73 (-72.24%)
Mutual labels:  factorization-machines
Kaggle The Hunt for Prohibited Content
4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content)
Stars: ✭ 29 (-88.97%)
Mutual labels:  kaggle-competition
open-solution-ship-detection
Open solution to the Airbus Ship Detection Challenge
Stars: ✭ 54 (-79.47%)
Mutual labels:  kaggle-competition
Bike-Sharing-Demand-Kaggle
Top 5th percentile solution to the Kaggle knowledge problem - Bike Sharing Demand
Stars: ✭ 33 (-87.45%)
Mutual labels:  kaggle-competition
kaggle
Kaggle solutions
Stars: ✭ 17 (-93.54%)
Mutual labels:  kaggle-competition
histopathologic cancer detector
CNN histopathologic tumor identifier.
Stars: ✭ 26 (-90.11%)
Mutual labels:  kaggle-competition
HumanOrRobot
A solution for the Kaggle competition `Human or Robot`
Stars: ✭ 16 (-93.92%)
Mutual labels:  kaggle-competition
Hello-Kaggle-Guide-KOR
A guide for people who are new to Kaggle
Stars: ✭ 140 (-46.77%)
Mutual labels:  kaggle-competition
Kaggle-project-list
Summary of my projects on kaggle
Stars: ✭ 20 (-92.4%)
Mutual labels:  kaggle-competition
nowplaying-RS-Music-Reco-FM
#nowplaying-RS: Music Recommendation using Factorization Machines
Stars: ✭ 23 (-91.25%)
Mutual labels:  factorization-machines
deep-ctr
No description or website provided.
Stars: ✭ 92 (-65.02%)
Mutual labels:  factorization-machines
automated-essay-grading
Source code for the paper A Memory-Augmented Neural Model for Automated Grading
Stars: ✭ 101 (-61.6%)
Mutual labels:  kaggle-competition
fer
Facial Expression Recognition
Stars: ✭ 32 (-87.83%)
Mutual labels:  kaggle-competition
Machine-Learning
The projects I do in Machine Learning with PyTorch, keras, Tensorflow, scikit learn and Python.
Stars: ✭ 54 (-79.47%)
Mutual labels:  kaggle-competition
Recommendation.jl
Building recommender systems in Julia
Stars: ✭ 42 (-84.03%)
Mutual labels:  factorization-machines
Apartment-Interest-Prediction
Predict people's interest in renting specific NYC apartments. The challenge combines structured data, geolocation, time data, free text and images.
Stars: ✭ 17 (-93.54%)
Mutual labels:  kaggle-competition
kaggler
🏁 API client for Kaggle
Stars: ✭ 50 (-80.99%)
Mutual labels:  kaggle-competition
ashrae-great-energy-predictor-3-solution-analysis
Analysis of the top five winning solutions of the ASHRAE Great Energy Predictor III competition
Stars: ✭ 44 (-83.27%)
Mutual labels:  kaggle-competition
Jigsaw-Unintended-Bias-in-Toxicity-Classification
7th Place Solution for Jigsaw Unintended Bias in Toxicity Classification on Kaggle
Stars: ✭ 16 (-93.92%)
Mutual labels:  kaggle-competition

tensorflow-XNN

4th Place Solution for Mercari Price Suggestion Challenge on Kaggle

The Challenge

Build a model to suggest the price of a product on Mercari. The model must complete both training (including all preprocessing, feature extraction and model training steps) and inference within 1 hour, using only a 4-core CPU, 16GB RAM and 1GB disk. The data include unstructured text (product title and description) and structured fields, e.g., product category and shipping flag.

Summary

Highlights of our method are as follows:

  • minimal preprocessing, with a focus on end-to-end learning from multi-field inputs, e.g., textual and categorical;
  • a hybrid NN consisting of four major components, i.e., embed, encode, attend and predict, with FastText and NN-based FM used as building blocks;
  • pure bagging of NNs of the same architecture via snapshot ensembling;
  • efficiency achieved via various approaches, e.g., LazyNadam optimization, FastText encoding with average pooling, snapshot ensembling, etc.
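
To make two of the building blocks named above more concrete, here is a minimal sketch in plain Python of FastText-style average pooling and of the second-order FM interaction term. It is not taken from the repository: the function names, the toy vectors and the three example fields are assumptions chosen for illustration. The "fast" variant uses the standard identity that the sum of pairwise dot products equals half of (square of the sum minus sum of the squares), which is what makes an FM layer cheap inside a neural network.

```python
from itertools import combinations


def average_pool(token_vectors):
    """FastText-style text encoding: the mean of the token embeddings."""
    n = len(token_vectors)
    dim = len(token_vectors[0])
    return [sum(v[k] for v in token_vectors) / n for k in range(dim)]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def fm_pairwise_naive(field_vectors):
    """Sum of <v_i, v_j> over all field pairs i < j; quadratic in the number of fields."""
    return sum(dot(u, v) for u, v in combinations(field_vectors, 2))


def fm_pairwise_fast(field_vectors):
    """Same quantity, linear in the number of fields:
    0.5 * sum_k ((sum_i v_ik)^2 - sum_i v_ik^2)."""
    dim = len(field_vectors[0])
    total = 0.0
    for k in range(dim):
        column = [v[k] for v in field_vectors]
        column_sum = sum(column)
        sum_of_squares = sum(x * x for x in column)
        total += 0.5 * (column_sum * column_sum - sum_of_squares)
    return total


# Hypothetical toy inputs: one text field and two categorical fields,
# all embedded in the same 2-dimensional space.
title_tokens = [[1.0, 2.0], [3.0, 4.0]]   # embeddings of the title's tokens
category = [0.5, -1.0]                    # embedding of the product category
shipping = [1.0, 1.0]                     # embedding of the shipping flag

fields = [average_pool(title_tokens), category, shipping]
interaction = fm_pairwise_fast(fields)
```

In a real model the vectors would be learned embeddings and the interaction term would feed into the prediction layers; the sketch only checks that the efficient formulation matches the naive pairwise sum.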

Model Architecture

fig/architecture.png

Please find the slide of our solution here.

About this project

This is the 4th text mining competition I have attended on Kaggle. The other three are:

In these previous competitions, I followed the general ML-based approach, i.e., data cleaning, feature engineering (see the solutions of CrowdFlower and HomeDepot for how many features were engineered), VW/XGBoost training, and massive ensembling.

Since I have been working on CTR and KBQA with deep learning and embedding models for some time, I decided to give this competition a shot. With the data of this competition, I experimented with various ideas such as NN-based FM and snapshot ensembling.
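
For readers unfamiliar with snapshot ensembling, the idea can be sketched as follows: train a single network with a cyclic learning rate, save a snapshot of the model at the end of each cycle, and average the snapshots' predictions. The code below is an illustrative sketch only. The cosine schedule is one common choice and is an assumption here, as are all function names and numbers; the exact schedule used in this project is not stated above.

```python
import math


def cyclic_cosine_lr(step, total_steps, num_cycles, base_lr):
    """Learning rate that restarts at base_lr at the start of each cycle
    and decays towards zero along a half cosine within the cycle."""
    steps_per_cycle = math.ceil(total_steps / num_cycles)
    position = (step % steps_per_cycle) / steps_per_cycle
    return 0.5 * base_lr * (math.cos(math.pi * position) + 1.0)


def snapshot_steps(total_steps, num_cycles):
    """Last step of each cycle, where the learning rate is lowest
    and a snapshot of the model would be saved."""
    steps_per_cycle = math.ceil(total_steps / num_cycles)
    return [min((c + 1) * steps_per_cycle, total_steps) - 1
            for c in range(num_cycles)]


def average_predictions(snapshot_predictions):
    """Bagging step: element-wise mean of the per-snapshot predictions."""
    n = len(snapshot_predictions)
    return [sum(values) / n for values in zip(*snapshot_predictions)]


# Hypothetical run: 100 training steps split into 4 cycles.
schedule = [cyclic_cosine_lr(s, 100, 4, 0.01) for s in range(100)]
save_at = snapshot_steps(100, 4)

# Hypothetical predictions from two snapshots for two items.
ensemble = average_predictions([[1.0, 2.0], [3.0, 4.0]])
```

Because every snapshot comes from the same training run, the ensemble costs little more than training one model, which matters under a 1-hour limit.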
