ChenglongChen / Tensorflow Xnn

License: MIT

TensorFlow implementation of a DeepFM variant that won 4th place in the Mercari Price Suggestion Challenge on Kaggle.

Labels

kaggle-competition factorization-machines

tensorflow-XNN

4th Place Solution for Mercari Price Suggestion Challenge on Kaggle

The Challenge

Build a model that suggests the price of a product listed on Mercari. The model must complete training (including all preprocessing, feature extraction, and model training steps) and inference within 1 hour, using only a 4-core CPU, 16GB RAM, and 1GB of disk. The data include unstructured text (product title and description) and structured fields, e.g., product category and shipping flag.

Summary

Highlights of our method are as follows:

minimal preprocessing, with a focus on end-to-end learning from multi-field inputs, e.g., textual and categorical;
a hybrid NN consisting of four major components, i.e., embed, encode, attend and predict, with FastText and NN-based FM used as building blocks;
pure bagging of NNs of the same architecture via snapshot ensemble;
efficiency achieved via various approaches, e.g., lazynadam optimization, fasttext encoding and average pooling, snapshot ensemble, etc.
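As a rough illustration of two of the building blocks named above, here is a minimal pure-Python sketch of fastText-style average pooling followed by a factorization-machine (FM) pairwise interaction over field vectors. It is not taken from this repository: the function names, the toy embedding table, and the choice to share one table across fields are all assumptions made for the example. The FM term uses the standard identity that the sum of pairwise dot products equals half of (square of the sum minus sum of the squares), taken per embedding dimension.

```python
def average_pool(token_ids, embedding_table):
    """fastText-style encoding: mean of the token embeddings of one text field."""
    dim = len(embedding_table[0])
    if not token_ids:
        return [0.0] * dim  # empty text maps to a zero vector
    pooled = [0.0] * dim
    for token_id in token_ids:
        for k in range(dim):
            pooled[k] += embedding_table[token_id][k]
    return [value / len(token_ids) for value in pooled]


def fm_interaction(field_vectors):
    """Sum of dot products over all distinct pairs of field vectors.

    Computed in O(n_fields * dim) with the identity
    sum_{i<j} <v_i, v_j> = 0.5 * sum_k ((sum_i v_ik)^2 - sum_i v_ik^2).
    """
    dim = len(field_vectors[0])
    total = 0.0
    for k in range(dim):
        sum_k = sum(v[k] for v in field_vectors)
        sum_sq_k = sum(v[k] * v[k] for v in field_vectors)
        total += 0.5 * (sum_k * sum_k - sum_sq_k)
    return total


# Toy usage (hypothetical values; a real model would learn one table per field).
embedding_table = [[1.0, 2.0], [3.0, 4.0], [0.5, -1.0]]
name_vec = average_pool([0, 1], embedding_table)   # text field: two tokens
category_vec = embedding_table[2]                  # categorical field: one id
score = fm_interaction([name_vec, category_vec])
```

In a trained network the embeddings would be learned and the interaction term would feed further layers; the sketch only shows why both steps are cheap, which matters under the one-hour CPU budget.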

Model Architecture

Please find the slides of our solution here.

About this project

This is the 4th text mining competition I have attended on Kaggle. The other three are:

CrowdFlower Search Results Relevance Competition, 1st Place
Home Depot Product Search Relevance Competition, 3rd Place
The Hunt for Prohibited Content Competition, 4th Place

In these previous competitions, I took the general ML-based approach, i.e., data cleaning, feature engineering (see the solutions of CrowdFlower and HomeDepot for how many features were engineered), VW/XGBoost training, and massive ensembling.

Since I have been working on CTR and KBQA with deep learning and embedding models for some time, I decided to give this competition a shot. With the data of this competition, I experimented with various ideas such as NN-based FM and snapshot ensemble.
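For readers unfamiliar with snapshot ensembling, the sketch below shows the general idea on a toy linear regression: a single training run uses a learning rate that decays along a cosine curve and restarts each cycle, the parameters are saved at the end of every cycle, and the saved snapshots are averaged at prediction time. This is an illustration under assumptions, not the code of this project: the cosine-restart schedule, the function names, and all hyperparameter values are chosen for the example only.

```python
import math


def cosine_cycle_lr(step, steps_per_cycle, base_lr):
    """Cosine-annealed learning rate that restarts at base_lr every cycle."""
    position = step % steps_per_cycle
    return 0.5 * base_lr * (math.cos(math.pi * position / steps_per_cycle) + 1.0)


def train_with_snapshots(xs, ys, n_cycles=3, steps_per_cycle=200, base_lr=0.1):
    """Fit y = w * x + b by gradient descent, saving (w, b) at each cycle end."""
    w, b = 0.0, 0.0
    n = len(xs)
    snapshots = []
    for step in range(n_cycles * steps_per_cycle):
        lr = cosine_cycle_lr(step, steps_per_cycle, base_lr)
        # Full-batch gradient of the mean squared error.
        grad_w = sum(2.0 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2.0 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w
        b -= lr * grad_b
        if (step + 1) % steps_per_cycle == 0:
            snapshots.append((w, b))  # one ensemble member per cycle
    return snapshots


def ensemble_predict(snapshots, x):
    """Average the predictions of all saved snapshots."""
    return sum(w * x + b for w, b in snapshots) / len(snapshots)


# Toy usage: data generated from y = 2x + 1 (hypothetical example data).
snapshots = train_with_snapshots([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
prediction = ensemble_predict(snapshots, 4.0)
```

The appeal under a tight time limit is that the ensemble members come from one training run rather than several independent ones.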

Stars: ✭ 263
