keras-quora-question-pairs

A Keras model that addresses the Quora Question Pairs [1] dyadic prediction task.

Model implementation

The Keras model architecture is shown below:

[Keras model architecture for Quora Question Pairs dyadic prediction]

The model architecture is based on the Stanford Natural Language Inference [2] benchmark model developed by Stephen Merity [3], specifically the version using a simple summation of GloVe word embeddings [4] to represent each question in the pair. One difference from the Merity SNLI benchmark is that our final layer is a Dense layer with sigmoid activation, as opposed to softmax. Another key difference is that we combine the word embeddings into a question representation using the max operator rather than summation. We use binary cross-entropy as the loss function and Adam for optimization.
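
As a concrete illustration, a minimal sketch of that architecture in Keras might look like the following. The hyperparameter values and the precomputed embedding_matrix of GloVe vectors are assumptions for illustration, not values taken verbatim from this repository:

from keras.models import Model
from keras.layers import Input, Embedding, Lambda, Dense, BatchNormalization, concatenate
import keras.backend as K

# Illustrative settings -- not values taken verbatim from this repository
MAX_SEQUENCE_LENGTH = 25   # tokens per question after padding/truncation
EMBEDDING_DIM = 300        # GloVe Common Crawl vectors are 300D
VOCAB_SIZE = 100000        # rows in the (assumed) precomputed embedding_matrix
HIDDEN_DIM = 200           # width of the fully-connected layers

def build_model(embedding_matrix):
    q1 = Input(shape=(MAX_SEQUENCE_LENGTH,), dtype='int32')
    q2 = Input(shape=(MAX_SEQUENCE_LENGTH,), dtype='int32')

    # Shared embedding layer initialized with GloVe vectors, frozen in training
    embed = Embedding(VOCAB_SIZE, EMBEDDING_DIM,
                      weights=[embedding_matrix],
                      input_length=MAX_SEQUENCE_LENGTH,
                      trainable=False)

    # Max over the token axis combines word embeddings into a question
    # representation (replacing the summation in the Merity baseline)
    max_pool = Lambda(lambda x: K.max(x, axis=1), output_shape=(EMBEDDING_DIM,))
    r1 = max_pool(embed(q1))
    r2 = max_pool(embed(q2))

    # Fully-connected component: four hidden layers with batch normalization
    x = concatenate([r1, r2])
    for _ in range(4):
        x = Dense(HIDDEN_DIM, activation='relu')(x)
        x = BatchNormalization()(x)

    # Dense sigmoid output rather than the SNLI benchmark's softmax
    is_duplicate = Dense(1, activation='sigmoid')(x)

    model = Model(inputs=[q1, q2], outputs=is_duplicate)
    model.compile(loss='binary_crossentropy', optimizer='adam',
                  metrics=['accuracy'])
    return model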

Evaluation

We partition the Quora question pairs into a 90/10 train/test split. We run training for 25 epochs with a further 90/10 train/validation split, saving the weights from the model checkpoint with the maximum validation accuracy. Training takes approximately 120 seconds per epoch using TensorFlow as the backend for Keras on an Amazon Web Services EC2 p2.xlarge GPU compute instance. We then evaluate the best checkpointed model to obtain a test set accuracy of 0.8291. The table below places this result in the context of other work on the dataset reported to date:

Model                                      | Source of Word Embeddings                              | Accuracy
"BiMPM model" [5]                          | GloVe Common Crawl (840B tokens, 300D)                 | 0.88
"LSTM with concatenation" [6]              | "Quora's text corpus"                                  | 0.87
"LSTM with distance and angle" [6]         | "Quora's text corpus"                                  | 0.87
"Decomposable attention" [6]               | "Quora's text corpus"                                  | 0.86
"L.D.C." [5]                               | GloVe Common Crawl (840B tokens, 300D)                 | 0.86
Max bag-of-embeddings (this work)          | GloVe Common Crawl (840B tokens, 300D)                 | 0.83
"Multi-Perspective-LSTM" [5]               | GloVe Common Crawl (840B tokens, 300D)                 | 0.83
"Siamese-LSTM" [5]                         | GloVe Common Crawl (840B tokens, 300D)                 | 0.83
"Neural bag-of-words" (max) [7]            | GloVe Common Crawl pruned to 1M vocab. (spaCy default) | 0.83
"Neural bag-of-words" (max & mean) [7]     | GloVe Common Crawl pruned to 1M vocab. (spaCy default) | 0.83
"Max-out Window Encoding" with depth 2 [7] | GloVe Common Crawl pruned to 1M vocab. (spaCy default) | 0.83
"Neural bag-of-words" (mean) [7]           | GloVe Common Crawl pruned to 1M vocab. (spaCy default) | 0.81
"Multi-Perspective-CNN" [5]                | GloVe Common Crawl (840B tokens, 300D)                 | 0.81
"Siamese-CNN" [5]                          | GloVe Common Crawl (840B tokens, 300D)                 | 0.80
"Spacy + TD-IDF + Siamese" [8]             | GloVe (6B tokens, 300D)                                | 0.79

Discussion

An initial pass at hyperparameter tuning, evaluating possible settings one hyperparameter at a time (see the sketch following this list), led to the following observations:

  • Computing the question representation by applying the max operator to the word embeddings slightly outperformed using mean and sum, which is consistent with what is reported in [7].
  • Computing the question representation using max also slightly outperformed the use of bidirectional LSTM and GRU recurrent layers, again as discussed in [7].
  • Batch normalization improved accuracy, as observed by [8].
  • Any amount of dropout decreased accuracy, as also observed by [8].
  • Of the zero to six hidden layers evaluated, four hidden layers in the fully-connected component yielded the best accuracy.
  • Of the tested layer widths (50, 100, 200, and 300 dimensions), 200-dimensional layers in the fully-connected component yielded the best accuracy.
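
A minimal sketch of this one-hyperparameter-at-a-time procedure follows; train_and_score is a hypothetical stand-in for the build/train/evaluate cycle shown in the earlier sketches, and the default values and sweep ranges are illustrative:

def train_and_score(config):
    """Hypothetical stand-in: build, train, and evaluate a model with the
    given settings, returning the best validation accuracy observed."""
    raise NotImplementedError

# Current best settings; each sweep varies one hyperparameter while the
# others are held at these defaults
defaults = {'pooling': 'max', 'n_hidden': 4, 'hidden_dim': 200, 'dropout': 0.0}

sweeps = {
    'pooling': ['max', 'mean', 'sum'],
    'n_hidden': [0, 1, 2, 3, 4, 5, 6],
    'hidden_dim': [50, 100, 200, 300],
    'dropout': [0.0, 0.1, 0.2],
}

for name, values in sweeps.items():
    for value in values:
        config = dict(defaults, **{name: value})
        print(name, value, train_and_score(config))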

Future work

A more principled (and more computationally intensive) campaign of randomized search over the space of hyperparameter configurations is planned.
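
For illustration only, such a randomized search might sample complete configurations from the space rather than varying one setting at a time; train_and_score is the same hypothetical helper stubbed in the sweep sketch above:

import random

# Randomized search: each trial samples a full configuration from the space
space = {
    'pooling': ['max', 'mean', 'sum'],
    'n_hidden': range(7),
    'hidden_dim': [50, 100, 200, 300],
    'dropout': [0.0, 0.1, 0.2, 0.3],
}

for trial in range(50):
    config = {name: random.choice(list(values)) for name, values in space.items()}
    print(trial, config, train_and_score(config))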

Requirements

  • Python 3.5.2
  • jupyter 4.2.1

Package dependencies

  • numpy 1.12.1
  • pandas 0.19.2
  • matplotlib 1.5.3
  • Keras 2.0.4
  • scikit-learn 0.18.1
  • h5py 2.6.0
  • hdf5 1.8.17

Usage

This repository provides two ways to create and run the model.

From the command line

$ python3 keras-quora-question-pairs.py

On first execution, this will download the required Quora and GloVe datasets and generate files that cache the training data and related word count and embedding data for subsequent runs.

As Jupyter notebooks

Simply run the notebook server using the standard Jupyter command:

$ jupyter notebook

First run

quora-question-pairs-data-prep.ipynb

As with the script above, this will generate files for training the Keras model. Then run

quora-question-pairs-training.ipynb

next to train and evaluate the model.

License

MIT. See the LICENSE file for the copyright notice.

References

[1] Shankar Iyer, Nikhil Dandekar, and Kornél Csernai. "First Quora Dataset Release: Question Pairs," 24 January 2017. Retrieved at https://data.quora.com/First-Quora-Dataset-Release-Question-Pairs on 31 January 2017.

[2] Samuel R. Bowman, Gabor Angeli, Christopher Potts, and Christopher D. Manning. "A large annotated corpus for learning natural language inference," in Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015), September 2015.

[3] Stephen Merity. "Keras SNLI baseline example," 4 September 2016. Retrieved at https://github.com/Smerity/keras_snli on 31 January 2017.

[4] Jeffrey Pennington, Richard Socher, and Christopher D. Manning. "GloVe: Global Vectors for Word Representation," in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), October 2014.

[5] Zhiguo Wang, Wael Hamza, and Radu Florian. "Bilateral Multi-Perspective Matching for Natural Language Sentences," 13 February 2017. Retrieved at https://arxiv.org/pdf/1702.03814.pdf on 14 February 2017.

[6] Lili Jiang, Shuo Chang, and Nikhil Dandekar. "Semantic Question Matching with Deep Learning," 13 February 2017. Retrieved at https://engineering.quora.com/Semantic-Question-Matching-with-Deep-Learning on 13 February 2017.

[7] Matthew Honnibal. "Deep text-pair classification with Quora's 2017 question dataset," 13 February 2017. Retrieved at https://explosion.ai/blog/quora-deep-text-pair-classification on 13 February 2017.

[8] Eren Golge. "Duplicate Question Detection with Deep Learning on Quora Dataset," 12 February 2017. Retrieved at http://www.erogol.com/duplicate-question-detection-deep-learning/ on 13 February 2017.
