Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → dalmia → Quora-Question-Pairs

dalmia / Quora-Question-Pairs

Licence: GPL-3.0 license

The code for our submission in Kaggle's competition Quora Question Pairs which ranked in the top 25%.

Programming Languages

139335 projects - #7 most used programming language

77523 projects

Labels

nlp machine-learning deep-learning tensorflow quora-question-pairs

Projects that are alternatives of or similar to Quora-Question-Pairs

A siamese LSTM to detect sentence/question pairs.

Stars: ✭ 25 (-16.67%)

Mutual labels: quora-question-pairs

bert quora question pairs

BERT Model Fine-tuning on Quora Questions Pairs

Stars: ✭ 28 (-6.67%)

Mutual labels: quora-question-pairs

Quora question pairs NLP Kaggle

Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training

Stars: ✭ 17 (-43.33%)

Mutual labels: quora-question-pairs

Quora-Paraphrase-Question-Identification

Paraphrase question identification using Feature Fusion Network (FFN).

Stars: ✭ 19 (-36.67%)

Mutual labels: quora-question-pairs

SentenceSimilarity

The enhanced RCNN model used for sentence similarity classification

Stars: ✭ 41 (+36.67%)

Mutual labels: quora-question-pairs

Text pair classification

Stars: ✭ 12 (-60%)

Mutual labels: quora-question-pairs

Quora-Question-Pairs

This repository contains the code for our submission in Kaggle's competition Quora Question Pairs in which we ranked in the top 25%. A detailed report for the project can be found here.

Data

train.csv contains ~ 400k question pairs along with the corresponding label (duplicate or not) and test.csv contains ~ 2300k question pairs. Both the files can be found here.

Model Architecture

We use a Siamese Neural Network architecture with Gated Recurrent Units in combination with traditional Machine Learning algorithms like Random Forest, SVM and Adaboost.

Running the model

Firstly, place the train.csv, test.csv (see the Data section above) and the pre-trained GloVe embeddings in the input folder. You can download the embeddings from here. Then, simply run the bash script:

bash run_model.sh

Contributors

Dependencies

numpy
pandas
nltk
sklearn
TensorFlow

Install them using pip.

Note

If there is any issue running the code, please post it in the issue tracker.
If you like this repo and find it useful, please consider ★ starring it (on top right of the page) :)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 30

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗