aswalin / Squad

Building a QA system for the Stanford Question Answering Dataset

SQuAD

Building a QA system for the Stanford Question Answering Dataset (https://rajpurkar.github.io/SQuAD-explorer/)

For details, please read this blog post: https://towardsdatascience.com/building-a-question-answering-system-part-1-9388aadff507

The first file, create_emb.ipynb, creates a dictionary of sentence embeddings for all the sentences and questions in the Wikipedia articles of the training dataset.
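As a rough illustration of what such a dictionary could look like, the sketch below walks a tiny in-memory sample in the SQuAD v1.1 JSON layout and maps each context sentence and each question to a vector. The names split_sentences, toy_embed and build_embedding_dict are hypothetical, the sentence splitter is deliberately naive, and toy_embed (hashed word counts) is only a stand-in: this README does not say which sentence encoder the notebook uses.

```python
import hashlib
import re


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def toy_embed(sentence, dim=8):
    """Stand-in for a real sentence encoder (assumption): hashed word counts."""
    vec = [0.0] * dim
    for token in re.findall(r"\w+", sentence.lower()):
        bucket = int(hashlib.md5(token.encode("utf-8")).hexdigest(), 16) % dim
        vec[bucket] += 1.0
    return vec


def build_embedding_dict(squad, embed=toy_embed):
    """Map every context sentence and every question to its embedding."""
    embeddings = {}
    for article in squad["data"]:
        for paragraph in article["paragraphs"]:
            texts = split_sentences(paragraph["context"])
            texts += [qa["question"] for qa in paragraph["qas"]]
            for text in texts:
                if text not in embeddings:
                    embeddings[text] = embed(text)
    return embeddings


# Tiny made-up sample in the SQuAD v1.1 JSON layout (not real dataset content).
sample = {
    "data": [{
        "title": "Example",
        "paragraphs": [{
            "context": "Paris is the capital of France. It lies on the Seine.",
            "qas": [{
                "id": "q1",
                "question": "What is the capital of France?",
                "answers": [{"text": "Paris", "answer_start": 0}],
            }],
        }],
    }]
}

emb = build_embedding_dict(sample)
print(len(emb), "texts embedded")
```

Keying the dictionary by the text itself means a sentence or question is embedded only once, even if it appears several times.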

The second file, unsupervised.ipynb, uses the sentence embeddings to calculate the distance between each question and the sentences of its paragraph, based on Euclidean distance and cosine similarity. It then extracts the sentence from each paragraph that has the minimum distance from the question. Currently, the two measures give an accuracy of 45% and 63% respectively.
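The selection step described above can be sketched as follows. This is a minimal pure-Python illustration, not the notebook's code: the function names are hypothetical, the vectors are made-up two-dimensional toys, and returning 1.0 for a zero vector is an assumption.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_distance(a, b):
    """1 - cosine similarity; smaller means more similar in direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0  # assumption: treat a zero vector as dissimilar
    return 1.0 - dot / (norm_a * norm_b)


def closest_sentence(question_vec, sentence_vecs, distance):
    """Index of the paragraph sentence with the minimum distance."""
    distances = [distance(question_vec, s) for s in sentence_vecs]
    return min(range(len(distances)), key=distances.__getitem__)


def accuracy(predictions, targets):
    """Fraction of questions whose predicted sentence index is correct."""
    return sum(p == t for p, t in zip(predictions, targets)) / len(targets)


# Toy embeddings: one question and a three-sentence paragraph.
question = [1.0, 0.0]
sentences = [[0.9, 0.1], [10.0, 0.0], [0.0, 1.0]]

pred_euclid = closest_sentence(question, sentences, euclidean_distance)
pred_cosine = closest_sentence(question, sentences, cosine_distance)
print(pred_euclid, pred_cosine)
```

In this toy case the two measures pick different sentences: the second sentence points in exactly the same direction as the question but has a much larger magnitude, so cosine distance favors it while Euclidean distance does not. A difference of this kind is one way the two measures can end up with different accuracies.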

The last file treats this as a supervised learning problem, where I fit multinomial logistic regression, random forest and XGBoost models on 20 features. I limit each paragraph to 10 sentences, and each sentence contributes 2 features: its cosine distance and its Euclidean distance from the question. The target variable is the ID of the sentence containing the correct answer, so there are 10 labels. These models currently give an accuracy of 63%, 65% and 69% respectively.
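A minimal sketch of how such a 20-feature row might be assembled is shown below. Several details are assumptions that this README does not specify: the feature ordering (10 cosine distances followed by 10 Euclidean distances), the padding value used for paragraphs with fewer than 10 sentences, and dropping questions whose answer lies beyond the tenth sentence. All names are hypothetical.

```python
import math

MAX_SENTENCES = 10  # sentence cap per paragraph, as stated in the README
PAD_VALUE = 1.0     # assumption: filler distance for missing sentences


def cosine_distance(a, b):
    """1 - cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0  # assumption: treat a zero vector as dissimilar
    return 1.0 - dot / (norm_a * norm_b)


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_features(question_vec, sentence_vecs):
    """Return 20 values: 10 cosine distances, then 10 Euclidean distances."""
    kept = sentence_vecs[:MAX_SENTENCES]
    padding = [PAD_VALUE] * (MAX_SENTENCES - len(kept))
    cosine = [cosine_distance(question_vec, s) for s in kept] + padding
    euclid = [euclidean_distance(question_vec, s) for s in kept] + padding
    return cosine + euclid


def build_example(question_vec, sentence_vecs, answer_sentence_id):
    """Pair a feature row with its label; skip answers past the cap."""
    if answer_sentence_id >= MAX_SENTENCES:
        return None
    return build_features(question_vec, sentence_vecs), answer_sentence_id


# Toy case: a two-sentence paragraph whose answer is in sentence 1.
features, label = build_example([1.0, 0.0], [[0.9, 0.1], [0.0, 1.0]], 1)
print(len(features), label)
```

Rows built this way, together with their labels, form the feature matrix and target vector that a multiclass classifier such as logistic regression, random forest or XGBoost can be fitted on.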

Future Work: Use RNNs to get the exact answer
