Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → Tixierae → Deep_learning_nlp

Tixierae / Deep_learning_nlp

Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP

Programming Languages

python3

1442 projects

Labels

jupyter-notebook deep-learning pytorch nlp keras numpy recurrent-neural-networks word2vec attention word-embeddings cnn-keras nmt

Projects that are alternatives of or similar to Deep learning nlp

Glove As A Tensorflow Embedding Layer

Taking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.

Stars: ✭ 85 (-79.12%)

Mutual labels: jupyter-notebook, word2vec, word-embeddings

Ner Bert

BERT-NER (nert-bert) with google bert https://github.com/google-research.

Stars: ✭ 339 (-16.71%)

Mutual labels: jupyter-notebook, attention, nmt

Stock Price Predictor

This project seeks to utilize Deep Learning models, Long-Short Term Memory (LSTM) Neural Network algorithm, to predict stock prices.

Stars: ✭ 146 (-64.13%)

Mutual labels: jupyter-notebook, recurrent-neural-networks, numpy

Rnn lstm from scratch

How to build RNNs and LSTMs from scratch with NumPy.

Stars: ✭ 156 (-61.67%)

Mutual labels: jupyter-notebook, recurrent-neural-networks, numpy

Pytorch Sentiment Analysis

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

Stars: ✭ 3,209 (+688.45%)

Mutual labels: jupyter-notebook, recurrent-neural-networks, word-embeddings

Deeplearning Nlp Models

A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.

Stars: ✭ 64 (-84.28%)

Mutual labels: jupyter-notebook, word2vec, attention

Thesemicolon

This repository contains Ipython notebooks and datasets for the data analytics youtube tutorials on The Semicolon.

Stars: ✭ 345 (-15.23%)

Mutual labels: jupyter-notebook, cnn-keras, numpy

Image Caption Generator

A neural network to generate captions for an image using CNN and RNN with BEAM Search.

Stars: ✭ 126 (-69.04%)

Mutual labels: recurrent-neural-networks, attention, cnn-keras

Germanwordembeddings

Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets

Stars: ✭ 189 (-53.56%)

Mutual labels: jupyter-notebook, word2vec, word-embeddings

Debiaswe

Remove problematic gender bias from word embeddings.

Stars: ✭ 175 (-57%)

Mutual labels: jupyter-notebook, word2vec, word-embeddings

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (-60.44%)

Mutual labels: jupyter-notebook, recurrent-neural-networks, attention

automatic-personality-prediction

[AAAI 2020] Modeling Personality with Attentive Networks and Contextual Embeddings

Stars: ✭ 43 (-89.43%)

Mutual labels: recurrent-neural-networks, attention, cnn-keras

datastories-semeval2017-task6

Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".

Stars: ✭ 20 (-95.09%)

Mutual labels: word-embeddings, recurrent-neural-networks, attention

NTUA-slp-nlp

💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA

Stars: ✭ 19 (-95.33%)

Mutual labels: word2vec, word-embeddings, attention

Python

This repository helps you understand python from the scratch.

Stars: ✭ 285 (-29.98%)

Mutual labels: jupyter-notebook, numpy

Pysynth

Several simple music synthesizers in Python 3. Input from ABC or MIDI files is also supported.

Stars: ✭ 279 (-31.45%)

Mutual labels: jupyter-notebook, numpy

Gdrl

Grokking Deep Reinforcement Learning

Stars: ✭ 304 (-25.31%)

Mutual labels: jupyter-notebook, numpy

Biosentvec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

Stars: ✭ 308 (-24.32%)

Mutual labels: jupyter-notebook, word-embeddings

Hands On Deep Learning Algorithms With Python

Master Deep Learning Algorithms with Extensive Math by Implementing them using TensorFlow

Stars: ✭ 272 (-33.17%)

Mutual labels: jupyter-notebook, word-embeddings

Zhihu

This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.

Stars: ✭ 3,307 (+712.53%)

Mutual labels: jupyter-notebook, recurrent-neural-networks

View All Similar Projects ➔

❤️ Deep Learning architectures for NLP

This repository contains Keras, PyTorch and NumPy implementations of some deep learning architectures for NLP. For a quick theoretical intro about Deep Learning for NLP, I encourage you to have a look at my notes.

Neural Machine Translation (NMT) in PyTorch

https://github.com/Tixierae/deep_learning_NLP/tree/master/NMT

A compact, fully functional, and well-commented PyTorch implementation of the classical seq2seq model "Effective Approaches to Attention-based Neural Machine Translation" (Luong et al. 2015), with support for the three global attention mechanisms presented in subsection 3.1 of the paper: (1) dot, (2) general, and (3) concat, and also stacking vs non-stacking RNN encoder and decoder, and bidirectional vs unidirectional RNN encoder.

We experiment on a toy English -> French dataset from http://www.manythings.org/anki/, originally extracted from the Tatoeba project, with 136,521 sentence pairs for training and 34,130 pairs for testing.

Word2vec and doc2vec by hand in NumPy

https://github.com/Tixierae/deep_learning_NLP/blob/master/skipgram/sg_d2v_numpy.ipynb

In this notebook, we learn word and document vectors completely by hand on the IMDB movie review dataset, with just a for loop and NumPy! We implement the following models:

word2vec's skip-gram with negative sampling, as introduced in Efficient Estimation of Word Representations in Vector Space and Distributed Representations of Words and Phrases and their Compositionality.
doc2vec (a.k.a. Paragraph Vector)'s distributed bag-of-words, following Distributed Representations of Sentences and Documents.

We also:

write an inference function to compute the vector of any new document
visualize word and document vectors separately, and together in the same space

Hierarchical Attention Network for Document Classification

A RAM-friendly implementation of the model introduced by Yang et al. (2016), with step-by-step explanations and links to relevant resources: https://github.com/Tixierae/deep_learning_NLP/blob/master/HAN/HAN_final.ipynb

In my experiments on the Amazon review dataset (3,650,000 documents, 5 classes), I reach 62.6% accuracy after 8 epochs, and 63.6% accuracy (the accuracy reported in the paper) after 42 epochs. Each epoch takes about 20 mins on my TitanX GPU. I deployed the model as a web app. As shown in the image below, you can paste your own review and visualize how the model pays attention to words and sentences.

Concepts covered

The notebook makes use of the following concepts:

batch training. Batches are loaded from disk and passed to the model one by one with a generator. This way, it's possible to train on datasets that are too big to fit on RAM.
bucketing. To have batches that are as dense as possible and make the most of each tensor product, the batches contain documents of similar sizes.
cyclical learning rate and cyclical momentum schedules, as in Smith (2017) and Smith (2018). The cyclical learning rate schedule is a new, promising approach to optimization in which the learning rate increases and decreases in a pre-defined interval rather than keeping decreasing. It worked better than Adam and SGD alone for me¹.
self-attention (aka inner attention). We use the formulation of the original paper.
bidirectional RNN
Gated Recurrent Unit (GRU)

¹There is more and more evidence that adaptive optimizers like Adam, Adagrad, etc. converge faster but generalize poorly compared to SGD-based approaches. For example: Wilson et al. (2018), this blogpost. Traditional SGD is very slow, but a cyclical learning rate schedule can bring a significant speedup, and even sometimes allow to reach better performance.

1D Convolutional Neural Network for short text classification

An implementation of (Kim 2014)'s 1D Convolutional Neural Network for short text classification: https://github.com/Tixierae/deep_learning_NLP/blob/master/CNN_IMDB/cnn_imdb.ipynb

2D CNN for image classification

Agreed, this is not for NLP. But an implementation can be found here https://github.com/Tixierae/deep_learning_NLP/blob/master/CNN_MNIST/mnist_cnn.py. I reach 99.45% accuracy on MNIST with it.

Inverted index and TF-IDF by hand

This notebook provides simple functions to clean and index documents, and to execute word and phrase queries. It also shows how to compute TF-IDF coefficients. https://github.com/Tixierae/deep_learning_NLP/blob/master/other/inverted_index_tfidf.ipynb

Cite 👍

If you use some of the code in this repository in your work, please cite

@article{tixier2018notes,
  title={Notes on Deep Learning for NLP},
  author={Tixier, Antoine J.-P.},
  journal={arXiv preprint arXiv:1808.09772},
  year={2018}
}

Tixier, A. J. P. (2018). Notes on Deep Learning for NLP. arXiv preprint arXiv:1808.09772.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 407

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (1) 🔗