
asappresearch / sru

License: MIT
SRU is a recurrent unit that can run over 10 times faster than cuDNN LSTM without loss of accuracy, as tested on many tasks.

Programming Languages

Python
Cuda
C++
Shell
CMake

Projects that are alternatives of or similar to Sru

Deepecg
ECG classification programs based on ML/DL methods
Stars: ✭ 124 (-93.83%)
Mutual labels:  recurrent-neural-networks
Image Caption Generator
[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow
Stars: ✭ 141 (-92.98%)
Mutual labels:  recurrent-neural-networks
Rnn lstm from scratch
How to build RNNs and LSTMs from scratch with NumPy.
Stars: ✭ 156 (-92.23%)
Mutual labels:  recurrent-neural-networks
Deep Lyrics
Lyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network
Stars: ✭ 127 (-93.68%)
Mutual labels:  recurrent-neural-networks
Document Classifier Lstm
A bidirectional LSTM with attention for multiclass/multilabel text classification.
Stars: ✭ 136 (-93.23%)
Mutual labels:  recurrent-neural-networks
Speech Recognition Neural Network
This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Stars: ✭ 148 (-92.63%)
Mutual labels:  recurrent-neural-networks
Linear Attention Recurrent Neural Network
A recurrent attention module consisting of an LSTM cell which can query its own past cell states by the means of windowed multi-head attention. The formulas are derived from the BN-LSTM and the Transformer Network. The LARNN cell with attention can be easily used inside a loop on the cell state, just like any other RNN. (LARNN)
Stars: ✭ 119 (-94.08%)
Mutual labels:  recurrent-neural-networks
Emotion Recognition Using Speech
Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
Stars: ✭ 159 (-92.09%)
Mutual labels:  recurrent-neural-networks
Crypto Rnn
Learning the Enigma with Recurrent Neural Networks
Stars: ✭ 139 (-93.08%)
Mutual labels:  recurrent-neural-networks
Lrp for lstm
Layer-wise Relevance Propagation (LRP) for LSTMs
Stars: ✭ 152 (-92.43%)
Mutual labels:  recurrent-neural-networks
Image Caption Generator
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Stars: ✭ 126 (-93.73%)
Mutual labels:  recurrent-neural-networks
Stockprediction
Plain Stock Close-Price Prediction via Graves LSTM RNNs
Stars: ✭ 134 (-93.33%)
Mutual labels:  recurrent-neural-networks
Stock Price Predictor
This project seeks to utilize Deep Learning models, Long-Short Term Memory (LSTM) Neural Network algorithm, to predict stock prices.
Stars: ✭ 146 (-92.73%)
Mutual labels:  recurrent-neural-networks
Rcnn Text Classification
Tensorflow Implementation of "Recurrent Convolutional Neural Network for Text Classification" (AAAI 2015)
Stars: ✭ 127 (-93.68%)
Mutual labels:  recurrent-neural-networks
Brain.js
brain.js is a GPU accelerated library for Neural Networks written in JavaScript.
Stars: ✭ 12,358 (+515.13%)
Mutual labels:  recurrent-neural-networks
Rnn From Scratch
Use tensorflow's tf.scan to build vanilla, GRU and LSTM RNNs
Stars: ✭ 123 (-93.88%)
Mutual labels:  recurrent-neural-networks
Arc Pytorch
The first public PyTorch implementation of Attentive Recurrent Comparators
Stars: ✭ 147 (-92.68%)
Mutual labels:  recurrent-neural-networks
Hey Jetson
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (-91.99%)
Mutual labels:  recurrent-neural-networks
Keras Lmu
Keras implementation of Legendre Memory Units
Stars: ✭ 160 (-92.04%)
Mutual labels:  recurrent-neural-networks
Tfvos
Semi-Supervised Video Object Segmentation (VOS) with Tensorflow. Includes implementation of *MaskRNN: Instance Level Video Object Segmentation (NIPS 2017)* as part of the NIPS Paper Implementation Challenge.
Stars: ✭ 151 (-92.48%)
Mutual labels:  recurrent-neural-networks

News

SRU++, a new SRU variant, is released. [tech report] [blog]

The experimental code and SRU++ implementation are available on the dev branch, which will be merged into master later.

About

SRU is a recurrent unit that can run over 10 times faster than cuDNN LSTM without loss of accuracy, as tested on many tasks.


Figure: average processing time of LSTM, conv2d, and SRU, tested on a GTX 1070

For example, the figure above presents the processing time of a single mini-batch of 32 samples. SRU achieves 10 to 16 times speed-up compared to LSTM, and operates as fast as (or faster than) word-level convolution using conv2d.
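To get a rough feel for this comparison on your own GPU, the sketch below times forward passes of SRU against nn.LSTM on random data. This is a minimal sketch, not the benchmark setup used for the figure; the layer sizes, iteration count, and timing method are illustrative choices.

import time
import torch
import torch.nn as nn
from sru import SRU

# illustrative sizes (not the figure's exact setup): length 20, batch 32, dimension 128
length, batch_size, dim = 20, 32, 128
x = torch.randn(length, batch_size, dim).cuda()

lstm = nn.LSTM(dim, dim, num_layers=2).cuda()
sru = SRU(dim, dim, num_layers=2).cuda()

def avg_forward_time(module, x, n_iters=100):
    module(x)                     # warm-up pass so one-time setup is not counted
    torch.cuda.synchronize()
    start = time.time()
    for _ in range(n_iters):
        module(x)
    torch.cuda.synchronize()      # wait for all queued CUDA kernels to finish
    return (time.time() - start) / n_iters

print("LSTM forward: %.5f s" % avg_forward_time(lstm, x))
print("SRU forward:  %.5f s" % avg_forward_time(sru, x))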

Reference:

Simple Recurrent Units for Highly Parallelizable Recurrence [paper]

@inproceedings{lei2018sru,
  title={Simple Recurrent Units for Highly Parallelizable Recurrence},
  author={Tao Lei and Yu Zhang and Sida I. Wang and Hui Dai and Yoav Artzi},
  booktitle={Empirical Methods in Natural Language Processing (EMNLP)},
  year={2018}
}

When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute [paper]

@article{lei2021srupp,
  title={When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute},
  author={Tao Lei},
  journal={arXiv preprint arXiv:2102.12459},
  year={2021}
}

Requirements

Install requirements via pip install -r requirements.txt.


Installation

From source:

SRU can be installed as a regular package by running python setup.py install or pip install . from the repository root.

From PyPi:

pip install sru

Directly use the source without installation:

Make sure this repo and the CUDA library can be found by the system, e.g.

export PYTHONPATH=path_to_repo/sru
export LD_LIBRARY_PATH=/usr/local/cuda/lib64
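
As a rough alternative to setting PYTHONPATH, the module search path can also be extended from inside Python before importing the package; the path below is the same placeholder used in the export above, standing in for wherever the repository was cloned.

import sys

# placeholder path: point this at the cloned repository, as with PYTHONPATH above
sys.path.insert(0, "path_to_repo/sru")

from sru import SRU   # now importable without installing the package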

Examples

The usage of SRU is similar to nn.LSTM. SRU typically requires more stacked layers than LSTM. We recommend starting with 2 layers and using more if necessary (see our report for more experimental details).

import torch
from sru import SRU, SRUCell

# input has length 20, batch size 32 and dimension 128
x = torch.randn(20, 32, 128).cuda()

input_size, hidden_size = 128, 128

rnn = SRU(input_size, hidden_size,
    num_layers = 2,          # number of stacking RNN layers
    dropout = 0.0,           # dropout applied between RNN layers
    bidirectional = False,   # bidirectional RNN
    layer_norm = False,      # apply layer normalization on the output of each layer
    highway_bias = -2,       # initial bias of highway gate (<= 0)
)
rnn.cuda()

output_states, c_states = rnn(x)      # forward pass

# output_states is (length, batch size, number of directions * hidden size)
# c_states is (layers, batch size, number of directions * hidden size)
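
Because the interface mirrors nn.LSTM, SRU can be dropped into a typical sequence model. Below is a minimal, hypothetical sketch of a text classifier; the vocabulary size, embedding dimension, number of classes, and the choice to classify from the last layer's final state are illustrative assumptions, not part of the SRU API.

import torch
import torch.nn as nn
from sru import SRU

class SRUClassifier(nn.Module):
    # minimal sketch: embed tokens, run a 2-layer SRU, classify from the final state
    def __init__(self, vocab_size=10000, embed_dim=128, hidden_size=128, num_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.rnn = SRU(embed_dim, hidden_size, num_layers=2)
        self.out = nn.Linear(hidden_size, num_classes)

    def forward(self, tokens):
        # tokens: (length, batch) of token ids
        x = self.embed(tokens)                 # (length, batch, embed_dim)
        output_states, c_states = self.rnn(x)  # shapes as in the comments above
        return self.out(c_states[-1])          # classify from the last layer's final state

model = SRUClassifier().cuda()
tokens = torch.randint(0, 10000, (20, 32)).cuda()   # dummy batch of token ids
logits = model(tokens)                               # (32, num_classes)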

Contributing

Please read and follow the guidelines.

Other Implementations

@musyoku has a very nice SRU implementation in Chainer.

@adrianbg implemented the first CPU version.

