
lmnt-com / Haste

License: Apache-2.0
Haste: a fast, simple, and open RNN library

Programming Languages

  • Python (139,335 projects; #7 most used programming language)
  • C++ (1,120 projects)

Projects that are alternatives to or similar to Haste

theano-recurrence
Recurrent Neural Networks (RNN, GRU, LSTM) and their Bidirectional versions (BiRNN, BiGRU, BiLSTM) for word & character level language modelling in Theano
Stars: ✭ 40 (-81.31%)
Mutual labels:  lstm, gru, rnn
Pytorch-POS-Tagger
Part-of-Speech Tagger and custom implementations of LSTM, GRU and Vanilla RNN
Stars: ✭ 24 (-88.79%)
Mutual labels:  lstm, gru, rnn
ConvLSTM-PyTorch
ConvLSTM/ConvGRU (Encoder-Decoder) with PyTorch on Moving-MNIST
Stars: ✭ 202 (-5.61%)
Mutual labels:  lstm, gru, rnn
Rnn ctc
Recurrent Neural Network and Long Short-Term Memory (LSTM) with Connectionist Temporal Classification, implemented in Theano. Includes a toy training example.
Stars: ✭ 220 (+2.8%)
Mutual labels:  lstm, rnn, gru
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+879.91%)
Mutual labels:  lstm, rnn, gru
tf-ran-cell
Recurrent Additive Networks for Tensorflow
Stars: ✭ 16 (-92.52%)
Mutual labels:  lstm, gru, rnn
Load forecasting
Load forecasting of Delhi-area electric power load using ARIMA, RNN, LSTM, and GRU models
Stars: ✭ 160 (-25.23%)
Mutual labels:  lstm, rnn, gru
Pytorch Seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
Stars: ✭ 3,418 (+1497.2%)
Mutual labels:  lstm, rnn, gru
Eeg Dl
A Deep Learning library for EEG Tasks (Signals) Classification, based on TensorFlow.
Stars: ✭ 165 (-22.9%)
Mutual labels:  lstm, rnn, gru
Rnn Notebooks
RNN (SimpleRNN, LSTM, GRU) notebooks for TensorFlow 2.0 & Keras (workshop materials)
Stars: ✭ 48 (-77.57%)
Mutual labels:  lstm, rnn, gru
myDL
Deep Learning
Stars: ✭ 18 (-91.59%)
Mutual labels:  lstm, gru, rnn
See Rnn
RNN and general weights, gradients, & activations visualization in Keras & TensorFlow
Stars: ✭ 102 (-52.34%)
Mutual labels:  lstm, rnn, gru
Easy Deep Learning With Keras
Keras tutorial for beginners (using TF backend)
Stars: ✭ 367 (+71.5%)
Mutual labels:  lstm, rnn, gru
Gdax Orderbook Ml
Application of machine learning to the Coinbase (GDAX) orderbook
Stars: ✭ 60 (-71.96%)
Mutual labels:  lstm, gru, cuda
Pytorch Rnn Text Classification
Word Embedding + LSTM + FC
Stars: ✭ 112 (-47.66%)
Mutual labels:  lstm, rnn, gru
Lstm Crypto Price Prediction
Predicting price trends in crypto markets using an LSTM RNN, for use in a trading bot
Stars: ✭ 136 (-36.45%)
Mutual labels:  lstm, rnn
Chinese Chatbot
A Chinese chatbot trained on 100,000 dialogue pairs with an attention mechanism; it generates a meaningful reply to most ordinary questions. The trained model is included and can be run directly (and if it doesn't run, the author promises to livestream eating their keyboard).
Stars: ✭ 124 (-42.06%)
Mutual labels:  lstm, rnn
Rnn poetry generator
Generating classical Chinese poetry with an RNN
Stars: ✭ 143 (-33.18%)
Mutual labels:  lstm, rnn
Skip Thoughts.torch
Porting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7
Stars: ✭ 146 (-31.78%)
Mutual labels:  rnn, gru
Onemkl
oneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-42.99%)
Mutual labels:  api, cuda


Haste is a CUDA implementation of fused RNN layers with built-in DropConnect and Zoneout regularization. These layers are exposed through C++ and Python APIs for easy integration into your own projects or machine learning frameworks.
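For intuition, here is a small illustrative NumPy sketch of the two regularizers (not Haste's actual CUDA kernels, and the scaling details may differ): DropConnect randomly drops individual recurrent weights at training time, while Zoneout randomly preserves units of the previous hidden state instead of updating them.

import numpy as np

rng = np.random.default_rng(0)

def dropconnect(W, p):
    # Drop individual weights with probability p; this variant rescales the
    # survivors (inverted-dropout style) so no correction is needed at test time.
    mask = rng.random(W.shape) >= p
    return W * mask / (1.0 - p)

def zoneout(h_prev, h_new, z, training=True):
    # With probability z, keep the previous hidden state instead of the update.
    if training:
        keep = rng.random(h_prev.shape) < z
        return np.where(keep, h_prev, h_new)
    # At inference, use the expectation over the Bernoulli mask.
    return z * h_prev + (1.0 - z) * h_new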

Which RNN types are supported?

Haste currently provides LSTM, GRU, IndRNN, and layer-normalized LSTM and GRU layers (the same layers shown in the API examples below).

What's included in this project?

  • a standalone C++ API (libhaste)
  • a TensorFlow Python API (haste_tf)
  • a PyTorch API (haste_pytorch)
  • examples for writing your own custom C++ inference / training code using libhaste
  • benchmarking programs to evaluate the performance of RNN implementations

For questions or feedback about Haste, please open an issue on GitHub or send us an email at [email protected].

Install

Here's what you'll need to get started:

  • a CUDA-capable GPU and the CUDA Toolkit (see the note about $CUDA_HOME below)
  • TensorFlow and/or PyTorch, depending on which Python API you want to use

Once you have the prerequisites, you can install with pip or by building the source code.

Using pip

pip install haste_pytorch
pip install haste_tf

Building from source

make               # Build everything
make haste         # ;) Build C++ API
make haste_tf      # Build TensorFlow API
make haste_pytorch # Build PyTorch API
make examples      # Build C++ examples
make benchmarks    # Build benchmark programs

If you built the TensorFlow or PyTorch API, install it with pip:

pip install haste_tf-*.whl
pip install haste_pytorch-*.whl

If the CUDA Toolkit that you're building against is not in /usr/local/cuda, you must specify the $CUDA_HOME environment variable before running make:

CUDA_HOME=/usr/local/cuda-10.2 make

Performance

Our LSTM and GRU benchmarks indicate that Haste has the fastest publicly available implementation for nearly all problem sizes. The following charts show our LSTM results, but the GRU results are qualitatively similar.

Here is our complete LSTM benchmark result grid, covering batch sizes N ∈ {1, 32, 64, 128} and hidden sizes C ∈ {64, 128, 256, 512}:

[benchmark charts for each (N, C) combination]
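As a rough way to run this kind of comparison yourself, the sketch below times a Haste LSTM against torch.nn.LSTM on one problem size. It is a simplified stand-in for the programs in benchmarks/ (not the official benchmark code), assumes default regularization arguments, and the numbers will vary with your GPU and CUDA version.

import torch
import haste_pytorch as haste

T, N, C, H = 250, 64, 256, 256              # time steps, batch, input size, hidden size
x = torch.rand([T, N, C], device='cuda')    # [T, N, C], matching the PyTorch example below

def time_layer(layer, x, iters=100):
    # Average milliseconds per forward pass, measured with CUDA events.
    with torch.no_grad():
        layer(x)                            # warm-up
        torch.cuda.synchronize()
        start = torch.cuda.Event(enable_timing=True)
        end = torch.cuda.Event(enable_timing=True)
        start.record()
        for _ in range(iters):
            layer(x)
        end.record()
        torch.cuda.synchronize()
    return start.elapsed_time(end) / iters

haste_lstm = haste.LSTM(input_size=C, hidden_size=H).cuda()
torch_lstm = torch.nn.LSTM(input_size=C, hidden_size=H).cuda()

print('haste.LSTM:    %.3f ms' % time_layer(haste_lstm, x))
print('torch.nn.LSTM: %.3f ms' % time_layer(torch_lstm, x))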

Documentation

TensorFlow API

import tensorflow as tf
import haste_tf as haste

gru_layer = haste.GRU(num_units=256, direction='bidirectional', zoneout=0.1, dropout=0.05)
indrnn_layer = haste.IndRNN(num_units=256, direction='bidirectional', zoneout=0.1)
lstm_layer = haste.LSTM(num_units=256, direction='bidirectional', zoneout=0.1, dropout=0.05)
norm_gru_layer = haste.LayerNormGRU(num_units=256, direction='bidirectional', zoneout=0.1, dropout=0.05)
norm_lstm_layer = haste.LayerNormLSTM(num_units=256, direction='bidirectional', zoneout=0.1, dropout=0.05)

# `x` is a tensor with shape [N,T,C]
x = tf.random.normal([5, 25, 128])

y, state = gru_layer(x, training=True)
y, state = indrnn_layer(x, training=True)
y, state = lstm_layer(x, training=True)
y, state = norm_gru_layer(x, training=True)
y, state = norm_lstm_layer(x, training=True)
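
At inference time the same layers can be called with training=False, in which case Zoneout and DropConnect are expected to fall back to their deterministic forms (this mirrors the training flag shown above; see docs/tf/haste_tf.md for the exact behavior):

y, state = lstm_layer(x, training=False)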

The TensorFlow Python API is documented in docs/tf/haste_tf.md.

PyTorch API

import torch
import haste_pytorch as haste

gru_layer = haste.GRU(input_size=128, hidden_size=256, zoneout=0.1, dropout=0.05)
indrnn_layer = haste.IndRNN(input_size=128, hidden_size=256, zoneout=0.1)
lstm_layer = haste.LSTM(input_size=128, hidden_size=256, zoneout=0.1, dropout=0.05)
norm_gru_layer = haste.LayerNormGRU(input_size=128, hidden_size=256, zoneout=0.1, dropout=0.05)
norm_lstm_layer = haste.LayerNormLSTM(input_size=128, hidden_size=256, zoneout=0.1, dropout=0.05)

gru_layer.cuda()
indrnn_layer.cuda()
lstm_layer.cuda()
norm_gru_layer.cuda()
norm_lstm_layer.cuda()

# `x` is a CUDA tensor with shape [T,N,C]
x = torch.rand([25, 5, 128]).cuda()

y, state = gru_layer(x)
y, state = indrnn_layer(x)
y, state = lstm_layer(x)
y, state = norm_gru_layer(x)
y, state = norm_lstm_layer(x)
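
For completeness, here is a minimal hypothetical training-step sketch built on the layers above. The optimizer, the toy loss, and the use of .train()/.eval() to toggle the stochastic regularization are standard PyTorch idioms layered on top of Haste, not Haste-specific API.

import torch
import haste_pytorch as haste

lstm = haste.LSTM(input_size=128, hidden_size=256, zoneout=0.1, dropout=0.05).cuda()
optimizer = torch.optim.Adam(lstm.parameters(), lr=1e-3)

x = torch.rand([25, 5, 128], device='cuda')  # [T, N, C], as in the example above

lstm.train()                                 # sample Zoneout/DropConnect masks
y, state = lstm(x)
loss = y.pow(2).mean()                       # toy loss, for illustration only

optimizer.zero_grad()
loss.backward()
optimizer.step()

lstm.eval()                                  # deterministic behavior for inference
with torch.no_grad():
    y, state = lstm(x)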

The PyTorch API is documented in docs/pytorch/haste_pytorch.md.

C++ API

The C++ API is documented in lib/haste/*.h and there are code samples in examples/.

Code layout

  • benchmarks/: programs to evaluate performance of RNN implementations
  • docs/tf/: API reference documentation for haste_tf
  • docs/pytorch/: API reference documentation for haste_pytorch
  • examples/: examples for writing your own C++ inference / training code using libhaste
  • frameworks/tf/: TensorFlow Python API and custom op code
  • frameworks/pytorch/: PyTorch API and custom op code
  • lib/: CUDA kernels and C++ API
  • validation/: scripts to validate output and gradients of RNN layers

Implementation notes

  • the GRU implementation is based on 1406.1078v1 (same as cuDNN) rather than 1406.1078v3; the two formulations differ in where the reset gate is applied (see the sketch after this list)
  • Zoneout on LSTM cells is applied to the hidden state only, and not the cell state
  • the layer-normalized LSTM implementation applies layer normalization [5] to the standard LSTM equations
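
For reference, the practical difference between the two GRU formulations mentioned above is where the reset gate enters the candidate state; a sketch in standard GRU notation (paraphrased, not copied verbatim from the papers):

\tilde{h}_t = \tanh\big(W_h x_t + r_t \odot (R_h h_{t-1} + b_R) + b_W\big)   % 1406.1078v1 / cuDNN-style: reset applied after the recurrent matmul
\tilde{h}_t = \tanh\big(W_h x_t + R_h (r_t \odot h_{t-1}) + b\big)           % 1406.1078v3: reset applied before the recurrent matmul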

References

  1. Hochreiter, S., & Schmidhuber, J. (1997). Long Short-Term Memory. Neural Computation, 9(8), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
  2. Cho, K., van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. arXiv:1406.1078 [cs, stat]. http://arxiv.org/abs/1406.1078.
  3. Wan, L., Zeiler, M., Zhang, S., Cun, Y. L., & Fergus, R. (2013). Regularization of Neural Networks using DropConnect. In International Conference on Machine Learning (pp. 1058–1066). Presented at the International Conference on Machine Learning. http://proceedings.mlr.press/v28/wan13.html.
  4. Krueger, D., Maharaj, T., Kramár, J., Pezeshki, M., Ballas, N., Ke, N. R., et al. (2017). Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations. arXiv:1606.01305 [cs]. http://arxiv.org/abs/1606.01305.
  5. Ba, J., Kiros, J.R., & Hinton, G.E. (2016). Layer Normalization. arXiv:1607.06450 [cs, stat]. https://arxiv.org/abs/1607.06450.
  6. Li, S., Li, W., Cook, C., Zhu, C., & Gao, Y. (2018). Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN. arXiv:1803.04831 [cs]. http://arxiv.org/abs/1803.04831.

Citing this work

To cite this work, please use the following BibTeX entry:

@misc{haste2020,
  title  = {Haste: a fast, simple, and open RNN library},
  author = {Sharvil Nanavati},
  year   = 2020,
  month  = "Jan",
  howpublished = {\url{https://github.com/lmnt-com/haste/}},
}

License

Apache 2.0
