
DeepLearnXMU / ABD-NMT

Licence: other
Code for "Asynchronous bidirectional decoding for neural machine translation" (AAAI, 2018)

Programming Languages

  • python (139,335 projects; #7 most used programming language)
  • perl (6,916 projects)
  • shell (77,523 projects)

Projects that are alternatives of or similar to ABD-NMT

Modernmt
Neural Adaptive Machine Translation that adapts to context and learns from corrections.
Stars: ✭ 231 (+621.88%)
Mutual labels:  neural-machine-translation
TS3000 TheChatBOT
It's a social networking chat-bot trained on a Reddit dataset. It supports open-ended queries and is built on the concept of Neural Machine Translation. Beware of its being sarcastic, just like its creator 😝. BTW, it uses the PyTorch framework and Python 3.
Stars: ✭ 20 (-37.5%)
Mutual labels:  neural-machine-translation
sentencepiece-jni
Java JNI wrapper for SentencePiece: unsupervised text tokenizer for Neural Network-based text generation.
Stars: ✭ 26 (-18.75%)
Mutual labels:  neural-machine-translation
Good Papers
I try my best to keep up to date with cutting-edge work in Machine Learning/Deep Learning and Natural Language Processing. These are my notes on some good papers.
Stars: ✭ 248 (+675%)
Mutual labels:  neural-machine-translation
bergamot-translator
Cross-platform C++ library focused on optimized machine translation on consumer-grade devices.
Stars: ✭ 181 (+465.63%)
Mutual labels:  neural-machine-translation
Word-Level-Eng-Mar-NMT
Translating English sentences to Marathi using Neural Machine Translation
Stars: ✭ 37 (+15.63%)
Mutual labels:  neural-machine-translation
Coursera Deep Learning Specialization
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv) Convolutional Neural Networks; (v) Sequence Models
Stars: ✭ 188 (+487.5%)
Mutual labels:  neural-machine-translation
NiuTrans.NMT
A fast Neural Machine Translation system, developed in C++ and relying on NiuTensor for fast tensor APIs.
Stars: ✭ 112 (+250%)
Mutual labels:  neural-machine-translation
DCGCN
Densely Connected Graph Convolutional Networks for Graph-to-Sequence Learning (authors' MXNet implementation for the TACL19 paper)
Stars: ✭ 73 (+128.13%)
Mutual labels:  neural-machine-translation
MT-Preparation
Machine Translation (MT) Preparation Scripts
Stars: ✭ 15 (-53.12%)
Mutual labels:  neural-machine-translation
Pytorch Seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
Stars: ✭ 3,418 (+10581.25%)
Mutual labels:  neural-machine-translation
bytenet translation
A TensorFlow implementation of the machine translation model from "Neural Machine Translation in Linear Time"
Stars: ✭ 60 (+87.5%)
Mutual labels:  neural-machine-translation
2018-dlsl
UPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-43.75%)
Mutual labels:  neural-machine-translation
Tensorflow Shakespeare
Neural machine translation between the writings of Shakespeare and modern English using TensorFlow
Stars: ✭ 244 (+662.5%)
Mutual labels:  neural-machine-translation
dynmt-py
Neural machine translation implementation using dynet's python bindings
Stars: ✭ 17 (-46.87%)
Mutual labels:  neural-machine-translation
Opennmt
Open Source Neural Machine Translation in Torch (deprecated)
Stars: ✭ 2,339 (+7209.38%)
Mutual labels:  neural-machine-translation
Neural-Machine-Translation
Several basic neural machine translation models implemented in PyTorch & TensorFlow
Stars: ✭ 29 (-9.37%)
Mutual labels:  neural-machine-translation
minimal-nmt
A minimal NMT example to serve as a seq2seq+attention reference.
Stars: ✭ 36 (+12.5%)
Mutual labels:  neural-machine-translation
zero
Zero -- A neural machine translation system
Stars: ✭ 121 (+278.13%)
Mutual labels:  neural-machine-translation
parallel-corpora-tools
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (+9.38%)
Mutual labels:  neural-machine-translation

Asynchronous Bidirectional Decoding for Neural Machine Translation

This codebase contains all scripts, except the training corpus, needed to reproduce our results in the paper.

Installation

The following packages are needed:

  • Python >= 2.7
  • numpy
  • Theano >= 0.7 (and its dependencies).
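
A minimal environment sketch (the exact version pins are illustrative; the codebase targets Python 2.7-era Theano):

pip install numpy
pip install "Theano>=0.7"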

Preparation

First, preprocess your training corpus. For English-German translation, use BPE (byte-pair encoding) to segment text into subword units; see https://github.com/rsennrich/subword-nmt for details.
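
A typical invocation of the subword-nmt scripts, as a sketch (the merge-operation count and the choice of a joint code are illustrative, not prescribed here):

# learn a joint BPE code on the concatenated training data
cat /path/to/train.en /path/to/train.de | python subword-nmt/learn_bpe.py -s 32000 > /path/to/BPE/codes
# apply the code to both sides of the corpus (and likewise to the dev/test sets)
python subword-nmt/apply_bpe.py -c /path/to/BPE/codes < /path/to/train.en > /path/to/BPE/train.en
python subword-nmt/apply_bpe.py -c /path/to/BPE/codes < /path/to/train.de > /path/to/BPE/train.de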

To build the vocabularies for training, run:

python scripts/buildvocab.py --corpus /path/to/train.zh --output /path/to/zh.voc3.pkl \
--limit 30000 --groundhog
python scripts/buildvocab.py --corpus /path/to/train.en --output /path/to/en.voc3.pkl \
--limit 30000 --groundhog

Vocabularies for English-German translation can be built in the same way on the BPE-processed corpus.
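
For example, a sketch (the --limit value is illustrative; the voc5 filenames match the training commands below):

python scripts/buildvocab.py --corpus /path/to/BPE/train.en --output /path/to/BPE/en.voc5.pkl \
--limit 50000 --groundhog
python scripts/buildvocab.py --corpus /path/to/BPE/train.de --output /path/to/BPE/de.voc5.pkl \
--limit 50000 --groundhog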

It is also preferred, though not required, to initialize the encoder-backward decoder component of the proposed model with pretrained parameters.
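
The pretrained models used below (r.zhen.best.pkl, r.ende.best.pkl) run from right to left; one common way to prepare training data for such a model, sketched here with a shell one-liner that is not part of this codebase, is to reverse the token order of every target sentence and train a standard model on the result:

awk '{ for (i = NF; i > 1; i--) printf "%s ", $i; print $1 }' /path/to/train.en > /path/to/train.reversed.en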

Training

For the Chinese-English experiment, run:

python -u rnnsearch.py train \
--corpus /path/to/train.zh /path/to/train.en \
--vocab /path/to/zh.voc3.pkl /path/to/en.voc3.pkl \
--method concat \
--softk 1 \
--lambda 0.5 \
--ext-val-script scripts/validate-zhen.sh \
--model zhen \
--embdim 620 620 \
--hidden 1000 1000 1000 \
--maxhid 500 \
--deephid 620 \
--maxpart 2 \
--alpha 5e-4 \
--norm 1.0 \
--batch 80 \
--maxepoch 5 \
--seed 1235 \
--freq 500 \
--vfreq 1500 \
--sfreq 500 \
--sort 32 \
--validation /path/to/nist/nist02.src \
--references /path/to/nist/nist02.ref0 /path/to/nist/nist02.ref1 /path/to/nist/nist02.ref2 /path/to/nist/nist02.ref3 \
--optimizer rmsprop \
--shuffle 1 \
--keep-prob 0.7 \
--limit 50 50 \
--delay-val 1 \
--initialize /path/to/models/r.zhen.best.pkl \
> log.zhen 2>&1 &

where /path/to/models/r.zhen.best.pkl is a pretrained model that translates from right to left. Training takes about two days on a single NVIDIA Titan X GPU.
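
Because the command runs in the background with all output redirected, progress can be followed with:

tail -f log.zhen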

For the English-German experiment, run:

python -u rnnsearch.py train \
--corpus /path/to/BPE/train.en /path/to/BPE/train.de \
--vocab /path/to/BPE/en.voc5.pkl /path/to/BPE/de.voc5.pkl \
--method concat \
--softk 1 \
--lambda 0.5 \
--ext-val-script scripts/validate-deen-bpe.sh \
--model ende \
--embdim 620 620 \
--hidden 1000 1000 1000 \
--maxhid 500 \
--deephid 620 \
--maxpart 2 \
--alpha 5e-4 \
--norm 1.0 \
--batch 80 \
--maxepoch 5 \
--seed 1234 \
--freq 500 \
--vfreq 2500 \
--sfreq 500 \
--sort 32 \
--validation /path/to/BPE/newstest2013.en \
--references /path/to/newstest2013.tc.de \
--optimizer rmsprop \
--shuffle 1 \
--keep-prob 0.7 \
--limit 50 50 \
--delay-val 1 \
--initialize /path/to/models/r.ende.best.pkl \
> log.ende 2>&1 &

where /path/to/models/r.ende.best.pkl is a pretrained model that translates from right to left. Training takes about seven days on the same device.

Evaluation

For English-German, the evaluation metric we use is case-sensitive BLEU on tokenized references. Translate the test set and restore the original word segmentation:

python rnnsearch.py translate --model ende.best.pkl < /path/to/BPE/newstest2015.en \
| scripts/restore_bpe.sh > newstest2015.de.trans
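
scripts/restore_bpe.sh presumably removes the BPE continuation markers; a typical equivalent one-liner is:

sed -r 's/(@@ )|(@@ ?$)//g'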

Then compute BLEU by running:

perl scripts/multi-bleu.perl /path/to/newstest2015.tc.de < newstest2015.de.trans

For Chinese-English evaluation, we use case-insensitive BLEU: run multi-bleu.perl with the -lc option on the NIST test sets.
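
The hypotheses file nist03.src.trans used below can be produced with the same translate subcommand; a sketch, assuming the Chinese-English model was saved as zhen.best.pkl:

python rnnsearch.py translate --model zhen.best.pkl < /path/to/nist/nist03.src > nist03.src.trans

Then score the output: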

perl scripts/multi-bleu.perl -lc /path/to/nist03.ref? < nist03.src.trans