# ParlAI Agent examples with PyTorch, Chainer and TensorFlow

Projects that are alternatives of or similar to Parlai agents

Carnd Vehicle Detection
A vehicle tracker based on sliding windows, HOG and an SVM
Stars: ✭ 43 (-2.27%)
Mutual labels:  jupyter-notebook
Walkwithfastai.github.io
Host for https://walkwithfastai.com
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
Attention
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
Topographica
A general-purpose neural simulator focusing on topographic maps.
Stars: ✭ 43 (-2.27%)
Mutual labels:  jupyter-notebook
Teaching Material
Teaching materials for the machine learning and deep learning classes at Stanford and Cornell
Stars: ✭ 1,022 (+2222.73%)
Mutual labels:  jupyter-notebook
Sagemaker Churn
Hands-on SageMaker lab looking at cell phone churn
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
Microsoft Dsvm
This repo contains deep learning + computer vision code I have run on the Microsoft Data Science Virtual Machine
Stars: ✭ 43 (-2.27%)
Mutual labels:  jupyter-notebook
Svhn Cnn
Classifying the Google Street View House Numbers (SVHN) dataset with a CNN
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
Bnet
Batch Normalization with Enhanced Linear Transformation
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
Midi Miner
Python MIDI track classifier and tonal tension calculation based on spiral array theory
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
Probfit
Cost function builder. For fitting distributions.
Stars: ✭ 43 (-2.27%)
Mutual labels:  jupyter-notebook
Reinforcement Learning Notebooks
A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.
Stars: ✭ 1,020 (+2218.18%)
Mutual labels:  jupyter-notebook
Big data benchmarks
Comparisons of big data technologies for cleaning, manipulating, and generally wrangling data for analysis and machine learning.
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
Repsychling
Data sets from subject/item type studies in Psychology and Linguistics
Stars: ✭ 43 (-2.27%)
Mutual labels:  jupyter-notebook
Elemnet
Deep Learning the Chemistry of Materials From Only Elemental Composition for Enhancing Materials Property Prediction
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
Cnn Encoder Nmt
Stars: ✭ 43 (-2.27%)
Mutual labels:  jupyter-notebook
Yolov5 Deepsort
Stars: ✭ 42 (-4.55%)
Mutual labels:  jupyter-notebook
Deeptutor
Spaced repetition through deep reinforcement learning
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
News push project
Real Time News Scraping and Recommendation System - React | Tensorflow | NLP | News Scrapers
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook
Fastseq
A way to use N-Beats in fastai for sequence data
Stars: ✭ 44 (+0%)
Mutual labels:  jupyter-notebook


ParlAI is a unified platform for training and evaluating dialog models across many tasks.
Currently, the following agents are implemented in this repository (a sketch of the common agent interface follows the list).

  • RNNAgent by PyTorch
  • RNNAgent by Chainer
  • RNNAgent by TensorFlow
  • AttentionAgent (seq2seq with Attention) by PyTorch
  • MemN2NAgent (End-To-End Memory Networks) by Chainer
  • MemN2NAgent (End-To-End Memory Networks) by PyTorch New!
  • SaveAgent (Save losses and attention weights)
  • visualization.ipynb (visualize valid and test results, losses and attention weights)
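
Every agent above plugs into ParlAI through the same observe/act interface. Below is a minimal sketch of that interface, assuming ParlAI's parlai.core.agents.Agent base class; EchoAgent and its trivial reply are illustrative, not code from this repository.

```python
from parlai.core.agents import Agent

class EchoAgent(Agent):
    """Toy agent that replies with the text it observed (illustrative only)."""

    def __init__(self, opt, shared=None):
        super().__init__(opt, shared)
        self.id = 'EchoAgent'

    def observe(self, observation):
        # ParlAI hands each example to the agent as a dict
        # with fields such as 'text', 'labels', and 'episode_done'.
        self.observation = observation
        return observation

    def act(self):
        # A real agent would run its model here and return its prediction.
        return {'id': self.id, 'text': self.observation.get('text', '')}
```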

The following agents will be implemented soon.

  • EntNetAgent (Recurrent Entity Networks) by PyTorch
  • RelNetAgent (Relation Networks) by PyTorch
  • GeneralDictionaryAgent (supports sub-word, SentencePiece, and character-level tokenizers)

I also wrote an introductory article about ParlAI in Japanese. Please see here.

Usage

Please download and install ParlAI first.

```bash
git clone https://github.com/facebookresearch/ParlAI.git ~/ParlAI
cd ~/ParlAI

pip install -r requirements.txt
sudo python setup.py develop
```

Then download and put this repository in ~/ParlAI/parlai/.

```bash
git clone https://github.com/ryonakamura/parlai_agents.git
mv parlai_agents ~/ParlAI/parlai/
```

Simple Agents

bAbI is a pure text-based QA dataset [Weston, 2015]. There are 20 tasks, each corresponding to a particular type of reasoning, such as deduction, induction, or counting.
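
For reference, a Task 1 (single supporting fact) example has the following form: numbered story lines, then a question line carrying the answer and the ID of the supporting fact, tab-separated.

```text
1 Mary moved to the bathroom.
2 John went to the hallway.
3 Where is Mary?	bathroom	1
```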

According to [Sukhbaatar, NIPS 2015], the mean test accuracy of an LSTM on bAbI All 10k is 63.6%.
The RNNAgent below achieves similar mean test accuracy with each of the three libraries.

bAbI Task 10k comparing PyTorch, Chainer and TensorFlow with RNN (Joint)

Note that the correct labels in Task 19 (path finding) are two words, but the RNN generates only one word.
The correct labels in Task 8 (lists/sets) also have two or more words, but the majority are one word.

The meanings of the arguments are as follows (a sketch of how an agent registers these options follows the list).

  • -m: the model class name, should match parlai.parlai_agents.<directory_name>.<file_name>:<class_name>
  • -t: ParlAI task(s), e.g. babi:Task1k:1 or babi,cbt
  • -mf: model file name for loading and saving models
  • -e: number of epochs
  • -rnn: choose GRU or LSTM
  • -bs: batch size for minibatch training schemes
  • -hs: size of the hidden layers and embeddings
  • -nl: number of hidden layers
  • -lr: learning rate
  • -dr: dropout rate
  • -ltim: log every n secs
  • -vtim: validation every n secs
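
These flags are registered by each agent itself. Here is a minimal sketch of how that registration might look, assuming ParlAI's ParlaiParser accepts argparse-style add_argument_group/add_argument calls (the option names mirror the list above; the defaults are illustrative):

```python
from parlai.core.agents import Agent

class RNNAgent(Agent):
    @staticmethod
    def add_cmdline_args(argparser):
        # Group this agent's options under one heading in --help output
        agent = argparser.add_argument_group('RNN Arguments')
        agent.add_argument('-rnn', '--rnntype', default='GRU',
                           help='choose GRU or LSTM')
        agent.add_argument('-hs', '--hiddensize', type=int, default=64,
                           help='size of the hidden layers and embeddings')
        agent.add_argument('-nl', '--numlayers', type=int, default=2,
                           help='number of hidden layers')
        agent.add_argument('-lr', '--learningrate', type=float, default=0.5,
                           help='learning rate')
        agent.add_argument('-dr', '--dropout', type=float, default=0.2,
                           help='dropout rate')
```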

I ran a simple benchmark of the speed of 1,000 parleys (iterations) on a single GPU.

| Training                  | PyTorch | Chainer | TensorFlow |
| ------------------------- | ------- | ------- | ---------- |
| Joint Training            | 96 sec  | 157 sec | 320 sec    |
| Single Training (Task 1)  | 36 sec  | 71 sec  | 64 sec     |

TensorFlow may be slow because it doesn't use truncated BPTT.

RNNAgent by PyTorch

PyTorch is an easy and powerful way to implement a ParlAI Agent.
When using a GPU, PyTorch is 1.5 to 2 times faster than Chainer.
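
A minimal sketch of the kind of GRU reader such an agent can wrap (illustrative, not the repository's exact model; the answer word is scored from the final time step):

```python
import torch
import torch.nn as nn

class RNNModel(nn.Module):
    def __init__(self, vocab_size, hidden_size=64, num_layers=2, dropout=0.2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden_size)
        self.rnn = nn.GRU(hidden_size, hidden_size, num_layers,
                          dropout=dropout, batch_first=True)
        self.out = nn.Linear(hidden_size, vocab_size)

    def forward(self, xs):
        # xs: (batch, time) token ids
        h = self.embed(xs)                 # (batch, time, hidden)
        output, hidden = self.rnn(h)       # output: (batch, time, hidden)
        return self.out(output[:, -1, :])  # score answer words
```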

```bash
cd ~/ParlAI
python examples/train_model.py -m parlai.parlai_agents.pytorch_rnn.pytorch_rnn:RNNAgent -t babi:Task1k:1 -mf './parlai/parlai_agents/pytorch_rnn/model_file/babi1' -e 20 -rnn GRU -bs 32 -hs 64 -nl 2 -lr 0.5 -dr 0.2 -ltim 2 -vtim 30
```

RNNAgent by Chainer

This implementation uses multiple inheritance from ParlAI's Agent class and Chainer's Chain class (see the sketch below).
You can choose the RNN from links.NStepGRU or links.NStepLSTM.
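
A minimal sketch of that multiple-inheritance pattern, assuming Chainer v2's init_scope and an opt dict carrying the sizes (the vocab_size key is illustrative, not the repository's exact code):

```python
import chainer
import chainer.links as L
from parlai.core.agents import Agent

class RNNAgent(Agent, chainer.Chain):
    def __init__(self, opt, shared=None):
        Agent.__init__(self, opt, shared)
        chainer.Chain.__init__(self)
        with self.init_scope():
            self.embed = L.EmbedID(opt['vocab_size'], opt['hiddensize'])
            # NStepGRU(n_layers, in_size, out_size, dropout)
            self.rnn = L.NStepGRU(opt['numlayers'], opt['hiddensize'],
                                  opt['hiddensize'], opt['dropout'])
            self.out = L.Linear(opt['hiddensize'], opt['vocab_size'])

    def batch_predict(self, xs):
        # xs: a list of Variables, one per example (variable lengths)
        hs = [self.embed(x) for x in xs]
        hy, ys = self.rnn(None, hs)  # hy: (n_layers, batch, hidden)
        return self.out(hy[-1])      # score answer words per example
```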

```bash
cd ~/ParlAI
python examples/train_model.py -m parlai.parlai_agents.chainer_rnn.chainer_rnn:RNNAgent -t babi:Task1k:1 -mf './parlai/parlai_agents/chainer_rnn/model_file/babi1' -e 20 -rnn GRU -bs 32 -hs 64 -nl 2 -lr 0.5 -dr 0.2 -ltim 2 -vtim 30
```

RNNAgent by TensorFlow

TensorFlow can handle variable-length input using tf.nn.dynamic_rnn.
dynamic_rnn unrolls the time series with an internal loop built on control_flow_ops.while_loop (the same mechanism as tf.while_loop).
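
A minimal sketch of that call with a sequence_length tensor, assuming the TensorFlow 1.x API (the placeholder shapes are illustrative):

```python
import tensorflow as tf

inputs = tf.placeholder(tf.float32, [None, None, 64])  # (batch, time, features)
lengths = tf.placeholder(tf.int32, [None])             # true length of each example
cell = tf.nn.rnn_cell.GRUCell(64)

# dynamic_rnn unrolls with a while_loop; outputs past each example's
# sequence_length are zeroed and the final state is copied through.
outputs, state = tf.nn.dynamic_rnn(cell, inputs, sequence_length=lengths,
                                   dtype=tf.float32)
```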

If you want to run with only the CPU on a GPU server, you can set the environment variable CUDA_VISIBLE_DEVICES="".

```bash
cd ~/ParlAI
python examples/train_model.py -m parlai.parlai_agents.tensorflow_rnn.tensorflow_rnn:RNNAgent -t babi:Task1k:1 -mf './parlai/parlai_agents/tensorflow_rnn/model_file/babi1' -e 20 -rnn GRU -bs 32 -hs 64 -nl 2 -lr 0.5 -dr 0.2 -ltim 2 -vtim 30
```

More Advanced Agents

AttentionAgent by PyTorch

(figure: AttentionAgent architecture)

bAbI Task 10k comparing MemN2N, Attention, seq2seq and RNN (Joint)

bAbI Task 10k comparing Bidirectional Encoder and Dropout with seq2seq (Joint)

bAbI Task 10k comparing 1, 2, 3 and 4 Layers with seq2seq (Joint)

bAbI Task 10k with Attention (Luong's General) - sample 0

The meanings of the additional arguments are as follows.

  • -bi: if True, use a bidirectional encoder for the first layer
  • -atte: if True, use Luong's attention
  • -cont: if True, use only the context vector, without the decoder query, for the final output
  • -sf: select the attention score function from dot, general, concat (a sketch of general follows this list)
  • -tf: teacher forcing ratio
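
A minimal sketch of the general score, score(h_t, h_s) = h_t^T W h_s, and the resulting context vector (illustrative shapes, not the repository's exact code):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GeneralAttention(nn.Module):
    """Luong's 'general' score: score(h_t, h_s) = h_t^T W h_s."""

    def __init__(self, hidden_size):
        super().__init__()
        self.W = nn.Linear(hidden_size, hidden_size, bias=False)

    def forward(self, query, enc_outs):
        # query: (batch, hidden); enc_outs: (batch, src_len, hidden)
        scores = torch.bmm(enc_outs, self.W(query).unsqueeze(2)).squeeze(2)
        weights = F.softmax(scores, dim=1)          # (batch, src_len)
        context = torch.bmm(weights.unsqueeze(1), enc_outs).squeeze(1)
        return context, weights                     # context: (batch, hidden)
```
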
```bash
cd ~/ParlAI
python examples/train_model.py -m parlai.parlai_agents.pytorch_attention.pytorch_attention:AttentionAgent -t babi:Task1k:1 -mf './parlai/parlai_agents/pytorch_attention/model_file/babi1' -e 20 -rnn GRU -bi True -atte True -sf general -tf 1. -bs 32 -hs 64 -nl 2 -lr 0.5 -dr 0.2 -ltim 2 -vtim 30
```

MemN2NAgent by Chainer

(figure: MemN2N architecture)

bAbI Task 10k comparing Position Encoding and Temporal Encoding with MemN2N (Joint)

bAbI Task 10k comparing Adjacent, Layer-wise and No Weight Tying with MemN2N (Joint)

bAbI Task 10k comparing Linear Start and Random Noise with MemN2N (Joint)

bAbI Task 10k comparing 1, 2, 3, 4, 5 and 6 Hops with MemN2N (Joint)

bAbI Task 10k with MemN2N

bAbI Task 10k with End-To-End Memory Network - sample 0

The meanings of the additional arguments are as follows.

  • -ms: size of the memory (both key and value)
  • -nl: number of memory layers (hops; a sketch of one hop follows this list)
  • -wt: select weight tying from Adjacent, Layer-wise, Nothing
  • -pe: if True, use Position Encoding for word embedding
  • -te: if True, use Temporal Encoding for sentence memorization
  • -rn: if True, use Random Noise to regularize TE
  • -ls: if True, use Linear Start (remove the softmax from the memory layers)
  • -opt: select the optimizer from SGD, AdaGrad, Adam
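
A minimal sketch of a single hop, following the End-To-End Memory Networks paper (u is the query and the next-hop query is u + o; shapes are illustrative, not the repository's exact code):

```python
import torch
import torch.nn.functional as F

def memory_hop(u, m_key, m_val, linear_start=False):
    # u: (batch, hidden) query; m_key, m_val: (batch, mem_size, hidden)
    scores = torch.bmm(m_key, u.unsqueeze(2)).squeeze(2)  # (batch, mem_size)
    # Linear Start (-ls) removes the softmax over the memory
    p = scores if linear_start else F.softmax(scores, dim=1)
    o = torch.bmm(p.unsqueeze(1), m_val).squeeze(1)       # (batch, hidden)
    return u + o                                          # next-hop query
```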

Chainer version

```bash
cd ~/ParlAI
python examples/train_model.py -m parlai.parlai_agents.chainer_memn2n.chainer_memn2n:MemN2NAgent -t babi:Task1k:1 -mf './parlai/parlai_agents/chainer_memn2n/model_file/babi1' -e 100 -bs 32 -hs 20 -ms 50 -nl 3 -wt Adjacent -pe True -te True -rn True -ls False -opt Adam -lr 0.05 -ltim 2 -vtim 60 -vp -1
```

PyTorch version

```bash
cd ~/ParlAI
python examples/train_model.py -m parlai.parlai_agents.pytorch_memn2n.pytorch_memn2n:MemN2NAgent -t babi:Task1k:1 -mf './parlai/parlai_agents/pytorch_memn2n/model_file/babi1' -e 100 -bs 32 -hs 20 -ms 50 -nl 3 -wt Adjacent -pe True -te True -rn True -ls False -opt Adam -lr 0.05 -ltim 2 -vtim 60 -vp -1
```

Outperforming the Results of the Paper!

Benchmark results comparing this repo's implementation, the author's original Matlab code, and the paper's reported numbers on the bAbI tasks.

Default Configuration: 3 Hops, Position Encoding (PE), Temporal Encoding (TE), Linear Start (LS), Random Noise (RN) and Adjacent Weight Tying.

  • this repo: no weight tying, no Linear Start, 5 hops, hidden size 128, and Adam (with annealing). Linear Start failed in this implementation. (trained only once)
  • matlab: check it
  • paper: repeated 10 times with different random initializations, and picked the run with the lowest training error. check it

bAbI Task 10k comparing This Repo, Author's Matlab and Paper with MemN2N (Joint)

The bAbI paper considers a task successfully passed if ≥ 95% accuracy is obtained.

In the best results of the paper, 14/20 tasks succeeded.
The best settings in this repo succeeded in 15/20 tasks!

Visualize Position Encoding

(figure: Position Encoding visualization)
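
The matrix being visualized is the Position Encoding from Sukhbaatar et al. (2015), l_kj = (1 - j/J) - (k/d)(1 - 2j/J) for a sentence of J words and embedding size d. A minimal sketch:

```python
import numpy as np

def position_encoding(J, d):
    # l[j, k] = (1 - j/J) - (k/d) * (1 - 2*j/J), 1-indexed as in the paper
    j = np.arange(1, J + 1)[:, None]  # word position within the sentence
    k = np.arange(1, d + 1)[None, :]  # embedding dimension
    return (1 - j / J) - (k / d) * (1 - 2 * j / J)  # shape (J, d)
```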

Other Agents

SaveAgent

Save losses and attention weights.

The meanings of the arguments are as follows.

  • -sltim, --save-loss-every-n-secs: interval in seconds between saving losses
  • -sae, --save-attention-exs: maximum number of examples for which to save attention weights

Contact

If you have any questions, please do not hesitate to contact me or open an issue on my GitHub Issues page.

  • @_Ryobot on Twitter
  • nakamura.ryo.nm8[at]is.naist.jp

License

All codes in this repository are BSD-licensed.
