
HLTCHKUST / Mem2seq

License: MIT
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems

Programming Languages

python
python

Projects that are alternatives of or similar to Mem2seq

deeplearning-papernotes
Summaries of papers on NLP, deep learning, and dialogue agents
Stars: ✭ 17 (-94.86%)
Mutual labels:  dialogue-systems
dialogue-datasets
Collects open dialog corpora and some useful data-processing utils.
Stars: ✭ 24 (-92.75%)
Mutual labels:  dialogue-systems
Dstc8 Schema Guided Dialogue
The Schema-Guided Dialogue Dataset
Stars: ✭ 277 (-16.31%)
Mutual labels:  dialogue-systems
DlgSystem
Dialogue Plugin System for Unreal Engine | 🪞 Mirror of https://bit.ly/DlgSource
Stars: ✭ 136 (-58.91%)
Mutual labels:  dialogue-systems
YarnGdx
YarnGdx is a Libgdx Library for interactive dialogue in games! This is a port of [YarnSpinner](https://github.com/thesecretlab/YarnSpinner) by thesecretlab
Stars: ✭ 25 (-92.45%)
Mutual labels:  dialogue-systems
Posterior-Knowledge-Selection
Learning to Select Knowledge for Response Generation in Dialog Systems
Stars: ✭ 32 (-90.33%)
Mutual labels:  dialogue-systems
ADEM
Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses
Stars: ✭ 25 (-92.45%)
Mutual labels:  dialogue-systems
Gonorth
GoNorth is a story and content planning tool for RPGs and other open world games.
Stars: ✭ 289 (-12.69%)
Mutual labels:  dialogue-systems
SpaceFusion
NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation"
Stars: ✭ 73 (-77.95%)
Mutual labels:  dialogue-systems
Merino
Merino is a narrative design tool that lets you write Yarn scripts inside the Unity Editor
Stars: ✭ 275 (-16.92%)
Mutual labels:  dialogue-systems
HINT3
This repository contains datasets and code for the paper "HINT3: Raising the Bar for Intent Detection in the Wild", accepted at EMNLP 2020's Insights Workshop (https://insights-workshop.github.io/). The preprint is available at https://arxiv.org/abs/2009.13833
Stars: ✭ 27 (-91.84%)
Mutual labels:  dialogue-systems
permuted-bAbI-dialog-tasks
Dataset for 'Learning End-to-End Goal-Oriented Dialog with Multiple Answers' EMNLP 2018
Stars: ✭ 17 (-94.86%)
Mutual labels:  dialogue-systems
Dialog Generation Paper
A list of recent papers regarding dialogue generation
Stars: ✭ 265 (-19.94%)
Mutual labels:  dialogue-systems
explicit memory tracker
[ACL 2020] Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading
Stars: ✭ 35 (-89.43%)
Mutual labels:  dialogue-systems
Unit Dmkit
Stars: ✭ 279 (-15.71%)
Mutual labels:  dialogue-systems
distributed-architecture-of-moba-game-server
A 3v3 team-battle game built on a distributed server architecture
Stars: ✭ 31 (-90.63%)
Mutual labels:  dialogue-systems
TEXTOIR
TEXTOIR is a flexible toolkit for open intent detection and discovery. (ACL 2021)
Stars: ✭ 31 (-90.63%)
Mutual labels:  dialogue-systems
Rakugo Archive
Framework (inspired by Ren'Py) for story driven games in Godot.
Stars: ✭ 291 (-12.08%)
Mutual labels:  dialogue-systems
Neuraldialog Cvae
Tensorflow Implementation of Knowledge-Guided CVAE for dialog generation ACL 2017. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
Stars: ✭ 279 (-15.71%)
Mutual labels:  dialogue-systems
Nodebaseddialoguesystem
Node Based Dialogue System for Unity
Stars: ✭ 269 (-18.73%)
Mutual labels:  dialogue-systems

Mem2Seq

Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems. Andrea Madotto, Chien-Sheng Wu, and Pascale Fung. Accepted at ACL 2018 ([PDF] in the ACL Anthology). Andrea Madotto and Chien-Sheng Wu contributed equally to this work.

This code was written using PyTorch 0.3; we will update it to PyTorch 0.4 soon.

If you use any source code or datasets included in this toolkit in your work, please cite the following paper. The BibTeX entry is listed below:

@InProceedings{P18-1136,
  author    = "Madotto, Andrea and Wu, Chien-Sheng and Fung, Pascale",
  title     = "Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems",
  booktitle = "Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
  year      = "2018",
  publisher = "Association for Computational Linguistics",
  pages     = "1468--1478",
  location  = "Melbourne, Australia",
  url       = "http://aclweb.org/anthology/P18-1136"
}

Mem2Seq in PyTorch

In this repository we implement Mem2Seq and several baselines in PyTorch (version 0.3). To make the code more reusable, we divided each model into a separate file (there is obviously a large code overlap). In the models folder you can find the following:

  • Mem2Seq: Memory to Sequence (Our model)
  • Seq2Seq: Vanilla seq2seq model with no attention (enc_vanilla)
  • +Attn: Luong attention model
  • Ptr-Unk: combination of Bahdanau attention and Pointer Networks (points to UNK words)

All of these files share the same structure: a class that builds an encoder and a decoder, and provides training and validation methods (all inside the class).
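As a rough, hypothetical sketch of that structure (written in modern PyTorch rather than the 0.3 used here, with illustrative names that are not the repository's actual API):

import torch
import torch.nn as nn

class ExampleSeq2Seq(nn.Module):
    # Hypothetical skeleton of a model file in models/: the class builds
    # an encoder and a decoder and exposes a training step on a batch.
    def __init__(self, vocab_size, hidden_size, lr=0.001):
        super(ExampleSeq2Seq, self).__init__()
        self.embedding = nn.Embedding(vocab_size, hidden_size)
        self.encoder = nn.GRU(hidden_size, hidden_size, batch_first=True)
        self.decoder = nn.GRU(hidden_size, hidden_size, batch_first=True)
        self.out = nn.Linear(hidden_size, vocab_size)
        self.criterion = nn.CrossEntropyLoss()
        self.optimizer = torch.optim.Adam(self.parameters(), lr=lr)

    def train_batch(self, src, tgt):
        # One optimization step: encode the input, teacher-force the
        # decoder on tgt[:-1], and predict the shifted targets tgt[1:].
        self.optimizer.zero_grad()
        _, hidden = self.encoder(self.embedding(src))
        dec_out, _ = self.decoder(self.embedding(tgt[:, :-1]), hidden)
        logits = self.out(dec_out)
        loss = self.criterion(logits.reshape(-1, logits.size(-1)),
                              tgt[:, 1:].reshape(-1))
        loss.backward()
        self.optimizer.step()
        return loss.item()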

Import data

Under the utils folder, we have the scripts to import and batch the data for each dataset.
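For instance, the bAbI dialog files store one turn per line as "<id> <user utterance>TAB<system response>", with KB entries on tab-less lines and a blank line separating dialogs. A hypothetical reader sketch (the real readers live in utils/ and may differ):

def read_babi_dialogs(path):
    # Hypothetical sketch, not the repository's actual reader: collect
    # (context, response) pairs from a bAbI dialog file.
    pairs, context = [], []
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line:                  # blank line: a new dialog starts
                context = []
                continue
            _, turn = line.split(' ', 1)  # drop the leading turn id
            if '\t' in turn:              # user utterance TAB system response
                user, system = turn.split('\t')
                context.append(user)
                pairs.append((' '.join(context), system))
                context.append(system)
            else:                         # KB entry: keep it in the context
                context.append(turn)
    return pairs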

Basic example

Mem2Seq can be considered a general sequence-to-sequence model with the ability to address external memories. We prepared a very basic implementation (including data preprocessing and the model) for an English-to-French translation task. Obviously there is not much to copy from the input in this small corpus, so it is just meant to show how the model works on a general sequence-to-sequence task. Run:

❱❱❱ python3 main_nmt.py

This version uses a flat memory instead of the triples described in the paper.
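To make the distinction concrete: for the dialog tasks the paper stores each KB entry as a (subject, relation, object) triple that the decoder can attend to and copy from, while this toy NMT version just stores the source sentence word by word. A schematic with illustrative values, not the repo's actual preprocessing:

# Flat memory for the toy NMT task: one source word per memory cell.
flat_memory = ["the", "cat", "sits", "on", "the", "mat"]

# Triple-style memory for task-oriented dialog, as in the paper:
# KB entries become (subject, relation, object) triples.
triple_memory = [
    ("pizza_hut", "cuisine", "italian"),
    ("pizza_hut", "address", "main_street"),
]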

Train a model for task-oriented dialog datasets

We created main_train.py to train models. Note the expression globals()[args['decoder']]: it converts the string passed via -dec into the model class of the same name (a minimal illustration follows the In-Car commands below). To train a model, run one of the following. Mem2Seq on bAbI tasks 1-6:

❱❱❱ python3 main_train.py -lr=0.001 -layer=1 -hdd=128 -dr=0.2 -dec=Mem2Seq -bsz=8 -ds=babi -t=1 
❱❱❱ python3 main_train.py -lr=0.001 -layer=1 -hdd=128 -dr=0.2 -dec=VanillaSeqToSeq -bsz=8 -ds=babi -t=1
❱❱❱ python3 main_train.py -lr=0.001 -layer=1 -hdd=128 -dr=0.2 -dec=LuongSeqToSeq -bsz=8 -ds=babi -t=1
❱❱❱ python3 main_train.py -lr=0.001 -layer=1 -hdd=128 -dr=0.2 -dec=PTRUNK -bsz=8 -ds=babi -t=1

or Mem2Seq on In-Car:

❱❱❱ python3 main_train.py -lr=0.001 -layer=1 -hdd=128 -dr=0.2 -dec=Mem2Seq -bsz=8 -ds=kvr -t=
❱❱❱ python3 main_train.py -lr=0.001 -layer=1 -hdd=128 -dr=0.2 -dec=VanillaSeqToSeq -bsz=8 -ds=kvr -t=
❱❱❱ python3 main_train.py -lr=0.001 -layer=1 -hdd=128 -dr=0.2 -dec=LuongSeqToSeq -bsz=8 -ds=kvr -t=
❱❱❱ python3 main_train.py -lr=0.001 -layer=1 -hdd=128 -dr=0.2 -dec=PTRUNK -bsz=8 -ds=kvr -t=
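
As promised above, a minimal, self-contained illustration of the globals()[...] idiom (the class and arguments here are placeholders):

class Mem2Seq:
    def __init__(self, hdd):
        self.hdd = hdd

args = {'decoder': 'Mem2Seq'}   # normally parsed from the -dec flag

# globals() maps the names defined in this module to their objects, so
# a command-line string selects which model class gets instantiated.
ModelClass = globals()[args['decoder']]
model = ModelClass(hdd=128)
print(type(model).__name__)     # Mem2Seq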

The options you can choose are:

  • -t task ID; this is dataset dependent: 1-6 for bAbI, empty for In-Car
  • -ds which dataset to use (babi or kvr)
  • -dec which model to use. The options are: Mem2Seq, VanillaSeqToSeq, LuongSeqToSeq, PTRUNK
  • -hdd hidden state size of the two RNNs
  • -bsz batch size
  • -lr learning rate
  • -dr dropout rate
  • -layer number of stacked RNN layers, or number of hops for Mem2Seq

While training, the model with the best validation performance is saved. If you want to reuse a model, add -path=path_name_model to the call. Models are evaluated using per-response accuracy, WER, F1, and BLEU.
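For reference, per-response accuracy is simply the fraction of generated responses that exactly match the gold response. A minimal sketch (the repository's scorer may tokenize or normalize differently):

def per_response_accuracy(predictions, references):
    # Exact-match accuracy over whole responses; a sketch only.
    hits = sum(p.strip() == r.strip()
               for p, r in zip(predictions, references))
    return hits / len(references)

print(per_response_accuracy(
    ["here it is", "what cuisine?"],
    ["here it is", "which cuisine do you prefer?"]))  # 0.5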

Memory Access Visualization

Notes

For the hyper-parameter search of Mem2Seq, our suggestions are (a hypothetical sweep script follows this list):

  • Use a higher dropout rate (dr >= 0.2) and a larger hidden size (hdd >= 256) to get better performance when training with a small number of hops (H <= 3).
  • When training Mem2Seq with more hops (H > 3), it may perform better with a smaller hidden size (hdd < 256) and a higher dropout rate.
  • Since there is some variance between runs, it is better to run several times, or with different seeds, to get the best performance.
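
One way to act on these suggestions is a small grid sweep over the flags documented above; a hypothetical driver (the flag values are illustrative):

import itertools
import subprocess

# Hypothetical sweep: larger hdd with higher dr at small hop counts,
# following the suggestions above.
for dr, hdd, hops in itertools.product([0.2, 0.3], [128, 256], [1, 3]):
    subprocess.run([
        "python3", "main_train.py",
        "-lr=0.001", "-layer=%d" % hops, "-hdd=%d" % hdd,
        "-dr=%.1f" % dr, "-dec=Mem2Seq", "-bsz=8", "-ds=babi", "-t=1",
    ], check=True)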

Enjoy! :)
