
jasonwu0731 / trade-dst

Source code for the Transferable Dialogue State Generator (TRADE; Wu et al., 2019). https://arxiv.org/abs/1905.08743

Programming Languages

Python

Projects that are alternatives of or similar to trade-dst

Multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
Stars: ✭ 384 (+33.8%)
Mutual labels:  dialogue, natural-language-processing, seq2seq
Nndial
NNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 332 (+15.68%)
Mutual labels:  dialogue, natural-language-processing
Seq2seq Chatbot For Keras
This repository contains a generative chatbot model based on seq2seq modeling.
Stars: ✭ 322 (+12.2%)
Mutual labels:  dialogue, seq2seq
Rnnlg
RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 487 (+69.69%)
Mutual labels:  dialogue, natural-language-processing
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+777.35%)
Mutual labels:  natural-language-processing, seq2seq
Pytorch Beam Search Decoding
PyTorch implementation of beam search decoding for seq2seq models
Stars: ✭ 204 (-28.92%)
Mutual labels:  natural-language-processing, seq2seq
Nlp Progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Stars: ✭ 19,518 (+6700.7%)
Mutual labels:  dialogue, natural-language-processing
Practical Pytorch
Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained
Stars: ✭ 4,329 (+1408.36%)
Mutual labels:  natural-language-processing, seq2seq
Dialogue Understanding
This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
Stars: ✭ 77 (-73.17%)
Mutual labels:  dialogue, natural-language-processing
Msr Nlp Projects
This is a list of open-source projects at Microsoft Research NLP Group
Stars: ✭ 92 (-67.94%)
Mutual labels:  dialogue, natural-language-processing
Tod Bert
Pre-Trained Models for ToD-BERT
Stars: ✭ 143 (-50.17%)
Mutual labels:  dialogue, natural-language-processing
Question generation
A question-generation model that takes text and an answer as input and outputs a question.
Stars: ✭ 166 (-42.16%)
Mutual labels:  natural-language-processing, seq2seq
Transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+19322.3%)
Mutual labels:  natural-language-processing, seq2seq
Text summarization with tensorflow
Implementation of a seq2seq model for summarization of textual data. Demonstrated on Amazon reviews, GitHub issues, and news articles.
Stars: ✭ 226 (-21.25%)
Mutual labels:  natural-language-processing, seq2seq
Ai Writer data2doc
PyTorch implementation of an NBA game summary generator.
Stars: ✭ 69 (-75.96%)
Mutual labels:  natural-language-processing, seq2seq
SequenceToSequence
A seq2seq-with-attention dialogue/MT model implemented in TensorFlow.
Stars: ✭ 11 (-96.17%)
Mutual labels:  dialogue, seq2seq
Tf Seq2seq
Sequence to sequence learning using TensorFlow.
Stars: ✭ 387 (+34.84%)
Mutual labels:  natural-language-processing, seq2seq
Nlg Eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (+186.41%)
Mutual labels:  dialogue, natural-language-processing
Tgen
Statistical NLG for spoken dialogue systems
Stars: ✭ 179 (-37.63%)
Mutual labels:  dialogue, seq2seq
CVAE Dial
CVAE_XGate model in paper "Xu, Dusek, Konstas, Rieser. Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Diversity"
Stars: ✭ 16 (-94.43%)
Mutual labels:  dialogue, seq2seq

TRADE: Multi-Domain and Unseen-Domain Dialogue State Tracking

This is the PyTorch implementation of the paper: Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems. Chien-Sheng Wu, Andrea Madotto, Ehsan Hosseini-Asl, Caiming Xiong, Richard Socher and Pascale Fung. ACL 2019. [PDF]

This code was written using PyTorch >= 1.0. If you use any source code or datasets from this toolkit in your work, please cite the following paper. The BibTeX is listed below:

@InProceedings{WuTradeDST2019,
  author    = "Wu, Chien-Sheng and Madotto, Andrea and Hosseini-Asl, Ehsan and Xiong, Caiming and Socher, Richard and Fung, Pascale",
  title     = "Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems",
  booktitle = "Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
  year      = "2019",
  publisher = "Association for Computational Linguistics"
}

Abstract

Over-dependence on domain ontology and lack of knowledge sharing across domains are two practical and yet less studied problems of dialogue state tracking. Existing approaches generally fall short in tracking unknown slot values during inference and often have difficulties in adapting to new domains. In this paper, we propose a Transferable Dialogue State Generator (TRADE) that generates dialogue states from utterances using a copy mechanism, facilitating knowledge transfer when predicting (domain, slot, value) triplets not encountered during training. Our model is composed of an utterance encoder, a slot gate, and a state generator, which are shared across domains. Empirical results demonstrate that TRADE achieves state-of-the-art joint goal accuracy of 48.62% for the five domains of MultiWOZ, a human-human dialogue dataset. In addition, we show its transferring ability by simulating zero-shot and few-shot dialogue state tracking for unseen domains. TRADE achieves 60.58% joint goal accuracy in one of the zero-shot domains, and is able to adapt to few-shot cases without forgetting already trained domains.

Model Architecture

The architecture of the proposed TRADE model includes (a) an utterance encoder, (b) a state generator, and (c) a slot gate, all of which are shared across domains. The state generator decodes J times independently, once for each possible (domain, slot) pair. At the first decoding step, the state generator takes the j-th (domain, slot) embedding as input to generate the corresponding slot value, and the slot gate predicts whether the j-th (domain, slot) pair is triggered by the dialogue.
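Below is a minimal PyTorch sketch of this decoding scheme. It is not the authors' implementation: the class, variable names, shapes, and the greedy decoding loop are illustrative assumptions. It shows the three shared components: the utterance encoder, the copy-augmented state generator, and the slot gate computed from the first-step attention context.

import torch
import torch.nn as nn
import torch.nn.functional as F

class TradeSketch(nn.Module):
    """Illustrative sketch of TRADE's shared encoder, slot gate, and state generator."""
    def __init__(self, vocab_size, hidden, num_gates=3):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, hidden)
        self.encoder = nn.GRU(hidden, hidden, batch_first=True)
        self.decoder = nn.GRUCell(hidden, hidden)
        self.gate = nn.Linear(hidden, num_gates)        # e.g. ptr / dontcare / none
        self.vocab_proj = nn.Linear(hidden, vocab_size)
        self.copy_gate = nn.Linear(hidden * 3, 1)       # mixes generating vs. copying

    def forward(self, utter_ids, slot_emb, max_len=5):
        # Encode the dialogue history once; it is shared by all (domain, slot) pairs.
        enc_out, h = self.encoder(self.embedding(utter_ids))          # (B, T, H)
        hidden = h.squeeze(0)                                         # (B, H)
        inp = slot_emb                     # the j-th (domain, slot) embedding, (B, H)
        gates, probs = None, []
        for t in range(max_len):
            hidden = self.decoder(inp, hidden)
            # Attention over encoder states gives a context vector and copy weights.
            scores = torch.bmm(enc_out, hidden.unsqueeze(2)).squeeze(2)   # (B, T)
            attn = F.softmax(scores, dim=1)
            context = torch.bmm(attn.unsqueeze(1), enc_out).squeeze(1)    # (B, H)
            if t == 0:
                # The slot gate is predicted from the first-step context vector.
                gates = self.gate(context)                                # (B, num_gates)
            # Soft-gated mixture of generating from the vocabulary and
            # copying words from the dialogue history.
            p_vocab = F.softmax(self.vocab_proj(hidden), dim=1)           # (B, V)
            p_gen = torch.sigmoid(self.copy_gate(torch.cat([hidden, context, inp], 1)))
            p_final = (p_gen * p_vocab).scatter_add(1, utter_ids, (1 - p_gen) * attn)
            probs.append(p_final)
            inp = self.embedding(p_final.argmax(1))   # greedy input for the next step
        return gates, torch.stack(probs, dim=1)       # (B, num_gates), (B, max_len, V)

# Illustrative usage with random data:
model = TradeSketch(vocab_size=1000, hidden=64)
utter = torch.randint(0, 1000, (2, 12))   # a batch of 2 tokenized dialogue histories
slot = torch.randn(2, 64)                 # one (domain, slot) embedding per example
gates, probs = model(utter, slot)
print(gates.shape, probs.shape)           # torch.Size([2, 3]) torch.Size([2, 5, 1000])

In the real model this forward pass runs once per (domain, slot) pair (J times in total); the parallel-decoding update described below batches all pairs together.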

Data

Download the MultiWOZ dataset and create the processed DST version by running:

❱❱❱ python3 create_data.py

An example of multi-domain dialogue state tracking in a conversation: the solid arrows on the left are the single-turn mapping, and the dotted arrows on the right are the multi-turn mapping. The state tracker needs to track the slot values mentioned by the user for all slots in all domains.
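Once create_data.py finishes, it is worth sanity-checking the processed dialogues. A minimal sketch follows, assuming the script writes a JSON list of dialogues; the file name train_dials.json and the field names below are assumptions, not guaranteed by the repo:

import json

# Hypothetical output path and schema; adjust to what create_data.py
# actually produces in your checkout.
with open("data/train_dials.json") as f:
    dialogues = json.load(f)

dial = dialogues[0]
print(dial.get("domains"))                  # e.g. ["hotel", "taxi"]
for turn in dial.get("dialogue", [])[:2]:
    print(turn.get("transcript"))           # user utterance
    print(turn.get("belief_state"))         # accumulated (domain-slot, value) labels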

Dependency

Check the required packages, or simply run:

❱❱❱ pip install -r requirements.txt

If you run into an error related to Cython, try upgrading it first:

❱❱❱ pip install --upgrade cython

Multi-Domain DST

Training

❱❱❱ python3 myTrain.py -dec=TRADE -bsz=32 -dr=0.2 -lr=0.001 -le=1

Testing

❱❱❱ python3 myTest.py -path=${save_path}
  • -bsz: batch size
  • -dr: dropout ratio
  • -lr: learning rate
  • -le: load pretrained embeddings
  • -path: path to the saved model

[2019.08 Update] The decoder can now generate all (domain, slot) pairs in one batch at the same time to speed up decoding. Set the flag "--parallel_decode=1" to decode all (domain, slot) pairs in one batch, as shown below.
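For example (the flag spelling follows the note above; whether myTest.py accepts it in addition to myTrain.py is an assumption):

❱❱❱ python3 myTest.py -path=${save_path} --parallel_decode=1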

Unseen Domain DST

Zero-Shot DST

Training

❱❱❱ python3 myTrain.py -dec=TRADE -bsz=32 -dr=0.2 -lr=0.001 -le=1 -exceptd=${domain}

Testing

❱❱❱ python3 myTest.py -path=${save_path} -exceptd=${domain}
  • -exceptd: the held-out (excepted) domain; choose one of {hotel, train, attraction, restaurant, taxi}. See the concrete example below.
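For example, to hold out the hotel domain during training and then evaluate on it zero-shot:

❱❱❱ python3 myTrain.py -dec=TRADE -bsz=32 -dr=0.2 -lr=0.001 -le=1 -exceptd=hotel
❱❱❱ python3 myTest.py -path=${save_path} -exceptd=hotel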

Few-Shot DST with CL (Continual Learning)

Training (Naive)

❱❱❱ python3 fine_tune.py -bsz=8 -dr=0.2 -lr=0.001 -path=${save_path_except_domain} -exceptd=${except_domain}

EWC

❱❱❱ python3 EWC_train.py -bsz=8 -dr=0.2 -lr=0.001 -path=${save_path_except_domain} -exceptd=${except_domain} -fisher_sample=10000 -l_ewc=${lambda}

GEM

❱❱❱ python3 GEM_train.py -bsz=8 -dr=0.2 -lr=0.001 -path=${save_path_except_domain} -exceptd=${except_domain}
  • -l_ewc: lambda (regularization weight) in EWC training
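For example, fine-tuning with EWC on the held-out hotel domain after a `-exceptd=hotel` pre-training run (the lambda value 0.01 here is purely illustrative, not the paper's setting):

❱❱❱ python3 EWC_train.py -bsz=8 -dr=0.2 -lr=0.001 -path=${save_path_except_domain} -exceptd=hotel -fisher_sample=10000 -l_ewc=0.01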

Other Notes

  • We found that there can be some variance across runs, especially in the few-shot setting. For our own experiments, we used a single random seed (seed=10) for the results reported in the paper. Please check our ACL presentation for results averaged over three runs.

Bug Report

Feel free to create an issue or send an email to [email protected].

License

Copyright 2019-present https://jasonwu0731.github.io/

Permission is hereby granted, free of charge, to any person obtaining a copy 
of this software and associated documentation files (the "Software"), to deal 
in the Software without restriction, including without limitation the rights 
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell 
copies of the Software, and to permit persons to whom the Software is 
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all 
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR 
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, 
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE 
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER 
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, 
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE 
SOFTWARE.