yxuansu / PlanGen

Licence: other
Plan-then-Generate: Controlled Data-to-Text Generation via Planning (EMNLP 2021)

Plan-then-Generate: Controlled Data-to-Text Generation via Planning

Authors: Yixuan Su, David Vandyke, Sihui Wang, Yimai Fang, and Nigel Collier

Code for the EMNLP 2021 paper Plan-then-Generate: Controlled Data-to-Text Generation via Planning

1. Environment Setup:

(1) Hardware Requirement:

The code in this repo was thoroughly tested on our machine with a single Nvidia V100 GPU (16GB).
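
To compare a local GPU against the tested setup, a minimal sketch along the following lines may help. It assumes the NVIDIA driver and its nvidia-smi tool are installed; the helper name show_gpu_memory is hypothetical and not part of this repo.

```shell
# Hypothetical helper (not part of this repo): list each visible GPU
# together with its total memory, one GPU per line.
show_gpu_memory() {
    if command -v nvidia-smi >/dev/null 2>&1; then
        # Query only the GPU name and total memory, without a CSV header row.
        nvidia-smi --query-gpu=name,memory.total --format=csv,noheader
    else
        echo "nvidia-smi not found; cannot query the GPU"
        return 1
    fi
}
```

Running show_gpu_memory prints the name and total memory of each GPU; a card with less than 16GB may need different settings than the ones tested here.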

(2) Installation:

chmod +x ./config_setup.sh
./config_setup.sh

2. ToTTo Data Preprocessing:

Option (1): Preprocess the ToTTo data from scratch by yourself:

cd ./data
chmod +x ./prepare_data.sh
./prepare_data.sh

This process can take up to 1 hour.

Option (2): Download our processed data here

Unzip data.zip and use the extracted folder to replace the empty ./data folder.
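
The replacement step can be sketched as a small shell helper. This is an illustration under stated assumptions, not a script from this repo: it assumes data.zip has already been downloaded into the repository root and that the archive unpacks into a top-level data/ directory. The function name install_processed_data and the data.bak backup folder are hypothetical.

```shell
# Hypothetical helper (not part of this repo): swap the placeholder ./data
# folder for the contents of the downloaded archive.
# Assumption: the archive unpacks into a top-level data/ directory.
install_processed_data() {
    archive="${1:-data.zip}"
    if [ ! -f "$archive" ]; then
        echo "archive not found: $archive" >&2
        return 1
    fi
    if [ -d ./data ]; then
        # Move the existing folder aside instead of deleting it.
        if [ -e ./data.bak ]; then
            echo "./data.bak already exists; remove or rename it first" >&2
            return 1
        fi
        mv ./data ./data.bak
    fi
    # Extract quietly into the current directory.
    unzip -q "$archive"
}
```

From the repository root, install_processed_data data.zip performs the swap. Moving the old folder to data.bak rather than deleting it keeps anything that was already in ./data recoverable.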

For more details about the ToTTo dataset, please refer to the original Google Research repo.

3. Content Planner:

Please refer to README.md in the ./content_planner folder.

4. Sequence Generator:

Please refer to README.md in the ./generator folder.

5. Citation

If you find our paper and resources useful, please kindly cite our paper:

@inproceedings{su-etal-2021-plan-generate,
    title = "Plan-then-Generate: Controlled Data-to-Text Generation via Planning",
    author = "Su, Yixuan  and
      Vandyke, David  and
      Wang, Sihui  and
      Fang, Yimai  and
      Collier, Nigel",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021",
    month = nov,
    year = "2021",
    address = "Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.findings-emnlp.76",
    pages = "895--909",
    abstract = "Recent developments in neural networks have led to the advance in data-to-text generation. However, the lack of ability of neural models to control the structure of generated output can be limiting in certain real-world applications. In this study, we propose a novel Plan-then-Generate (PlanGen) framework to improve the controllability of neural data-to-text models. Extensive experiments and analyses are conducted on two benchmark datasets, ToTTo and WebNLG. The results show that our model is able to control both the intra-sentence and inter-sentence structure of the generated output. Furthermore, empirical comparisons against previous state-of-the-art methods show that our model improves the generation quality as well as the output diversity as judged by human and automatic evaluations.",
}