tokenmill / Awesome Nlg
Licence: cc0-1.0
A curated list of resources dedicated to Natural Language Generation (NLG)
Stars: ✭ 211
Projects that are alternatives of or similar to Awesome Nlg
awesome-nlg
A curated list of resources dedicated to Natural Language Generation (NLG)
Stars: ✭ 386 (+82.94%)
Mutual labels: natural-language-generation, nlg, natural-language-understanding
Gluon Nlp
NLP made easy
Stars: ✭ 2,344 (+1010.9%)
Mutual labels: natural-language-understanding, natural-language-generation, nlg
Kenlg Reading
Reading list for knowledge-enhanced text generation, with a survey
Stars: ✭ 257 (+21.8%)
Mutual labels: natural-language-generation, nlg
Accelerated Text
Accelerated Text is a no-code natural language generation platform. It will help you construct document plans which define how your data is converted to textual descriptions varying in wording and structure.
Stars: ✭ 256 (+21.33%)
Mutual labels: natural-language-generation, nlg
Practical Pytorch
Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained
Stars: ✭ 4,329 (+1951.66%)
Mutual labels: natural-language-generation, nlg
classy
classy is a simple-to-use library for building high-performance Machine Learning models in NLP.
Stars: ✭ 61 (-71.09%)
Mutual labels: natural-language-generation, natural-language-understanding
uctf
Unsupervised Controllable Text Generation (Applied to text Formalization)
Stars: ✭ 19 (-91%)
Mutual labels: natural-language-generation, nlg
Question generation
Neural question generation using transformers
Stars: ✭ 356 (+68.72%)
Mutual labels: natural-language-generation, nlg
factedit
🧐 Code & Data for Fact-based Text Editing (Iso et al; ACL 2020)
Stars: ✭ 16 (-92.42%)
Mutual labels: natural-language-generation, nlg
Nlg Eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (+289.57%)
Mutual labels: natural-language-generation, nlg
Simplenlg
Java API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeen’s Department of Computing Science and co-founder of Arria NLG. This git repo is the official SimpleNLG version.
Stars: ✭ 708 (+235.55%)
Mutual labels: natural-language-generation, nlg
Ludwig
Data-centric declarative deep learning framework
Stars: ✭ 8,018 (+3700%)
Mutual labels: natural-language-understanding, natural-language-generation
turingadvice
Evaluating Machines by their Real-World Language Use
Stars: ✭ 23 (-89.1%)
Mutual labels: natural-language-generation, natural-language-understanding
syntaxmaker
The NLG tool for Finnish
Stars: ✭ 19 (-91%)
Mutual labels: natural-language-generation, nlg
nlp-notebooks
A collection of natural language processing notebooks.
Stars: ✭ 19 (-91%)
Mutual labels: natural-language-generation, natural-language-understanding
Nlp Conference Compendium
Compendium of the resources available from top NLP conferences.
Stars: ✭ 349 (+65.4%)
Mutual labels: natural-language-understanding, natural-language-generation
Transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+26318.01%)
Mutual labels: natural-language-understanding, natural-language-generation
wikiHow paper list
A paper list of research conducted based on wikiHow
Stars: ✭ 25 (-88.15%)
Mutual labels: natural-language-generation, natural-language-understanding
TextFeatureSelection
Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
Stars: ✭ 42 (-80.09%)
Mutual labels: natural-language-generation, natural-language-understanding
This Word Does Not Exist
This Word Does Not Exist
Stars: ✭ 640 (+203.32%)
Mutual labels: natural-language-understanding, natural-language-generation
Awesome Natural Language Generation
Natural Language Generation is a broad domain with applications in chat-bots, story generation, and data descriptions. There is a wide spectrum of different technologies addressing parts or the whole of the NLG process. This list aims to represent this deversity of NLG applications and techniques by providing links to various projects, tools, research papers, and learning materials.
Contents
- Datasets
- Dialog
- Evaluation
- Grammar
- Libraries
- Narrative Generation
- Neural Natural Language Generation
- Papers and Articles
- Products
- Realizers
- Templating Languages
- Videos
Datasets
- Alex Context NLG Dataset - A dataset for NLG in dialogue systems in the public transport information domain.
- Box-score data - This dataset consists of (human-written) NBA basketball game summaries aligned with their corresponding box- and line-scores.
- E2E - This shared task focuses on recent end-to-end (E2E), data-driven NLG methods, which jointly learn sentence planning and surface realisation from non-aligned data.
- Neural-Wikipedian - The repository contains the code along with the required corpora that were used in order to build a system that "learns" how to generate English biographies for Semantic Web triples.
- WeatherGov - Computer-generated weather forecasts from weather.gov (US public forecast), along with corresponding weather data.
- WebNLG - The enriched version of the WebNLG - a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation.
- WikiBio - wikipedia biography dataset - This dataset gathers 728,321 biographies from wikipedia. It aims at evaluating text generation algorithms. For each article, we provide the first paragraph and the infobox (both tokenized).
- The Schema-Guided Dialogue Dataset - The Schema-Guided Dialogue (SGD) dataset consists of over 20k annotated multi-domain, task-oriented conversations between a human and a virtual assistant.
- The Wikipedia company corpus - Company descriptions collected from Wikipedia. The dataset contains semantic representations, short, and long descriptions for 51K companies in English.
- YelpNLG - YelpNLG provides resources for natural language generation of restaurant reviews.
Dialog
- Chatito - Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
- NNDIAL - NNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models.
- Plato - This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.
- RNNLG - RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains.
- TGen - Statistical NLG for spoken dialogue systems.
Evaluation
- BLEURT: a Transfer Learning-Based Metric for Natural Language Generation
- compare-mt - A tool for holistic analysis of language generations systems.
- GEM - a benchmark environment for NLG with a focus on its Evaluation, both through human annotations and automated Metrics.
- NLG-eval - Evaluation code for various unsupervised automated metrics for Natural Language Generation.
- VizSeq - A Visual Analysis Toolkit for Text Generation Tasks.
Grammar
- OpenCCG - OpenCCG library for parsing and realization with CCG.
- GrammaticalFramework - A programming language for multilingual grammar applications.
- EasyCCG - CCG: All combinators, common grammar format, parsing to logical form, parameter estimation for probabilistic CCG.
- CCG Lab - All combinators, common grammar format, parsing to logical form, parameter estimation for probabilistic CCG.
- CCGweb - A Web platform for parsing and annotation.
Libraries
- Cron Expression Descriptor - A .NET library that converts cron expressions into human readable descriptions.
- Number Words - Convert a number to an approximated text expression: from '0.23' to 'less than a quarter'.
Narrative Generation
- Random Story Generator - Using Natural Language Generation (NLG) to create a random short story.
- Tracery - A story-grammar generation library for JavaScript.
Neural Natural Language Generation
- aitextgen - A robust Python tool for text-based AI training and generation using GPT-2.
- graph-2-text - Graph to sequence implemented in Pytorch combining Graph convolutional networks and opennmt-py.
- Image Caption Generator - A Neural Network based generative model for captioning images using Tensorflow.
- PaperRobot: Incremental Draft Generation of Scientific Ideas - We present a PaperRobot who performs as an automatic research assistant.
- PPLM - Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
- Question Generation using hugstransformers - Question generation is the task of automatically generating questions from a text paragraph.
- Texar - Texar is a toolkit aiming to support a broad set of machine learning, especially natural language processing and text generation tasks.
- textgenrnn - Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
- This Word Does Not Exist - This is a project allows people to train a variant of GPT-2 that makes up words, definitions and examples from scratch.
- Transformers - State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
- Summary Generation From Structured Data - For converting information present in the form of structured data into natural language text.
Papers and Articles
- 2020: The Curious Case of Neural Text Degeneration
- 2020: A Gold Standard Methodology for Evaluating Accuracy in Data-To-Text Systems
- 2020: Evaluating the state-of-the-art of End-to-End Natural Language Generation: The E2E NLG challenge
- 2020: How to generate text: using different decoding methods for language generation with Transformers
- 2020: Natural language generation: The commercial state ofthe art in 2020
- 2020: Turing-NLG: A 17-billion-parameter language model by Microsoft
- 2019: A Closer Look at Recent Results of Verb Selection for Data-to-Text NLG
- 2019: A Personalized Data-to-Text Support Tool for Cancer Patients
- 2019: Controlling Contents in Data-to-Document Generation with Human-Designed Topic Labels
- 2019: Generated Texts Must Be Accurate!
- 2019: Hotel Scribe: Generating High Variation Hotel Descriptions
- 2019: Revisiting Challenges in Data-to-Text Generation with Fact Grounding
- 2017: Survey of the State of the Art in NaturalLanguage Generation: Core tasks, applicationsand evaluation
- 2016: Natural Language Generation enhances human decision-making with uncertain information
Products
- Accelerated Text - Automatically generate multiple natural language descriptions of your data varying in wording and structure.
- RosaeNLG - An open-source library for node.js or client side (browser) execution, based on the Pug template engine, to generate texts in English, French, German and Italian.
- Twine - An open-source tool for telling interactive, nonlinear stories.
Realizers
- Genl - Surface realiser (part of a Natural Language Generation system) using Tree Adjoining Grammar.
- JSrealB - A JavaScript bilingual text realizer for web development.
- SimpleNLG - Java API for Natural Language Generation.
- SimpleNLG DE - German version of SimpleNLG 4.
- SimpleNLG-EnFr - SimpleNLG-EnFr 1.1 is a bilingual English/French adaption of SimpleNLG v4.2.
Templating Languages
- calyx - A Ruby library for generating text with recursive template grammars.
- nalgene - Natural language generation language.
- StringTemplate - Java template engine (with ports for C##, Objective-C, JavaScript, Scala) for generating source code, web pages, emails, or any other formatted text output.
Videos
- Data-To-Text: Generating Textual Summaries of Complex Data - Ehud Reiter
- Imitation Learning and its Application to Natural Language Generation
- Natural Language Generation (Introduction)
- Strata Data Conference | The future of natural language generation: 2017-2027
- The Quest for Automated Story Generation - Mark Riedl
License
To the extent possible under law, TokenMill has waived all copyright and related or neighboring rights to this work.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].