Maluuba / nlg-eval

License: other
Evaluation code for various unsupervised automated metrics for Natural Language Generation.

Programming Languages

Python

Projects that are alternatives to or similar to nlg-eval

Rnnlg
RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 487 (-40.75%)
Mutual labels:  dialogue, natural-language-processing, natural-language-generation
Nlp Progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Stars: ✭ 19,518 (+2274.45%)
Mutual labels:  dialogue, natural-language-processing, machine-translation
Question generation
Neural question generation using transformers
Stars: ✭ 356 (-56.69%)
Mutual labels:  natural-language-processing, natural-language-generation, nlg
Nndial
NNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 332 (-59.61%)
Mutual labels:  dialogue, natural-language-processing, natural-language-generation
Practical Pytorch
Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained
Stars: ✭ 4,329 (+426.64%)
Mutual labels:  natural-language-processing, natural-language-generation, nlg
Nonautoreggenprogress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-85.64%)
Mutual labels:  natural-language-processing, machine-translation, natural-language-generation
Gluon Nlp
NLP made easy
Stars: ✭ 2,344 (+185.16%)
Mutual labels:  natural-language-processing, natural-language-generation, nlg
Trade Dst
Source code for transferable dialogue state generator (TRADE, Wu et al., 2019). https://arxiv.org/abs/1905.08743
Stars: ✭ 287 (-65.09%)
Mutual labels:  dialogue, natural-language-processing
Zhihu
This repo contains the source code from my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented in Python 3.6. It includes Natural Language Processing and Computer Vision projects, such as text generation, machine translation, and deep convolutional GANs, along with other hands-on code.
Stars: ✭ 3,307 (+302.31%)
Mutual labels:  natural-language-processing, machine-translation
Bytenet Tensorflow
ByteNet for character-level language modelling
Stars: ✭ 319 (-61.19%)
Mutual labels:  natural-language-processing, machine-translation
SpeechTransProgress
Tracking the progress in end-to-end speech translation
Stars: ✭ 139 (-83.09%)
Mutual labels:  machine-translation, natural-language-generation
Nlp Conference Compendium
Compendium of the resources available from top NLP conferences.
Stars: ✭ 349 (-57.54%)
Mutual labels:  natural-language-processing, natural-language-generation
Multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
Stars: ✭ 384 (-53.28%)
Mutual labels:  dialogue, natural-language-processing
Text2sql Data
A collection of datasets that pair questions with SQL queries.
Stars: ✭ 287 (-65.09%)
Mutual labels:  natural-language-processing, evaluation
Accelerated Text
Accelerated Text is a no-code natural language generation platform. It will help you construct document plans which define how your data is converted to textual descriptions varying in wording and structure.
Stars: ✭ 256 (-68.86%)
Mutual labels:  natural-language-generation, nlg
Kenlg Reading
Reading list for knowledge-enhanced text generation, with a survey
Stars: ✭ 257 (-68.73%)
Mutual labels:  natural-language-generation, nlg
Texar Pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (-22.63%)
Mutual labels:  natural-language-processing, machine-translation
Pplm
Plug and Play Language Model implementation. It allows steering the topic and attributes of GPT-2 models.
Stars: ✭ 674 (-18%)
Mutual labels:  natural-language-processing, natural-language-generation
inkling
Limited Rust implementation of the Ink markup/scripting language for game narratives
Stars: ✭ 22 (-97.32%)
Mutual labels:  dialogue, dialog
dialogue-datasets
A collection of open dialogue corpora and some useful data-processing utilities.
Stars: ✭ 24 (-97.08%)
Mutual labels:  dialogue, dialog

nlg-eval

Evaluation code for various unsupervised automated metrics for NLG (Natural Language Generation). It takes as input a hypothesis file and one or more reference files, and outputs values for the metrics. Rows across these files should correspond to the same example.

Metrics

  • BLEU
  • METEOR
  • ROUGE
  • CIDEr
  • SkipThought cosine similarity
  • Embedding Average cosine similarity (see the sketch after this list)
  • Vector Extrema cosine similarity
  • Greedy Matching score
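
To give intuition for the embedding-based metrics, here is a minimal sketch of Embedding Average cosine similarity: each sentence is represented by the mean of its word vectors, and the two sentence vectors are compared by cosine similarity. The sketch assumes pretrained word vectors are already available as a dict of NumPy arrays; it illustrates the idea only and is not the implementation nlg-eval ships.

import numpy as np

def sentence_embedding(tokens, word_vectors, dim=300):
    # Mean of the vectors of all in-vocabulary tokens; zero vector if none match.
    vecs = [word_vectors[t] for t in tokens if t in word_vectors]
    return np.mean(vecs, axis=0) if vecs else np.zeros(dim)

def embedding_average_cosine(hyp_tokens, ref_tokens, word_vectors):
    h = sentence_embedding(hyp_tokens, word_vectors)
    r = sentence_embedding(ref_tokens, word_vectors)
    denom = np.linalg.norm(h) * np.linalg.norm(r)
    return float(np.dot(h, r) / denom) if denom else 0.0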

Setup

Install Java 1.8.0 (or higher). Then run:

# Install the Python dependencies.
pip install git+https://github.com/Maluuba/nlg-eval.git@master

# If using macOS High Sierra or higher, run this before running setup to allow multithreading:
# export OBJC_DISABLE_INITIALIZE_FORK_SAFETY=YES

# Simple setup:
# Download required data (e.g. models, embeddings) and external code files.
nlg-eval --setup

Custom Setup

# If you don't like the default path (~/.cache/nlgeval) for the downloaded data,
# then specify a path where you want the files to be downloaded.
# The value for the data path is stored in ~/.config/nlgeval/rc.json and can be overwritten by
# setting the NLGEVAL_DATA environment variable.
nlg-eval --setup ${data_path}

Usage

Once setup has completed, the metrics can be evaluated with a Python API or from the command line.

Examples of the Python API can be found in test_nlgeval.py.

Standalone

nlg-eval --hypothesis=examples/hyp.txt --references=examples/ref1.txt --references=examples/ref2.txt

where each line in the hypothesis file is a generated sentence and the corresponding lines across the reference files are the ground-truth references for that hypothesis.
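
For example, with two reference files, line 1 of hyp.txt is scored against line 1 of ref1.txt and line 1 of ref2.txt, line 2 against line 2 of each, and so on. The file contents below are invented purely for illustration:

# hyp.txt
the cat sat on a mat
there is a dog in the garden

# ref1.txt
the cat is sitting on the mat
a dog is out in the garden

# ref2.txt
a cat sits on the mat
the garden has a dog in it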

functional API: for the entire corpus

from nlgeval import compute_metrics
metrics_dict = compute_metrics(hypothesis='examples/hyp.txt',
                               references=['examples/ref1.txt', 'examples/ref2.txt'])
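
compute_metrics returns the scores as a dictionary keyed by metric name (the same names shown in the Example section below), so the results can be inspected programmatically:

# Keys match the metric names in the Example section, e.g. 'Bleu_4'.
print(metrics_dict['Bleu_4'])
for name, score in sorted(metrics_dict.items()):
    print('{}: {:.6f}'.format(name, score))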

functional API: for only one sentence

from nlgeval import compute_individual_metrics
metrics_dict = compute_individual_metrics(references, hypothesis)

where references is a list of ground truth reference text strings and hypothesis is the hypothesis text string.
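
A concrete call might look like this (the sentences are made up for illustration):

from nlgeval import compute_individual_metrics

references = ['the cat is sitting on the mat',
              'a cat sits on the mat']
hypothesis = 'the cat sat on a mat'
metrics_dict = compute_individual_metrics(references, hypothesis)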

object oriented API for repeated calls in a script - single example

from nlgeval import NLGEval
nlgeval = NLGEval()  # loads the models
metrics_dict = nlgeval.compute_individual_metrics(references, hypothesis)

where references is a list of ground truth reference text strings and hypothesis is the hypothesis text string.
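
Constructing NLGEval once and reusing it avoids reloading the models on every call. If you only need the overlap metrics, the constructor can also skip the heavyweight embedding-based models; the keyword arguments below exist in the repository's API, but verify them against your installed version:

from nlgeval import NLGEval

# Skip the SkipThoughts and GloVe-based metrics to speed up loading.
nlgeval = NLGEval(no_skipthoughts=True, no_glove=True)
metrics_dict = nlgeval.compute_individual_metrics(references, hypothesis)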

object oriented API for repeated calls in a script - multiple examples

from nlgeval import NLGEval
nlgeval = NLGEval()  # loads the models
metrics_dict = nlgeval.compute_metrics(references, hypothesis)

where references is a list of lists of ground-truth reference strings and hypothesis is a list of hypothesis strings. Each inner list in references is one reference source: it contains one reference string for each sentence in hypothesis, in the same order.
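
To make the expected shape concrete (sentences made up for illustration): with two hypotheses and two reference sources, references[i][j] is the i-th source's reference for the j-th hypothesis:

from nlgeval import NLGEval

nlgeval = NLGEval()  # loads the models
hypothesis = ['the cat sat on a mat',
              'there is a dog in the garden']
references = [
    # one inner list per reference source, aligned with the hypotheses
    ['the cat is sitting on the mat', 'a dog is out in the garden'],
    ['a cat sits on the mat', 'the garden has a dog in it'],
]
metrics_dict = nlgeval.compute_metrics(references, hypothesis)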

Reference

If you use this code as part of any published research, please cite the following paper:

Shikhar Sharma, Layla El Asri, Hannes Schulz, and Jeremie Zumer. "Relevance of Unsupervised Metrics in Task-Oriented Dialogue for Evaluating Natural Language Generation." arXiv preprint arXiv:1706.09799 (2017).

@article{sharma2017nlgeval,
    author  = {Sharma, Shikhar and El Asri, Layla and Schulz, Hannes and Zumer, Jeremie},
    title   = {Relevance of Unsupervised Metrics in Task-Oriented Dialogue for Evaluating Natural Language Generation},
    journal = {CoRR},
    volume  = {abs/1706.09799},
    year    = {2017},
    url     = {http://arxiv.org/abs/1706.09799}
}

Example

Running

nlg-eval --hypothesis=examples/hyp.txt --references=examples/ref1.txt --references=examples/ref2.txt

gives

Bleu_1: 0.550000
Bleu_2: 0.428174
Bleu_3: 0.284043
Bleu_4: 0.201143
METEOR: 0.295797
ROUGE_L: 0.522104
CIDEr: 1.242192
SkipThoughtsCosineSimilarity: 0.626149
EmbeddingAverageCosineSimilarity: 0.884690
VectorExtremaCosineSimilarity: 0.568696
GreedyMatchingScore: 0.784205

Troubleshooting

If you have issues with METEOR, you can try lowering the mem variable in meteor.py.
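
METEOR runs as a Java subprocess, and mem caps its JVM heap. Roughly, meteor.py launches the jar along these lines (an approximation for orientation only; check the actual file in your checkout):

# Approximate shape of the launch code in meteor.py:
mem = '2G'  # lower this (e.g. to '1G') if Java cannot allocate its heap
meteor_cmd = ['java', '-jar', '-Xmx{}'.format(mem), 'meteor-1.5.jar',
              '-', '-', '-stdio', '-l', 'en', '-norm']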

Important Note

CIDEr by default (with the idf parameter set to "corpus" mode) computes IDF values using the reference sentences provided. Thus, the CIDEr score for a reference dataset with only one image (or example, in the NLG case) will be zero. When evaluating with one (or few) images, set idf to "coco-val-df" instead, which uses IDF from the MSCOCO Validation Dataset for reliable results. This has not been adapted in this code. For this use case, apply the patches from vrama91/coco-caption.
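
The zero score follows from the IDF definition: in "corpus" mode, IDF(g) = log(N / df(g)), where N is the number of examples and df(g) is the number of examples whose references contain the n-gram g. With a single example, N = 1 and df(g) = 1 for every n-gram that occurs, so every weight is log(1) = 0.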

External data directory

To mount an already-prepared data directory into a Docker container, or to share it between users, you can set the NLGEVAL_DATA environment variable to tell nlg-eval where to find its models and data, e.g.

NLGEVAL_DATA=~/workspace/nlg-eval/nlgeval/data

This variable overrides the value provided during setup (stored in ~/.config/nlgeval/rc.json).
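
For example, a prepared data directory could be mounted into a container like this (the image name is a placeholder):

docker run -it \
    -v ~/workspace/nlg-eval/nlgeval/data:/nlgeval-data \
    -e NLGEVAL_DATA=/nlgeval-data \
    your-nlg-eval-image \
    nlg-eval --hypothesis=examples/hyp.txt --references=examples/ref1.txt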

Microsoft Open Source Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information, see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

License

See LICENSE.md.
