
oxford-cs-deepnlp-2017 / Practical 3

Oxford Deep NLP 2017 course - Practical 3: Text Classification with RNNs

Projects that are alternatives of or similar to Practical 3

Mt Dnn
Multi-Task Deep Neural Networks for Natural Language Understanding
Stars: ✭ 72 (-7.69%)
Mutual labels:  natural-language-processing
Nlp Tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
Stars: ✭ 9,895 (+12585.9%)
Mutual labels:  natural-language-processing
Dialogue Understanding
This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
Stars: ✭ 77 (-1.28%)
Mutual labels:  natural-language-processing
Sense2vec
🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+1417.95%)
Mutual labels:  natural-language-processing
Course Computational Literary Analysis
Course materials for Introduction to Computational Literary Analysis, taught at UC Berkeley in Summer 2018, 2019, and 2020, and at Columbia University in Fall 2020.
Stars: ✭ 74 (-5.13%)
Mutual labels:  natural-language-processing
Rutermextract
Term extraction for Russian language
Stars: ✭ 75 (-3.85%)
Mutual labels:  natural-language-processing
Causal Text Papers
Curated research at the intersection of causal inference and natural language processing.
Stars: ✭ 72 (-7.69%)
Mutual labels:  natural-language-processing
Chinese Xlnet
Pre-Trained Chinese XLNet
Stars: ✭ 1,213 (+1455.13%)
Mutual labels:  natural-language-processing
Stminsights
A Shiny Application for Inspecting Structural Topic Models
Stars: ✭ 74 (-5.13%)
Mutual labels:  natural-language-processing
Monkeylearn Ruby
Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.
Stars: ✭ 76 (-2.56%)
Mutual labels:  natural-language-processing
Python nlp tutorial
This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (-7.69%)
Mutual labels:  natural-language-processing
Nlp Tutorial
A list of NLP(Natural Language Processing) tutorials
Stars: ✭ 1,188 (+1423.08%)
Mutual labels:  natural-language-processing
Awesome Bert Japanese
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
Stars: ✭ 76 (-2.56%)
Mutual labels:  natural-language-processing
Absa Pytorch
Aspect Based Sentiment Analysis, PyTorch Implementations.
Stars: ✭ 1,181 (+1414.1%)
Mutual labels:  natural-language-processing
Abigsurvey
A collection of 500+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML)
Stars: ✭ 1,203 (+1442.31%)
Mutual labels:  natural-language-processing
Man
Multinomial Adversarial Networks for Multi-Domain Text Classification (NAACL 2018)
Stars: ✭ 72 (-7.69%)
Mutual labels:  natural-language-processing
Hunspell
The most popular spellchecking library.
Stars: ✭ 1,196 (+1433.33%)
Mutual labels:  natural-language-processing
Text Dependency Parser
🏄 Dependency parsing, NLP, natural language processing
Stars: ✭ 78 (+0%)
Mutual labels:  natural-language-processing
Multimodal Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
Stars: ✭ 78 (+0%)
Mutual labels:  natural-language-processing
Nested Ner Tacl2020 Transformers
Implementation of Nested Named Entity Recognition using BERT
Stars: ✭ 76 (-2.56%)
Mutual labels:  natural-language-processing

Practical 3: Text Classification with RNNs

[Chris Dyer, Phil Blunsom, Yannis Assael, Brendan Shillingford, Yishu Miao]

In this practical, you can explore one of two applications of RNNs: text classification or language modelling (you are welcome to try both, too). We will be using the training/dev/test splits that we created in Practical 2.

Text classification (Task 1)

Last week’s practical introduced text classification as a problem that could be solved with deep learning. The document representation function we used was very simple: an average over the word embeddings in the document. This week, you will use RNNs to compute the representations of the documents.

In the figure below, on the left we show the document representation function that was used in last week’s practical. Your goal in this task is to adapt your code to use the architecture on the right.

[Figure: the document representation function from Practical 2 (left) and the RNN-based one for Practical 3 (right)]

Note that in Practical 3, x is defined to be the average of the RNN hidden states (the h_t’s), not just their sum.
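To make the target architecture concrete, here is a minimal sketch, assuming PyTorch (the practical does not prescribe a framework); the vocabulary, embedding, hidden, and class sizes are illustrative placeholders, not values from the practical:

```python
import torch
import torch.nn as nn

class RNNDocClassifier(nn.Module):
    """Classify a document from the average of the RNN hidden states (the h_t's)."""

    def __init__(self, vocab_size, embed_dim, hidden_dim, num_classes):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.classifier = nn.Linear(hidden_dim, num_classes)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) integer-encoded documents
        embedded = self.embedding(token_ids)       # (batch, seq_len, embed_dim)
        hidden_states, _ = self.rnn(embedded)      # (batch, seq_len, hidden_dim)
        x = hidden_states.mean(dim=1)              # x = average of the h_t's
        return self.classifier(x)                  # unnormalised class scores

# Illustrative usage with placeholder sizes.
model = RNNDocClassifier(vocab_size=10000, embed_dim=100, hidden_dim=128, num_classes=8)
dummy_docs = torch.randint(0, 10000, (4, 50))      # 4 documents of 50 tokens each
logits = model(dummy_docs)                         # shape (4, 8)
```

In practice you would also mask out padding positions before averaging; that detail is omitted here for brevity.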

Questions

  1. What are the benefits and downsides of the RNN-based representation over the bag of words representation used last week? How would availability of data affect your answer?
  2. One possible architectural variant is to use only the final hidden state of the RNN as the document representation (i.e., x) rather than the average of the hidden states over time. How does this work? What are the potential benefits and downsides to this representation? (A sketch of this variant, and of the bidirectional one from question 4, appears after this list.)
  3. Try different RNN architectures, e.g., simple Elman RNNs or GRUs or LSTMs. Which ones work best?
  4. What happens if you use a bidirectional LSTM (i.e., the dashed arrows in the figure)?
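For questions 2 and 4, the only changes are the pooling step and the RNN direction. A hedged sketch of both variants, reusing the placeholder sizes from the classifier above:

```python
import torch
import torch.nn as nn

embed_dim, hidden_dim = 100, 128
embedded = torch.randn(4, 50, embed_dim)       # stand-in for a batch of embedded documents

# Question 2: use only the final hidden state as the document representation x.
rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
_, (h_n, _) = rnn(embedded)
x_final = h_n[-1]                              # (4, hidden_dim): final state of the last layer

# Question 4: a bidirectional LSTM (the dashed arrows in the figure). The two directions
# are concatenated, so the downstream classifier must expect 2 * hidden_dim features.
birnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True, bidirectional=True)
bi_states, _ = birnn(embedded)                 # (4, 50, 2 * hidden_dim)
x_bi = bi_states.mean(dim=1)                   # (4, 2 * hidden_dim)
```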

(Optional, for enthusiastic students) RNNs are expensive to use as “readers” on long sequences. Truncated backpropagation through time (truncated BPTT) can be used to get better parallelism. You are encouraged to use this to get better computational efficiency; one way to set up the truncation is sketched below.
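One way to realise the truncation, sketched under the assumption of PyTorch and an arbitrary chunk length, is to feed each document in fixed-length chunks and detach the carried-over state so gradients never flow across chunk boundaries:

```python
import torch
import torch.nn as nn

chunk_len, embed_dim, hidden_dim = 20, 100, 128
rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
embedded = torch.randn(4, 200, embed_dim)      # stand-in for a batch of long embedded documents

state, chunk_outputs = None, []
for start in range(0, embedded.size(1), chunk_len):
    out, state = rnn(embedded[:, start:start + chunk_len], state)
    state = tuple(s.detach() for s in state)   # truncate: no gradient across the chunk boundary
    chunk_outputs.append(out)

x = torch.cat(chunk_outputs, dim=1).mean(dim=1)   # same mean-of-h_t representation as before
```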

Language Modelling with RNNs (Task 2)

As covered in lecture last week, RNN language models use the chain rule to decompose the probability of a sequence into a product of word probabilities, each conditioned on the previously generated words:

p(w_1, \ldots, w_T) = \prod_{t=1}^{T} p(w_t \mid w_{<t})

To avoid problems with floating point underflow, it is customary to work with this quantity in log space.

Given a training sequence, the training graph for a language model looks like this:

[Figure: unrolled training graph for an RNN language model]

Your task is to train an RNN language model on the training portion of the TED data, using the validation set to determine when to stop optimising the parameters of the model.
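A minimal sketch of such a model and training loop, assuming PyTorch; the sizes, optimiser, and stopping rule are illustrative, and the random stand-in batches would be replaced by integer-encoded TED sentences:

```python
import torch
import torch.nn as nn

class RNNLanguageModel(nn.Module):
    """Predict the next word from an RNN summary of the words seen so far."""

    def __init__(self, vocab_size, embed_dim=100, hidden_dim=256):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.output = nn.Linear(hidden_dim, vocab_size)

    def forward(self, token_ids, state=None):
        # token_ids: (batch, seq_len); returns next-word logits at every position.
        hidden_states, state = self.rnn(self.embedding(token_ids), state)
        return self.output(hidden_states), state

def mean_nll(model, token_ids):
    """Mean negative log p(w_t | w_<t) over token_ids[:, 1:]."""
    logits, _ = model(token_ids[:, :-1])
    return nn.functional.cross_entropy(
        logits.reshape(-1, logits.size(-1)), token_ids[:, 1:].reshape(-1))

# Stand-in data; in the practical these would be the TED training and validation splits.
train_batches = [torch.randint(0, 10000, (8, 30)) for _ in range(100)]
dev_batches = [torch.randint(0, 10000, (8, 30)) for _ in range(10)]

model = RNNLanguageModel(vocab_size=10000)
optimiser = torch.optim.Adam(model.parameters())
best_val = float("inf")
for epoch in range(50):
    for batch in train_batches:
        loss = mean_nll(model, batch)
        optimiser.zero_grad()
        loss.backward()
        optimiser.step()
    with torch.no_grad():
        val = sum(mean_nll(model, b).item() for b in dev_batches) / len(dev_batches)
    if val >= best_val:
        break                                  # validation loss stopped improving
    best_val = val
```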

A language model can be evaluated quantitatively by computing the (per-word) perplexity of the model on a held-out test corpus,

\mathrm{ppl} = \exp\left( -\frac{1}{|\text{test set}|} \sum_{t=1}^{|\text{test set}|} \log p(w_t \mid w_{<t}) \right)

where |test set| is the length of the test set in words, including any <UNK> tokens. (Note: you can measure length in terms of any units, including characters, words, or sentences; these are just different ways of quantifying how much uncertainty the model has about different units.)
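In code this is just the exponential of the mean negative log-probability; a small helper, assuming natural logarithms and that num_words counts every token in the test set (including <UNK>):

```python
import math

def perplexity(total_log_prob, num_words):
    """total_log_prob: sum of log p(w_t | w_<t) over the whole test set (natural log)."""
    return math.exp(-total_log_prob / num_words)

# e.g. a test set of 10,000 words with total log-probability -55,000 gives
# perplexity(-55000, 10000) ≈ 244.7
```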

To evaluate the model qualitatively, generate random samples from the model by sampling from p(w_t | w_{<t}) and then feeding the sampled value of w_t into the RNN at time t+1.
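A sketch of this sampling loop, assuming the RNNLanguageModel above and an illustrative index 0 for the start-of-sentence token:

```python
import torch

def sample(model, start_token=0, max_len=30):
    """Draw a random word sequence from the model, one word at a time."""
    model.eval()
    token = torch.tensor([[start_token]])            # shape (1, 1): batch of one, one step
    state, generated = None, []
    with torch.no_grad():
        for _ in range(max_len):
            logits, state = model(token, state)      # p(w_t | w_<t) via the carried RNN state
            probs = torch.softmax(logits[0, -1], dim=-1)
            token = torch.multinomial(probs, num_samples=1).unsqueeze(0)   # feed the sample back in at t+1
            generated.append(token.item())
    return generated
```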

Questions

  1. If you change the preprocessing of your corpus (e.g., you turn more words into <UNK> or you lowercase everything), is perplexity still comparable?
  2. To make training tractable you can either treat sentences as i.i.d. or you can use truncated BPTT. Is the i.i.d. assumption valid? What are the benefits and downsides of making this assumption vs. truncated BPTT? How do you think perplexities will compare on the held-out test set? (If you’re feeling adventurous, try it out!)
  3. Rather than modeling documents as a sequence of words, you can model the document as a sequence of characters. Are the per-word perplexities comparable between these two models? What benefits does modeling text at the character level have? What disadvantages?
  4. Try a couple variations of the model using different definitions of RNNs (e.g., LSTMs, GRUs, simple Elman RNNs) that were covered in class. How do perplexities compare?
  5. In text classification, using bidirectional RNNs was suggested. Could you use bidirectional RNNs for the language modelling task? Why or why not?

Handin

On paper, show a practical demonstrator your responses to these questions to get signed off.
