All Projects → lil-lab → Touchdown

lil-lab / Touchdown

Licence: cc-by-4.0
Cornell Touchdown natural language navigation and spatial reasoning dataset.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Touchdown

Bidaf Keras
Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2
Stars: ✭ 60 (-13.04%)
Mutual labels:  natural-language-processing
Emailparser
remove signature blocks from emails
Stars: ✭ 63 (-8.7%)
Mutual labels:  natural-language-processing
Convai Bot 1337
NIPS Conversational Intelligence Challenge 2017 Winner System: Skill-based Conversational Agent with Supervised Dialog Manager
Stars: ✭ 65 (-5.8%)
Mutual labels:  natural-language-processing
Language Models
Build unigram and bigram language models, implement Laplace smoothing and use the models to compute the perplexity of test corpora.
Stars: ✭ 59 (-14.49%)
Mutual labels:  natural-language-processing
Emnlp2018 nli
Repository for NLI models (EMNLP 2018)
Stars: ✭ 62 (-10.14%)
Mutual labels:  natural-language-processing
Gpt2
PyTorch Implementation of OpenAI GPT-2
Stars: ✭ 64 (-7.25%)
Mutual labels:  natural-language-processing
Nlg Rl
Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction
Stars: ✭ 59 (-14.49%)
Mutual labels:  natural-language-processing
Intent classifier
Stars: ✭ 67 (-2.9%)
Mutual labels:  natural-language-processing
Repo 2017
Python codes in Machine Learning, NLP, Deep Learning and Reinforcement Learning with Keras and Theano
Stars: ✭ 1,123 (+1527.54%)
Mutual labels:  natural-language-processing
Chicksexer
A Python package for gender classification.
Stars: ✭ 64 (-7.25%)
Mutual labels:  natural-language-processing
Fromscratch
Stars: ✭ 61 (-11.59%)
Mutual labels:  natural-language-processing
Slate
A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python
Stars: ✭ 61 (-11.59%)
Mutual labels:  natural-language-processing
Multilingual Latent Dirichlet Allocation Lda
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.
Stars: ✭ 64 (-7.25%)
Mutual labels:  natural-language-processing
Textblob Ar
Arabic support for textblob
Stars: ✭ 60 (-13.04%)
Mutual labels:  natural-language-processing
Text Analytics With Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+1540.58%)
Mutual labels:  natural-language-processing
Botsharp
The Open Source AI Chatbot Platform Builder in 100% C# Running in .NET Core with Machine Learning algorithm.
Stars: ✭ 1,103 (+1498.55%)
Mutual labels:  natural-language-processing
Languagetoys
Random fun with statistical language models.
Stars: ✭ 63 (-8.7%)
Mutual labels:  natural-language-processing
Hackerrank
This is the Repository where you can find all the solution of the Problems which you solve on competitive platforms mainly HackerRank and HackerEarth
Stars: ✭ 68 (-1.45%)
Mutual labels:  natural-language-processing
Capsnet Nlp
CapsNet for NLP
Stars: ✭ 66 (-4.35%)
Mutual labels:  natural-language-processing
Kor2vec
Library for Korean morpheme and word vector representation
Stars: ✭ 64 (-7.25%)
Mutual labels:  natural-language-processing

The Touchdown Dataset

Touchdown is a corpus for executing navigation instructions and resolving spatial descriptions in visual real-world environments. The task is to follow instruction to a goal position and there find a hidden object, Touchdown the bear.

The details of the corpus and task are described in: Touchdown: Natural Language Navigation and Spatial Reasoning in Visual Street Environments. Howard Chen, Alane Suhr, Dipendra Misra, Noah Snavely, and Yoav Artzi.

Paper: https://arxiv.org/abs/1811.12354

A short video explaining the task and showing an example: https://youtu.be/uCcDbTZs3v4

The code for the original navigation expriments is available here: https://github.com/lil-lab/ciff

Data

This repository contains the Touchdown corpus. The navigation environment is made of a large number of panoramas. To download the panoramas, please use the StreetLearn environment. You can request access to the panoramic images by filling out the form in StreetLearn Dataset. More details are here.

Starting example

The example runs a random policy with dummy image features in the environment.

python3 navigator.py

Structure of directory

  • data/: includes JSON files train.json, dev.json, test.json. These are the data files for navigation and spatial description resolution (SDR) tasks.

  • graph/: includes .txt files for constructing the graph. nodes.txt contains all nodes in the graph. links.txt contains all edges in the graph.

Graph

The script graph_loader.py loads the graph with the following two files, and base_navigator.py uses it to initialize the graph.

  • nodes.txt: has 4 columns panoid, pano_yaw_angle, latitude, and longitude
  • links.txt: has 3 columns start_panoid, heading, and end_panoid

JSON files

The JSON files contain both data for the navigation task and the SDR task. All three files follow the same structure described as follows.

Route information

  • city: city name
  • route_id: unique route id
  • elapsed: time spent on writing instructions for this route
  • failure_stats: number of attempts the instrcution writer took to place Touchdown the bear at the final position/panorama
  • num_finished: number of runs for followers to find the bear
  • full_text: full instructions from navigation to Touchdown bear placement

Navigation task

  • navigation_text: instruction text for navigation
  • route_panoids: a list of panorama ids of the route from start to end
  • start_heading: start heading angle in degrees
  • end_heading: end heading angle in degrees

Spatial Description Resolution (SDR) task

  • td_location_text: instruction text for SDR
  • pre_pano, main_pano, post_pano: panorama ids, main_pano is the target position pano id where Touchdown is placed. per_pano and post_pano are the before and after target position panorama
  • pre_static_center, main_static_center, post_static_center: the click position {x: width_ratio, y: height_ratio} of where Touchdown is placed, {x: -1, y: -1} means Touchdown can't be found for the panorama

You can construct your Gaussian smoothed target from the *_center click positions or contact us for cached targets.

Experiments reproduction code

The Touchdown tasks are reproduced by Harsh et al (2020). For more details, please refer to this technical report and the VALAN codebase.

License

The Touchdown Dataset (c) 2018

The Touchdown Dataset is licensed under a Creative Commons Attribution 4.0 International License.

You should have received a copy of the license along with this work. If not, see http://creativecommons.org/licenses/by/4.0/.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].