Official Python/PyTorch Implementation for "All-In-One Drive: A Large-Scale Comprehensive Perception Dataset with High-Density Long-Range Point Clouds"

Stars: ✭ 32 (+113.33%)

Mutual labels: evaluation

NominatimGeocoderBackend

UnifiedNlp geocoder backend that uses the OSM Nominatim service

Stars: ✭ 49 (+226.67%)

Mutual labels: geocoding

View All Similar Projects ➔

What's Missing In Geoparsing?

NEWS UPDATE 31.9.2019 - We have a LONG FOLLOW-UP PAPER OUT NOW that greatly expands on this topic. The title is "A Pragmatic Guide to Geoparsing Evaluation." It's now been published at Springer LREV Journal. For the project/paper repository, follow this link.

"Science is a wonderful thing if one does not have to earn one's living at it." -- Albert Einstein

Summary

Thanks for stopping by! In this repository, you will find the accompanying code and data for the publication "What's missing in geographical parsing?" in the journal Language Resources and Evaluation. In the unlikely case of any files missing, please track me down and I'll upload 👍

What's included

data - This is the output of all systems on both datasets (2 * 5 files) plus the gold standard (2 files)
The dataset WikToR(SciPaper).xml is the original data as described and used in the paper.
The LGL dataset, which is also used for evaluation is included as lgl.xml
Essential experiment files (plus supporting scripts)

How to replicate

You should have some basic Python libraries like Numpy, NLTK, Matplotlib (if you want graphics), ... to start with.

methods.py is the main python script for running the experiments (requires the yahoo.py script)
Please install GeoPy to calculate the distances between coordinates.
Also install Wikipedia for Python, nice API wrapper 👍
Scroll down to the end of the file to see example usage, I included all necessary instructions and comments.
Enjoy!

How to (re)create and modify WikToR

The dataset (WikToR) can be created (and unite tested) from scratch, extended, reduced, with more or fewer sentences added, etc. If you wish to do that, great! Here's what you need:

The wiktor.py file is the python script used to (re)generate and unit test WikToR.
Download the allCountries.txt data dump from GeoNames and save in the same directory as the script.
Please sign up for a GeoNames account and a USERNAME, which you will need to fill in on line 42 to ensure the API query works.
The first half of wiktor.py is for CORPUS CREATION, the second half is for CORPUS TESTING.
Enjoy!

"The science of today is the technology of tomorrow." -- Edward Teller

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

milangritta / WhatsMissingInGeoparsing

Programming Languages

Labels

Projects that are alternatives of or similar to WhatsMissingInGeoparsing

What's Missing In Geoparsing?

Summary

What's included

How to replicate

How to (re)create and modify WikToR