All Projects → nasa-jpl-memex → GeoParser

nasa-jpl-memex / GeoParser

Licence: Apache-2.0 license
Extract and Visualize location from any file

Programming Languages

javascript
184084 projects - #8 most used programming language
CSS
56736 projects
HTML
75241 projects
java
68154 projects - #9 most used programming language
shell
77523 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to GeoParser

geospatial-learn
A python library for geo-spatial processing and machine learning
Stars: ✭ 20 (-58.33%)
Mutual labels:  geospatial-data, geospatial-processing, geospatial-analysis
Xponents
Geographic Place, Date/time, and Pattern entity extraction toolkit along with text extraction from unstructured data and GIS outputters.
Stars: ✭ 39 (-18.75%)
Mutual labels:  solr, tika
community-sprints
This repo is used to organize and collaborate in sprints for code projects, hackathons, STAC and metadata specs development
Stars: ✭ 18 (-62.5%)
Mutual labels:  geospatial-data, geospatial-analysis
Open Semantic Etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Stars: ✭ 165 (+243.75%)
Mutual labels:  solr, extract
zzlib
zlib-compressed file depacking library in Lua
Stars: ✭ 44 (-8.33%)
Mutual labels:  extract
CoronaApp
Real Time Corona App
Stars: ✭ 22 (-54.17%)
Mutual labels:  covid-19
dekontaminasi
DIY static API server for COVID-19 data in Indonesia
Stars: ✭ 66 (+37.5%)
Mutual labels:  covid-19
ZeeArchiver
Zee is an efficient and simple to use Android Archiver and decompressor. It can decompress and compress from-to all the formats supported by the well known 7zip utility. Copyright © 2018 Mahmoud Galal , for support contact me:[email protected]
Stars: ✭ 35 (-27.08%)
Mutual labels:  extract
covaccine-notifier
CoWIN Vaccine availability notifier for India
Stars: ✭ 63 (+31.25%)
Mutual labels:  covid-19
video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
Stars: ✭ 1,763 (+3572.92%)
Mutual labels:  extract
ncovis-2020
covid-19 舆论和新闻的可视化平台,获得了中国计算机学会、阿里云和机器之心等举办的疫情可视化比赛铜奖。🔥
Stars: ✭ 37 (-22.92%)
Mutual labels:  covid-19
covidestim
Bayesian nowcasting with adjustment for delayed and incomplete reporting to estimate COVID-19 infections in the United States
Stars: ✭ 20 (-58.33%)
Mutual labels:  covid-19
covid19
愛知県 新型コロナウイルス感染症対策サイト / Aichi COVID-19 Task Force website
Stars: ✭ 68 (+41.67%)
Mutual labels:  covid-19
R3ditor
An open-source project created to reverse-engineering some Resident Evil 3 files
Stars: ✭ 19 (-60.42%)
Mutual labels:  extract
awesome-geospatial-data-download-sites
This is the repo for open source geospatial data download sites.
Stars: ✭ 19 (-60.42%)
Mutual labels:  geospatial-data
NCOV-19
An app that keeps you Updated on the CORONA virus outbreak stats.
Stars: ✭ 16 (-66.67%)
Mutual labels:  covid-19
solr-container
Ansible Container project that manages the lifecycle of Apache Solr on Docker.
Stars: ✭ 17 (-64.58%)
Mutual labels:  solr
COVID-EMDA
A Cross-Domain Data Hub with Electricity Market, Coronavirus Case, Mobility and Satellite Data in U.S.
Stars: ✭ 53 (+10.42%)
Mutual labels:  covid-19
web-covid-api
🦠COVID-19 Coronavirus 🔥Tracker Dashboard and 🚀Super fast API's (< 200ms) 🆕Updates every 3 mins
Stars: ✭ 18 (-62.5%)
Mutual labels:  covid-19
pantaucovid-android
🌍 open source project for learns android app using covid topic.
Stars: ✭ 20 (-58.33%)
Mutual labels:  covid-19

GeoParser

The Geoparser is a software tool that can process information from any type of file, extract geographic coordinates, and visualize locations on a map. Users who are interested in seeing a geographical representation of information or data can choose to search for locations using the Geoparser, through a search index or by uploading files from their computer. The Geoparser will parse the files and visualizes cities or latitude-longitude points on the map. After the information is parsed and points are plotted on the map, users are able to filter their results by density, or by searching a key word and applying a "facet" to the parsed information. On the map, users can click on location points to reveal more information about the location and how it is related to their search.

Installation (Docker)

  1. docker build -t nasajplmemex/geo-parser --no-cache -f Dockerfile .
  2. docker-compose up -d
  3. Visit http://localhost:8000 on your browser

Try it out to help fight COVID!

GeoParser has been updated with a new easy to use Docker install, and also an example to download and run the COVID-19 literature data and view the locations. Use that example to explore and test out GeoParser on a real example and view locations from that dataset.

Installation (manually)

Requirements

  1. Python 2.7
  2. pip
  3. Django
  4. Tika Python

Install Requirements

  1. Install python requirements
pip install -r requirements.txt

How to Run the Application

  1. Run Solr Change directory to where you cloned the project cd Solr/solr-5.3.1/ ./bin/solr start

  2. Clone lucene-geo-gazetteer repo

    git clone https://github.com/chrismattmann/lucene-geo-gazetteer.git
    cd lucene-geo-gazetteer
    mvn install assembly:assembly
    add lucene-geo-gazetteer/src/main/bin to your PATH environment variable
    

    make sure it is working

    lucene-geo-gazetteer --help
    usage: lucene-geo-gazetteer
     -b,--build <gazetteer file>           The Path to the Geonames
                                           allCountries.txt
     -h,--help                             Print this message.
     -i,--index <directoryPath>            The path to the Lucene index
                                           directory to either create or read
     -s,--search <set of location names>   Location names to search the
                                           Gazetteer for
    
  3. You will now need to build a Gazetteer using the Geonames.org dataset. (1.2 GB)

    cd lucene-geo-gazetteer
    curl -O http://download.geonames.org/export/dump/allCountries.zip
    unzip allCountries.zip
    lucene-geo-gazetteer -i geoIndex -b allCountries.txt
    

    make sure it is working

    lucene-geo-gazetteer -s Pasadena Texas
    [
    {"Texas" : [
    "Texas",
    "-91.92139",
    "18.05333"
    ]},
    {"Pasadena" : [
    "Pasadena",
    "-74.06446",
    "4.6964"
    ]}
    ]
    

Now start lucene-geo-gazetteer server

lucene-geo-gazetteer -server
  1. Run tika server as mentioned in https://cwiki.apache.org/confluence/display/TIKA/GeoTopicParser on port 8001. Port can be configured via config.txt

  2. Make sure you can extract locations from Tika Server

curl -T /path/to/polar.geot -H "Content-Disposition: attachment; filename=polar.geot" http://localhost:8001/rmeta

You can obtain [file here] (https://raw.githubusercontent.com/chrismattmann/geotopicparser-utils/master/geotopics/polar.geot)

Output should be this

[
   {
      "Content-Type":"application/geotopic",
      "Geographic_LATITUDE":"39.76",
      "Geographic_LONGITUDE":"-98.5",
      "Geographic_NAME":"United States",
      "Optional_LATITUDE1":"27.33931",
      "Optional_LONGITUDE1":"-108.60288",
      "Optional_NAME1":"China",
      "X-Parsed-By":[
         "org.apache.tika.parser.DefaultParser",
         "org.apache.tika.parser.geo.topic.GeoParser"
      ],
      "X-TIKA:parse_time_millis":"1634",
      "resourceName":"polar.geot"
   }
]
  1. Run Django server python manage.py runserver

  2. Open in browser http://localhost:8000/ Note : Please refer to the wiki page on this github repository which can act as a guide for you on how to use GeoParser.

Technologies we Use

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].