adrianulbona / osm-parquetizer

License: Apache-2.0
A converter from OSM PBF files to Parquet files

OpenStreetMap Parquetizer

The project aims to make OpenStreetMap data available in a Big Data friendly format, namely Parquet.

Currently, any PBF file is converted into three Parquet files, one for each entity type in the original PBF (nodes, ways and relations).

To get started:

git clone https://github.com/adrianulbona/osm-parquetizer.git
cd osm-parquetizer
mvn clean package
java -jar target/osm-parquetizer-1.0.1-SNAPSHOT.jar path_to_your.pbf

For example, running:

java -jar target/osm-parquetizer-1.0.1-SNAPSHOT.jar romania-latest.osm.pbf

should leave you, after a few seconds (on a decent laptop), with the following files:

-rw-r--r--  1 adrianbona  adrianbona   145M Apr  3 19:57 romania-latest.osm.pbf
-rw-r--r--  1 adrianbona  adrianbona   372M Apr  3 19:58 romania-latest.osm.pbf.node.parquet
-rw-r--r--  1 adrianbona  adrianbona   1.1M Apr  3 19:58 romania-latest.osm.pbf.relation.parquet
-rw-r--r--  1 adrianbona  adrianbona   123M Apr  3 19:58 romania-latest.osm.pbf.way.parquet
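
The listing above suggests that each output file sits next to the input and is named by appending the entity type and a `.parquet` suffix to the input file name. The following is a minimal Java sketch of that naming pattern, inferred only from the listing. The class `ParquetOutputNames` and its methods are hypothetical helpers and are not part of osm-parquetizer.

```java
import java.nio.file.Path;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical helper, not part of osm-parquetizer. It only mirrors the
// file naming pattern visible in the directory listing.
final class ParquetOutputNames {

    // Entity types as they appear in the output file names.
    static final List<String> ENTITY_TYPES = List.of("node", "way", "relation");

    // Expected output path for one entity type, assumed to be a sibling of the input file.
    static Path outputFor(Path pbf, String entityType) {
        return pbf.resolveSibling(pbf.getFileName() + "." + entityType + ".parquet");
    }

    // Expected output paths for all three entity types.
    static List<Path> allOutputs(Path pbf) {
        return ENTITY_TYPES.stream()
                .map(type -> outputFor(pbf, type))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Defaults to the file name used in the example when no argument is given.
        Path pbf = Path.of(args.length > 0 ? args[0] : "romania-latest.osm.pbf");
        allOutputs(pbf).forEach(System.out::println);
    }
}
```

A helper like this can be handy for checking that all three files exist before a downstream job starts reading them.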

The parquet files have the following schemas:

node
 |-- id: long
 |-- version: integer
 |-- timestamp: long
 |-- changeset: long
 |-- uid: integer
 |-- user_sid: string
 |-- tags: array
 |    |-- element: struct
 |    |    |-- key: string
 |    |    |-- value: string
 |-- latitude: double
 |-- longitude: double

way
 |-- id: long
 |-- version: integer
 |-- timestamp: long
 |-- changeset: long
 |-- uid: integer
 |-- user_sid: string
 |-- tags: array
 |    |-- element: struct
 |    |    |-- key: string
 |    |    |-- value: string
 |-- nodes: array
 |    |-- element: struct
 |    |    |-- index: integer
 |    |    |-- nodeId: long

relation
 |-- id: long
 |-- version: integer
 |-- timestamp: long
 |-- changeset: long
 |-- uid: integer
 |-- user_sid: string
 |-- tags: array
 |    |-- element: struct
 |    |    |-- key: string
 |    |    |-- value: string
 |-- members: array
 |    |-- element: struct
 |    |    |-- id: long
 |    |    |-- role: string
 |    |    |-- type: string
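
Two details of these schemas affect how the data is consumed: tags are stored as an array of key/value structs rather than a map, and way nodes carry an explicit `index`. The Java sketch below mirrors the node and way schemas as plain in-memory records to illustrate both points. The record names (`OsmTag`, `OsmNode`, `OsmWayNode`, `OsmWay`) are hypothetical and are not the converter's own classes, the sample values are made up, and it is an assumption that `index` is the node's position within the way.

```java
import java.util.Comparator;
import java.util.List;
import java.util.Optional;
import java.util.stream.Collectors;

// Hypothetical in-memory mirrors of the schemas above (Java 16+ records).
record OsmTag(String key, String value) {}

record OsmNode(long id, int version, long timestamp, long changeset, int uid,
               String userSid, List<OsmTag> tags, double latitude, double longitude) {

    // Tags are a list of key/value pairs, so a lookup scans the list.
    Optional<String> tag(String key) {
        return tags.stream()
                .filter(t -> t.key().equals(key))
                .map(OsmTag::value)
                .findFirst();
    }
}

record OsmWayNode(int index, long nodeId) {}

record OsmWay(long id, int version, long timestamp, long changeset, int uid,
              String userSid, List<OsmTag> tags, List<OsmWayNode> nodes) {

    // Node ids sorted by index (assumed to be the position within the way).
    List<Long> orderedNodeIds() {
        return nodes.stream()
                .sorted(Comparator.comparingInt(OsmWayNode::index))
                .map(OsmWayNode::nodeId)
                .collect(Collectors.toList());
    }
}

final class OsmSchemaDemo {
    public static void main(String[] args) {
        // Made-up sample values, for illustration only.
        OsmNode node = new OsmNode(1L, 1, 0L, 0L, 0, "example_user",
                List.of(new OsmTag("amenity", "cafe"), new OsmTag("name", "Example")),
                45.0, 25.0);
        System.out.println(node.tag("amenity").orElse("<none>"));

        OsmWay way = new OsmWay(2L, 1, 0L, 0L, 0, "example_user", List.of(),
                List.of(new OsmWayNode(1, 20L), new OsmWayNode(0, 10L)));
        System.out.println(way.orderedNodeIds());
    }
}
```

Sorting by `index` is a defensive choice here: the schema does not say whether the `nodes` array is already stored in order, so the sketch does not rely on it.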