Geopyspark: GeoTrellis for PySpark
Stars: ✭ 167 (-67.06%)
Geotools: Official GeoTools repository
Stars: ✭ 1,109 (+118.74%)
Data Science Ipython Notebooks: Data science Python notebooks covering deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+4248.72%)
krawler: A minimalist (geospatial) ETL
Stars: ✭ 51 (-89.94%)
pyGISS: 📡 A lightweight GIS software in fewer than 100 lines of code
Stars: ✭ 114 (-77.51%)
Setl: A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-84.42%)
Feast: Feature Store for Machine Learning
Stars: ✭ 2,576 (+408.09%)
Sparkrdma: RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-57.59%)
Gimel: Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-57.4%)
awesome-AI-kubernetes: ❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++, etc.
Stars: ✭ 95 (-81.26%)
xyz-hub: XYZ Hub is a RESTful web service for the access and management of geospatial data.
Stars: ✭ 43 (-91.52%)
Go Geom: Package geom implements efficient geometry types for geospatial applications.
Stars: ✭ 456 (-10.06%)
Labs: Research on distributed systems
Stars: ✭ 73 (-85.6%)
Bigdataclass: Two-day workshop that covers how to use R to interact with databases and Spark
Stars: ✭ 110 (-78.3%)
Spark Py Notebooks: Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+163.91%)
Geni: A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-70.02%)
Mmlspark: Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+471.79%)
Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+500.39%)
geojson: GeoJSON classes for R
Stars: ✭ 32 (-93.69%)
xyz-spaces-python: Manage your XYZ Hub or HERE Data Hub spaces from Python.
Stars: ✭ 29 (-94.28%)
GeoJSON.jl: Utilities for working with GeoJSON data in Julia
Stars: ✭ 46 (-90.93%)
spark-acid: ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-82.05%)
Orb: Types and utilities for working with 2D geometry in Golang
Stars: ✭ 378 (-25.44%)
Election Geodata: Precinct shapes (and vote results) for US elections past, present, and future
Stars: ✭ 289 (-43%)
Sparkler: Spark-Crawler, an Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (-28.6%)
Rsparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-87.18%)
Logisland: Scalable stream processing platform for advanced real-time analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink is on the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of ready-to-use processors, data sources and sinks is available.
Stars: ✭ 97 (-80.87%)
Gaffer: A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+223.87%)
Spark.jl: Julia binding for Apache Spark
Stars: ✭ 153 (-69.82%)
Spark With Python: Fundamentals of Spark with Python (using PySpark), with code examples
Stars: ✭ 150 (-70.41%)
Sparkling Graph: SparklingGraph provides an easy-to-use set of features for processing large-scale graphs using Spark and GraphX.
Stars: ✭ 139 (-72.58%)
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience for creating, editing, and managing Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-51.28%)
Hyperspace: An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (-51.48%)
de9im: DE-9IM spatial predicate library implemented in JavaScript.
Stars: ✭ 22 (-95.66%)
Spark: Apache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+6136.29%)
turf-go: A Go language port of Turf.js
Stars: ✭ 41 (-91.91%)
pygeoif: Basic implementation of the __geo_interface__
Stars: ✭ 44 (-91.32%)
pyturf: A modular geospatial engine written in Python
Stars: ✭ 15 (-97.04%)
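leaflet heatmap: A simple visualization of Huzhou phone-call data. Assuming the data volume is too large for the browser to draw the heat map directly, the heat-map rendering step is moved to offline computation and analysis. Apache Spark first processes the data in parallel and then renders the heat map; leafletjs then loads the OpenStreetMap layer and the heat-map layer to give good interactivity. The rendering is currently implemented with Apache Spark, but the parallel computation is slower than the single-machine one, possibly because Apache Spark is not well suited to this kind of computation or because I did not design the algorithm well. The Apache Spark heat-map rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .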
Stars: ✭ 13 (-97.44%)
ibmpairs: Open source tools for interaction with IBM PAIRS
Stars: ✭ 23 (-95.46%)
geojson: Library for serializing the GeoJSON vector GIS file format
Stars: ✭ 171 (-66.27%)
Succinct: Enabling queries on compressed data.
Stars: ✭ 257 (-49.31%)
GeoConvert: Converting between GeoJSON and GIS file formats
Stars: ✭ 32 (-93.69%)
Delta: An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+669.82%)
bigdata-fun: A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-97.24%)
Blendergis: Blender add-ons that bridge Blender and geographic data
Stars: ✭ 4,642 (+815.58%)
Spark Movie Lens: An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+46.94%)
Sparkjni: A heterogeneous Apache Spark framework.
Stars: ✭ 11 (-97.83%)
rastercube: rastercube is a Python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-97.04%)
aut: The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-78.11%)
Metorikku: A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-28.8%)