Calcite
Apache Calcite
Stars: ✭ 2,816 (+18673.33%)
Mutual labels: big-data, hadoop, geospatial
Calcite Avatica
Mirror of Apache Calcite - Avatica
Stars: ✭ 130 (+766.67%)
Mutual labels: big-data, hadoop, geospatial
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (+753.33%)
Mutual labels: big-data, hadoop
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+10846.67%)
Mutual labels: big-data, hadoop
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+900%)
Mutual labels: big-data, hadoop
Drill
Apache Drill is a distributed MPP query layer for self-describing data
Stars: ✭ 1,619 (+10693.33%)
Mutual labels: big-data, hadoop
Hdfs Shell
HDFS Shell is an HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (+680%)
Mutual labels: big-data, hadoop
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (+833.33%)
Mutual labels: big-data, hadoop
Moosefs
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+6733.33%)
Mutual labels: big-data, hadoop
Geopyspark
GeoTrellis for PySpark
Stars: ✭ 167 (+1013.33%)
Mutual labels: big-data, geospatial
Bigdata Playground
A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+1080%)
Mutual labels: big-data, hadoop
Asakusafw
Asakusa Framework
Stars: ✭ 114 (+660%)
Mutual labels: big-data, hadoop
Bigdata Notes
A Beginner's Guide to Big Data ⭐
Stars: ✭ 10,991 (+73173.33%)
Mutual labels: big-data, hadoop
Richdem
High-performance Terrain and Hydrology Analysis
Stars: ✭ 127 (+746.67%)
Mutual labels: big-data, geospatial
Docker Spark Cluster
A Spark cluster setup running on Docker containers
Stars: ✭ 57 (+280%)
Mutual labels: big-data, hadoop
iis
Information Inference Service of the OpenAIRE system
Stars: ✭ 16 (+6.67%)
Mutual labels: big-data, hadoop
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+37606.67%)
Mutual labels: big-data, hadoop
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-66.67%)
Mutual labels: big-data, hadoop
Presto
The official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+86280%)
Mutual labels: big-data, hadoop
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+1333.33%)
Mutual labels: big-data, hadoop