SparkjniA heterogeneous Apache Spark framework.
Stars: ✭ 11 (-90%)
Mutual labels: spark, big-data
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-40.91%)
Mutual labels: spark, big-data
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+28643.64%)
Mutual labels: spark, big-data
ZeppelinWeb-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+4911.82%)
Mutual labels: spark, big-data
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-28.18%)
Mutual labels: spark, big-data
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+5041.82%)
Mutual labels: spark, big-data
Spark Doc ZhApache Spark 官方文档中文版
Stars: ✭ 1,126 (+923.64%)
Mutual labels: spark, big-data
BigdlBuilding Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+3366.36%)
Mutual labels: spark, big-data
Spark WebsiteApache Spark Website
Stars: ✭ 75 (-31.82%)
Mutual labels: spark, big-data
LabsResearch on distributed system
Stars: ✭ 73 (-33.64%)
Mutual labels: spark, big-data
MagellanGeo Spatial Data Analytics on Spark
Stars: ✭ 507 (+360.91%)
Mutual labels: spark, big-data
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-11.82%)
Mutual labels: spark, big-data
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+19943.64%)
Mutual labels: spark, big-data
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+577.27%)
Mutual labels: spark, big-data
Listenbrainz ServerServer for the ListenBrainz project
Stars: ✭ 420 (+281.82%)
Mutual labels: spark, big-data
Docker Spark ClusterA Spark cluster setup running on Docker containers
Stars: ✭ 57 (-48.18%)
Mutual labels: spark, big-data
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+229.09%)
Mutual labels: spark, big-data
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+228.18%)
Mutual labels: spark, big-data
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1116.36%)
Mutual labels: spark, big-data