All Projects → Spark Doc Zh → Similar Projects or Alternatives

1281 Open source projects that are alternatives of or similar to Spark Doc Zh

Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-93.69%)
Mutual labels:  spark, big-data
Geopyspark
GeoTrellis for PySpark
Stars: ✭ 167 (-85.17%)
Mutual labels:  spark, big-data
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (-86.41%)
Mutual labels:  spark, big-data
Succinct
Enabling queries on compressed data.
Stars: ✭ 257 (-77.18%)
Mutual labels:  spark, big-data
Delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+246.63%)
Mutual labels:  spark, big-data
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-67.94%)
Mutual labels:  spark, big-data
Feast
Feature Store for Machine Learning
Stars: ✭ 2,576 (+128.77%)
Mutual labels:  spark, big-data
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+45.83%)
Mutual labels:  spark, big-data
Docker Spark Cluster
A Spark cluster setup running on Docker containers
Stars: ✭ 57 (-94.94%)
Mutual labels:  spark, big-data
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-98.76%)
Mutual labels:  big-data, spark
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-80.82%)
Mutual labels:  spark, big-data
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+402.31%)
Mutual labels:  spark, big-data
leaflet heatmap
简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-98.85%)
Mutual labels:  big-data, spark
Zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+389.61%)
Mutual labels:  spark, big-data
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-80.91%)
Mutual labels:  spark, big-data
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-91.39%)
Mutual labels:  spark, big-data
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+876.11%)
Mutual labels:  spark, big-data
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-86.68%)
Mutual labels:  spark, big-data
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-86.5%)
Mutual labels:  spark, big-data
Koalas
Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+170.34%)
Mutual labels:  spark, big-data
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-90.41%)
Mutual labels:  documentation, spark
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-90.14%)
Mutual labels:  big-data, spark
awesome-AI-kubernetes
❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (-91.56%)
Mutual labels:  big-data, spark
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+1858.08%)
Mutual labels:  spark, big-data
Magellan
Geo Spatial Data Analytics on Spark
Stars: ✭ 507 (-54.97%)
Mutual labels:  spark, big-data
Storm Doc Zh
Apache Storm 官方文档中文版
Stars: ✭ 142 (-87.39%)
Mutual labels:  documentation, big-data
Sparkjni
A heterogeneous Apache Spark framework.
Stars: ✭ 11 (-99.02%)
Mutual labels:  spark, big-data
Hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (-78.15%)
Mutual labels:  spark, big-data
spark-acid
ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-91.92%)
Mutual labels:  big-data, spark
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-78.06%)
Mutual labels:  spark, big-data
Bigdl
Building Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+238.63%)
Mutual labels:  spark, big-data
Listenbrainz Server
Server for the ListenBrainz project
Stars: ✭ 420 (-62.7%)
Mutual labels:  spark, big-data
Rsparkling
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-94.23%)
Mutual labels:  spark, big-data
Labs
Research on distributed system
Stars: ✭ 73 (-93.52%)
Mutual labels:  spark, big-data
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+18.83%)
Mutual labels:  spark, big-data
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-92.98%)
Mutual labels:  spark, big-data
Bigdataclass
Two-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-90.23%)
Mutual labels:  spark, big-data
Spark Website
Apache Spark Website
Stars: ✭ 75 (-93.34%)
Mutual labels:  spark, big-data
Sparkling Graph
SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Stars: ✭ 139 (-87.66%)
Mutual labels:  spark, big-data
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (-87.83%)
Mutual labels:  spark, big-data
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+157.46%)
Mutual labels:  spark, big-data
Sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (-67.85%)
Mutual labels:  spark, big-data
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (-33.84%)
Mutual labels:  spark, big-data
Spark
Apache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+2707.99%)
Mutual labels:  spark, big-data
Rust By Example Ext
Rust by Example -- Extended Edition
Stars: ✭ 56 (-95.03%)
Mutual labels:  documentation
Rxswift Chinese Documentation
RxSwift 中文文档
Stars: ✭ 1,107 (-1.69%)
Mutual labels:  documentation
Net.jgp.labs.spark
Apache Spark examples exclusively in Java
Stars: ✭ 55 (-95.12%)
Mutual labels:  spark
Settingsguide
More extensive explanations of Cura slicing settings.
Stars: ✭ 55 (-95.12%)
Mutual labels:  documentation
Django Chinese Docs 18
📖 [译] django 中文文档协作翻译计划
Stars: ✭ 61 (-94.58%)
Mutual labels:  documentation
Nexmo Developer
Provides resources for developers using Nexmo API platforms
Stars: ✭ 59 (-94.76%)
Mutual labels:  documentation
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-95.12%)
Mutual labels:  spark
Swiftmarkup
Parses Swift documentation comments into structured entities
Stars: ✭ 55 (-95.12%)
Mutual labels:  documentation
Verticapy
VerticaPy is a Python library that exposes sci-kit like functionality to conduct data science projects on data stored in Vertica, thus taking advantage Vertica’s speed and built-in analytics and machine learning capabilities.
Stars: ✭ 59 (-94.76%)
Mutual labels:  big-data
Autoobjectdocumentation
Auto Object Documentation - JavaScript
Stars: ✭ 54 (-95.2%)
Mutual labels:  documentation
Docs
Documentation for OpenPOWER Firmware
Stars: ✭ 54 (-95.2%)
Mutual labels:  documentation
Hugo Book
Hugo documentation theme as simple as plain book
Stars: ✭ 1,115 (-0.98%)
Mutual labels:  documentation
Docs
Documentation for The Things Network
Stars: ✭ 61 (-94.58%)
Mutual labels:  documentation
Rest Hapi
🚀 A RESTful API generator for Node.js
Stars: ✭ 1,102 (-2.13%)
Mutual labels:  documentation
Redux In Spanish
Traducción al español de la documentación de Redux.
Stars: ✭ 54 (-95.2%)
Mutual labels:  documentation
Docker Hadoop
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-95.2%)
Mutual labels:  spark
1-60 of 1281 similar projects