All Projects → Spark On K8s Operator → Similar Projects or Alternatives

637 Open source projects that are alternatives of or similar to Spark On K8s Operator

rabbitmq-operator

RabbitMQ Kubernetes operator

Stars: ✭ 16 (-99.1%)

Mutual labels: kubernetes-operator

Spark Redis

A connector for Spark that allows reading and writing to/from Redis cluster

Stars: ✭ 773 (-56.57%)

Mutual labels: spark

Iql

An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)

Stars: ✭ 341 (-80.84%)

Mutual labels: spark

Ytk Learn

Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).

Stars: ✭ 337 (-81.07%)

Mutual labels: spark

Angel

A Flexible and Powerful Parameter Server for large-scale machine learning

Stars: ✭ 6,458 (+262.81%)

Mutual labels: spark

Net.jgp.labs.spark

Apache Spark examples exclusively in Java

Stars: ✭ 55 (-96.91%)

Mutual labels: spark

Fast Mrmr

An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).

Stars: ✭ 67 (-96.24%)

Mutual labels: spark

Datahacksummit 2017

Apache Zeppelin notebooks for Recommendation Engines using Keras and Machine Learning on Apache Spark

Stars: ✭ 30 (-98.31%)

Mutual labels: apache-spark

Metering Operator

The Metering Operator is responsible for collecting metrics and other information about what's happening in a Kubernetes cluster, and providing a way to create reports on the collected data.

Stars: ✭ 320 (-82.02%)

Mutual labels: kubernetes-operator

tpch-spark

TPC-H queries in Apache Spark SQL using native DataFrames API

Stars: ✭ 63 (-96.46%)

Mutual labels: spark

Spark Movie Lens

An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset

Stars: ✭ 745 (-58.15%)

Mutual labels: spark

couchdb-operator

prototype kubernetes operator for couchDB

Stars: ✭ 17 (-99.04%)

Mutual labels: kubernetes-operator

Cdhproject

hadoop各组件使用，持续更新

Stars: ✭ 733 (-58.82%)

Mutual labels: spark

Kube Cleanup Operator

Kubernetes Operator to automatically delete completed Jobs and their Pods

Stars: ✭ 318 (-82.13%)

Mutual labels: kubernetes-operator

Sparkmagic

Jupyter magics and kernels for working with remote Spark clusters

Stars: ✭ 954 (-46.4%)

Mutual labels: spark

Pyspark Boilerplate

A boilerplate for writing PySpark Jobs

Stars: ✭ 318 (-82.13%)

Mutual labels: apache-spark

docker-spark

Apache Spark docker container image (Standalone mode)

Stars: ✭ 34 (-98.09%)

Mutual labels: spark

Frameless

Expressive types for Spark.

Stars: ✭ 717 (-59.72%)

Mutual labels: spark

Python Master Courses

人生苦短我用Python

Stars: ✭ 61 (-96.57%)

Mutual labels: spark

Setl

A simple Spark-powered ETL framework that just works 🍺

Stars: ✭ 79 (-95.56%)

Mutual labels: spark

spark-sql-flow-plugin

Visualize column-level data lineage in Spark SQL

Stars: ✭ 20 (-98.88%)

Mutual labels: spark

Elasticsearch Spark Recommender

Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch

Stars: ✭ 707 (-60.28%)

Mutual labels: spark

konsumerator

Kafka Consumer Operator. Kubernetes operator to manage consumers of unbalanced kafka topics with per-partition vertical autoscaling based on Prometheus metrics

Stars: ✭ 20 (-98.88%)

Mutual labels: kubernetes-operator

Utils4s

scala、spark使用过程中，各种测试用例以及相关资料整理

Stars: ✭ 1,070 (-39.89%)

Mutual labels: spark

shamash

Autoscaling for Google Cloud Dataproc

Stars: ✭ 31 (-98.26%)

Mutual labels: spark

Useractionanalyzeplatform

电商用户行为分析大数据平台

Stars: ✭ 645 (-63.76%)

Mutual labels: spark

Search Ads Web Service

Online search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]

Stars: ✭ 30 (-98.31%)

Mutual labels: spark

Repository

个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。

Stars: ✭ 92 (-94.83%)

Mutual labels: spark

Freestyle

A cohesive & pragmatic framework of FP centric Scala libraries

Stars: ✭ 627 (-64.78%)

Mutual labels: spark

spark-util

low-level helpers for Apache Spark libraries and tests

Stars: ✭ 16 (-99.1%)

Mutual labels: spark

Awesome Spark

A curated list of awesome Apache Spark packages and resources.

Stars: ✭ 1,061 (-40.39%)

Mutual labels: apache-spark

connected-component

Map Reduce Implementation of Connected Component on Apache Spark

Stars: ✭ 68 (-96.18%)

Mutual labels: apache-spark

H2o 3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

Stars: ✭ 5,656 (+217.75%)

Mutual labels: spark

siddhi-operator

Operator allows you to run stream processing logic directly on a Kubernetes cluster

Stars: ✭ 16 (-99.1%)

Mutual labels: kubernetes-operator

Wlm Operator

Singularity implementation of k8s operator for interacting with SLURM.

Stars: ✭ 78 (-95.62%)

Mutual labels: kubernetes-operator

spark-druid-olap

Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.

Stars: ✭ 286 (-83.93%)

Mutual labels: spark

Datafusion

DataFusion has now been donated to the Apache Arrow project

Stars: ✭ 611 (-65.67%)

Mutual labels: spark

swordfish

Open-source distribute workflow schedule tools, also support streaming task.

Stars: ✭ 35 (-98.03%)

Mutual labels: spark

Rabbitmq Operator

A Kubernetes Operator for RabbitMQ

Stars: ✭ 51 (-97.13%)

Mutual labels: kubernetes-operator

Spark-Ar

Resources for Spark AR

Stars: ✭ 43 (-97.58%)

Mutual labels: spark

Mongo Spark

The MongoDB Spark Connector

Stars: ✭ 588 (-66.97%)

Mutual labels: spark

Bigdata Notebook

Stars: ✭ 100 (-94.38%)

Mutual labels: spark

Logisland

Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.

Stars: ✭ 97 (-94.55%)

Mutual labels: spark

Sparklint

A tool for monitoring and tuning Spark jobs for efficiency.

Stars: ✭ 316 (-82.25%)

Mutual labels: spark

Kontextfrei

Writing application logic for Spark jobs that can be unit-tested without a SparkContext

Stars: ✭ 67 (-96.24%)

Mutual labels: spark

Pucket

Bucketing and partitioning system for Parquet