WeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!

Stars: ✭ 372 (+18.47%)

Mutual labels: scheduler, spark

mentos

Fresh Python Mesos Scheduler and Executor driver

Stars: ✭ 18 (-94.27%)

Mutual labels: scheduler, mesos

Elasticluster

Create clusters of VMs on the cloud and configure them with Ansible.

Stars: ✭ 298 (-5.1%)

Mutual labels: spark, cluster

mesos-framework

A wrapper around the Mesos HTTP APIs for Schedulers and Executors. Write your Mesos framework in pure JavaScript!

Stars: ✭ 61 (-80.57%)

Mutual labels: scheduler, mesos

Tensorflowonspark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

Stars: ✭ 3,748 (+1093.63%)

Mutual labels: spark, cluster

humpback-center

Humpback Center 主要为 Humpback 平台提供集群容器调度服务，以集群中心角色实现各个 Group 的容器分配管理。

Stars: ✭ 37 (-88.22%)

Mutual labels: cluster, scheduler

Sparkmagic

Jupyter magics and kernels for working with remote Spark clusters

Stars: ✭ 954 (+203.82%)

Mutual labels: spark, cluster

Dcos

DC/OS - The Datacenter Operating System

Stars: ✭ 2,316 (+637.58%)

Mutual labels: mesos, cluster

container-orchestration

A Benchmark for Container Orchestration Systems

Stars: ✭ 19 (-93.95%)

Mutual labels: cluster, mesos

Datafusion

DataFusion has now been donated to the Apache Arrow project

Stars: ✭ 611 (+94.59%)

Mutual labels: spark, cluster

Etcd Mesos

self-healing etcd on mesos!

Stars: ✭ 68 (-78.34%)

Mutual labels: mesos, cluster

josk

🏃🤖 Scheduler and manager for jobs and tasks in node.js on multi-server and clusters setup

Stars: ✭ 27 (-91.4%)

Mutual labels: cluster, scheduler

Goodreads etl pipeline

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

Stars: ✭ 793 (+152.55%)

Mutual labels: scheduler, spark

rundeck-nomad-plugin

Rundeck plugin running jobs on Nomad cluster.

Stars: ✭ 17 (-94.59%)

Mutual labels: cluster, scheduler

swordfish

Open-source distribute workflow schedule tools, also support streaming task.

Stars: ✭ 35 (-88.85%)

Mutual labels: spark, scheduler

K8s Tew

Kubernetes - The Easier Way

Stars: ✭ 269 (-14.33%)

Mutual labels: cluster

Spark Notebook

Interactive and Reactive Data Science using Scala and Spark.

Stars: ✭ 3,081 (+881.21%)

Mutual labels: spark

Fabrikate

Making GitOps with Kubernetes easier one component at a time

Stars: ✭ 263 (-16.24%)

Mutual labels: cluster

Datavec

ETL Library for Machine Learning - data pipelines, data munging and wrangling

Stars: ✭ 272 (-13.38%)

Mutual labels: spark

Dagster

An orchestration platform for the development, production, and observation of data assets.

Stars: ✭ 4,099 (+1205.41%)

Mutual labels: scheduler

Helk

The Hunting ELK

Stars: ✭ 3,097 (+886.31%)

Mutual labels: spark

Weave

A state-of-the-art multithreading runtime: message-passing based, fast, scalable, ultra-low overhead

Stars: ✭ 305 (-2.87%)

Mutual labels: scheduler

Reshifter

Kubernetes cluster state management

Stars: ✭ 292 (-7.01%)

Mutual labels: cluster

Docker Spark Cluster

A simple spark standalone cluster for your testing environment purposses

Stars: ✭ 261 (-16.88%)

Mutual labels: spark

Around Dataengineering

A Data Engineering & Machine Learning Knowledge Hub

Stars: ✭ 257 (-18.15%)

Mutual labels: spark

Sk Dist

Distributed scikit-learn meta-estimators in PySpark

Stars: ✭ 260 (-17.2%)

Mutual labels: spark

Awesome Raspberry Pi

curated list of projects with raspberry pi

Stars: ✭ 309 (-1.59%)

Mutual labels: cluster

Zat

Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark

Stars: ✭ 303 (-3.5%)

Mutual labels: spark

Jaas

Run jobs (tasks/one-shot containers) with Docker

Stars: ✭ 291 (-7.32%)

Mutual labels: cluster

Nixy

nixy - nginx auto configuration and service discovery for Mesos/Marathon

Stars: ✭ 259 (-17.52%)

Mutual labels: mesos

Spark Jupyter Aws

A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support

Stars: ✭ 259 (-17.52%)

Mutual labels: spark

Crate

CrateDB is a distributed SQL database that makes it simple to store and analyze massive amounts of data in real-time.

Stars: ✭ 3,254 (+936.31%)

Mutual labels: cluster

Succinct

Enabling queries on compressed data.

Stars: ✭ 257 (-18.15%)

Mutual labels: spark

Ej2 Javascript Ui Controls

Syncfusion JavaScript UI controls library offer more than 50+ cross-browser, responsive, and lightweight HTML5 UI controls for building modern web applications.

Stars: ✭ 256 (-18.47%)

Mutual labels: scheduler

Awesome Ada

A curated list of awesome resources related to the Ada and SPARK programming language

Stars: ✭ 299 (-4.78%)

Mutual labels: spark

Kube No Trouble

Easily check your cluster for use of deprecated APIs

Stars: ✭ 280 (-10.83%)

Mutual labels: cluster

Big Data Rosetta Code

Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code

Stars: ✭ 254 (-19.11%)

Mutual labels: spark

spark-structured-streaming-examples

Spark structured streaming examples with using of version 3.0.0

Stars: ✭ 23 (-92.68%)

Mutual labels: spark

Redisson

Redisson - Redis Java client with features of In-Memory Data Grid. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Publish / Subscribe, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, MyBatis, RPC, local cache ...

Stars: ✭ 17,972 (+5623.57%)

Mutual labels: scheduler

laravel-spark-camera

Profile Photo Camera support for Laravel Spark

Stars: ✭ 30 (-90.45%)

Mutual labels: spark

awake-action

Keep your free servers, clusters, dynos awaken (ex: heroku, mongodb, etc.)

Stars: ✭ 152 (-51.59%)

Mutual labels: cluster

Coolplayspark

酷玩 Spark: Spark 源代码解析、Spark 类库等

Stars: ✭ 3,318 (+956.69%)

Mutual labels: spark

Crayon

Simple framework agnostic UI router for SPAs

Stars: ✭ 310 (-1.27%)

Mutual labels: spark

Ftpgrab

Grab your files periodically from a remote FTP or SFTP server easily

Stars: ✭ 300 (-4.46%)

Mutual labels: scheduler

Spark Druid Olap

Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.

Stars: ✭ 282 (-10.19%)

Mutual labels: spark

sparkProjectTemplate.g8

Template for Spark Projects

Stars: ✭ 77 (-75.48%)

Mutual labels: spark

Broccoli

Broccoli - distributed task queues for ESP32 cluster

Stars: ✭ 280 (-10.83%)

Mutual labels: cluster

Book

本项目收藏这些年来看过或者听过的一些不错的书籍，在整理文件时看见这些，发现删掉有点可惜，放着又太浪费空间，本着分享的原则，就把它们共享出来，一方面给需要的读者提供这些书籍，另一方面也是一种像知识库的积累吧