SparklensQubole Sparklens tool for performance tuning Apache Spark
Stars: ✭ 345 (+9.87%)
Docker practiceLearn and understand Docker technologies, with real DevOps practice!
Stars: ✭ 19,768 (+6195.54%)
rhythmTime-based job scheduler for Apache Mesos
Stars: ✭ 30 (-90.45%)
urb-k8sKubernetes adapter for Universal Resource Broker
Stars: ✭ 19 (-93.95%)
schedulerMaintenance fork of Apache Aurora's Scheduler
Stars: ✭ 21 (-93.31%)
fastdata-clusterFast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-93.63%)
MinimesosThe experimentation and testing tool for Apache Mesos - NO LONGER MAINTAINED!
Stars: ✭ 429 (+36.62%)
DaskosApache Mesos backend for Dask scheduling library
Stars: ✭ 28 (-91.08%)
WedatasphereWeDataSphere is a financial-grade, one-stop open-source suite for big data platforms. The source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+18.47%)
mentosFresh Python Mesos Scheduler and Executor driver
Stars: ✭ 18 (-94.27%)
ElasticlusterCreate clusters of VMs on the cloud and configure them with Ansible.
Stars: ✭ 298 (-5.1%)
mesos-frameworkA wrapper around the Mesos HTTP APIs for Schedulers and Executors. Write your Mesos framework in pure JavaScript!
Stars: ✭ 61 (-80.57%)
TensorflowonsparkTensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Stars: ✭ 3,748 (+1093.63%)
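humpback-centerHumpback Center provides cluster container scheduling services for the Humpback platform, acting as the cluster center that manages container allocation for each Group.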
Stars: ✭ 37 (-88.22%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+203.82%)
DcosDC/OS - The Datacenter Operating System
Stars: ✭ 2,316 (+637.58%)
DatafusionDataFusion has now been donated to the Apache Arrow project
Stars: ✭ 611 (+94.59%)
Etcd Mesosself-healing etcd on mesos!
Stars: ✭ 68 (-78.34%)
josk🏃🤖 Scheduler and manager for jobs and tasks in node.js on multi-server and cluster setups
Stars: ✭ 27 (-91.4%)
Goodreads etl pipelineAn end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+152.55%)
swordfishOpen-source distributed workflow scheduling tool that also supports streaming tasks.
Stars: ✭ 35 (-88.85%)
K8s TewKubernetes - The Easier Way
Stars: ✭ 269 (-14.33%)
Spark NotebookInteractive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+881.21%)
FabrikateMaking GitOps with Kubernetes easier one component at a time
Stars: ✭ 263 (-16.24%)
DatavecETL Library for Machine Learning - data pipelines, data munging and wrangling
Stars: ✭ 272 (-13.38%)
DagsterAn orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+1205.41%)
HelkThe Hunting ELK
Stars: ✭ 3,097 (+886.31%)
WeaveA state-of-the-art multithreading runtime: message-passing based, fast, scalable, ultra-low overhead
Stars: ✭ 305 (-2.87%)
ReshifterKubernetes cluster state management
Stars: ✭ 292 (-7.01%)
Docker Spark ClusterA simple Spark standalone cluster for your testing environment purposes
Stars: ✭ 261 (-16.88%)
Sk DistDistributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (-17.2%)
ZatZeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-3.5%)
JaasRun jobs (tasks/one-shot containers) with Docker
Stars: ✭ 291 (-7.32%)
Nixynixy - nginx auto configuration and service discovery for Mesos/Marathon
Stars: ✭ 259 (-17.52%)
Spark Jupyter AwsA guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (-17.52%)
CrateCrateDB is a distributed SQL database that makes it simple to store and analyze massive amounts of data in real-time.
Stars: ✭ 3,254 (+936.31%)
SuccinctEnabling queries on compressed data.
Stars: ✭ 257 (-18.15%)
Ej2 Javascript Ui ControlsSyncfusion JavaScript UI controls library offers more than 50 cross-browser, responsive, and lightweight HTML5 UI controls for building modern web applications.
Stars: ✭ 256 (-18.47%)
Awesome AdaA curated list of awesome resources related to the Ada and SPARK programming languages
Stars: ✭ 299 (-4.78%)
Kube No TroubleEasily check your cluster for use of deprecated APIs
Stars: ✭ 280 (-10.83%)
Big Data Rosetta CodeCode snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Stars: ✭ 254 (-19.11%)
RedissonRedisson - Redis Java client with features of In-Memory Data Grid. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Publish / Subscribe, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, MyBatis, RPC, local cache ...
Stars: ✭ 17,972 (+5623.57%)
awake-actionKeep your free servers, clusters, and dynos awake (e.g. Heroku, MongoDB)
Stars: ✭ 152 (-51.59%)
CoolplaysparkCoolplay Spark: Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+956.69%)
CrayonSimple framework agnostic UI router for SPAs
Stars: ✭ 310 (-1.27%)
FtpgrabGrab your files periodically from a remote FTP or SFTP server easily
Stars: ✭ 300 (-4.46%)
Spark Druid OlapSparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Stars: ✭ 282 (-10.19%)
BroccoliBroccoli - distributed task queues for ESP32 cluster
Stars: ✭ 280 (-10.83%)
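BookThis project collects some good books I have read or heard about over the years. I came across them while organizing files: deleting them seemed a pity, and keeping them unused wasted space, so in the spirit of sharing I am making them available, both to provide these books to readers who need them and to build up a kind of knowledge base.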
Stars: ✭ 47 (-85.03%)
CloudflowCloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Stars: ✭ 278 (-11.46%)
sgiSocket Gateway Interface
Stars: ✭ 16 (-94.9%)