swordfish: Open-source distributed workflow scheduling tool that also supports streaming tasks.
Stars: ✭ 35 (-89.61%)
yuzhouwan: Code Library for My Blog
Stars: ✭ 39 (-88.43%)
Succinct: Enabling queries on compressed data.
Stars: ✭ 257 (-23.74%)
Spark Notebook: Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+814.24%)
Circuits: circuits is a lightweight, event-driven, asynchronous application framework for the Python programming language with a strong component architecture.
Stars: ✭ 256 (-24.04%)
Big Data Rosetta Code: Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code.
Stars: ✭ 254 (-24.63%)
Cook: Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark
Stars: ✭ 314 (-6.82%)
Redsync.go: *DEPRECATED* Please use https://gopkg.in/redsync.v1 (https://github.com/go-redsync/redsync)
Stars: ✭ 292 (-13.35%)
Joyrpc: High-performance, highly extensible Java RPC framework.
Stars: ✭ 290 (-13.95%)
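Book: This project collects some good books I have read or heard about over the years. I came across them while tidying up my files and felt that deleting them would be a pity, while keeping them takes up too much space. In the spirit of sharing, I am making them available, partly to provide these books to readers who need them and partly as a way of building up something like a knowledge base.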
Stars: ✭ 47 (-86.05%)
Coherence: Oracle Coherence Community Edition
Stars: ✭ 328 (-2.67%)
Crate: CrateDB is a distributed SQL database that makes it simple to store and analyze massive amounts of data in real-time.
Stars: ✭ 3,254 (+865.58%)
Clearly: Clearly see and debug your celery cluster in real time!
Stars: ✭ 287 (-14.84%)
hekate: Java Library for Distributed Services
Stars: ✭ 17 (-94.96%)
DiFacto2 ffm: Distributed Field-aware Factorization Machines based on Parameter Server
Stars: ✭ 11 (-96.74%)
Hadoop Book: Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
Stars: ✭ 3,317 (+884.27%)
Behemoth: Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
Stars: ✭ 286 (-15.13%)
spark-http-stream: Spark Structured Streaming via HTTP communication
Stars: ✭ 17 (-94.96%)
Phoenix: Peace of mind from prototype to production
Stars: ✭ 17,476 (+5085.76%)
daf-kylo: Kylo integration with PDND (previously DAF).
Stars: ✭ 20 (-94.07%)
dllib: dllib is a distributed deep learning library running on Apache Spark
Stars: ✭ 32 (-90.5%)
Diablo: Distributed Configuration Management Platform
Stars: ✭ 336 (-0.3%)
Xxl Job: A distributed task scheduling framework (XXL-JOB, a distributed task scheduling platform).
Stars: ✭ 20,197 (+5893.18%)
Coolplayspark: "Cool Play Spark", covering Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+884.57%)
Redisson: Redisson - Redis Java client with features of In-Memory Data Grid. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Publish / Subscribe, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, MyBatis, RPC, local cache ...
Stars: ✭ 17,972 (+5232.94%)
pulse: phData Pulse application log aggregation and monitoring
Stars: ✭ 13 (-96.14%)
Dgraph: Native GraphQL Database with graph backend
Stars: ✭ 17,127 (+4982.2%)
EvaEngine.js: A micro service development engine for node.js
Stars: ✭ 31 (-90.8%)
Kickstarter-Anticipator: This project aims to predict whether a given project will succeed or fail by applying a machine learning algorithm. Logistic regression is used to determine a project's success: the data is split into training and testing sets, and the model predicts which projects will be successful.
Stars: ✭ 13 (-96.14%)
Learningsparkv2: This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Stars: ✭ 307 (-8.9%)
Android Nosql: Lightweight, simple structured NoSQL database for Android
Stars: ✭ 284 (-15.73%)
parallax: A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.
Stars: ✭ 128 (-62.02%)
Spark Druid Olap: Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Stars: ✭ 282 (-16.32%)
spark-data-sources: Developing Spark External Data Sources using the V2 API
Stars: ✭ 36 (-89.32%)
hadoop-docker-lite: Docker build project to set up a lightweight hadoop cluster containing hadoop, pig, zookeeper, hbase, phoenix, storm, kafka, kafka manager
Stars: ✭ 24 (-92.88%)
Cntk: Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
Stars: ✭ 17,113 (+4978.04%)
Crayon: Simple framework-agnostic UI router for SPAs
Stars: ✭ 310 (-8.01%)
Fuku Ml: Simple, easy-to-use machine learning library
Stars: ✭ 280 (-16.91%)
prosto: Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-83.98%)
MServer: Mini Distributed Game Server
Stars: ✭ 49 (-85.46%)
Broccoli: Broccoli - distributed task queues for ESP32 cluster
Stars: ✭ 280 (-16.91%)
bigkube: Minikube for big data with Scala and Spark
Stars: ✭ 16 (-95.25%)
confluent-spark-avro: Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
Stars: ✭ 18 (-94.66%)
Delta: An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+1058.16%)
Daisyrec: A recommender system under development in PyTorch. Algorithms: KNN, LFM, SLIM, NeuMF, FM, DeepFM, VAE and so on. It aims at fair comparison for recommender system benchmarks.
Stars: ✭ 280 (-16.91%)
Covid19Tracker: A Robinhood style COVID-19 🦠 Android tracking app for the US. Open source and built with Kotlin.
Stars: ✭ 65 (-80.71%)
knit: Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
Stars: ✭ 53 (-84.27%)
Dotnetspider: DotnetSpider, a .NET standard web crawling library. It is a lightweight, efficient, and fast high-level web crawling & scraping framework
Stars: ✭ 3,233 (+859.35%)
blog: blog entries
Stars: ✭ 39 (-88.43%)
Meshbird: Meshbird is open-source cloud-native multi-region multi-cloud distributed private networking.
Stars: ✭ 3,401 (+909.2%)
Dota2 Predictor: Tool that predicts the outcome of a Dota 2 game using Machine Learning
Stars: ✭ 332 (-1.48%)
Gather Deployment: Gathers scalable tensorflow and infrastructure deployment
Stars: ✭ 326 (-3.26%)