rippleSimple shared surface streaming application
Stars: ✭ 17 (-86.61%)
HazelcastOpen-source distributed computation and storage platform
Stars: ✭ 4,662 (+3570.87%)
tutorialTutorials to help you build your first Swim app
Stars: ✭ 27 (-78.74%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+4077.17%)
trading sim📈📆 Backtest trading strategies concurrently using historical chart data from various financial exchanges.
Stars: ✭ 21 (-83.46%)
HelicalinsightHelical Insight software is world’s first Open Source Business Intelligence framework which helps you to make sense out of your data and make well informed decisions.
Stars: ✭ 214 (+68.5%)
GoaccessGoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
Stars: ✭ 14,096 (+10999.21%)
realtimemap-dotnetA showcase for Proto.Actor - an ultra-fast distributed actors solution for Go, C#, and Java/Kotlin.
Stars: ✭ 47 (-62.99%)
SleuthA Go library for master-less peer-to-peer autodiscovery and RPC between HTTP services
Stars: ✭ 331 (+160.63%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+3507.09%)
Platon GoGolang implementation of the PlatON protocol
Stars: ✭ 331 (+160.63%)
DatasciencevmTools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (+20.47%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+19.69%)
talariaTalariaDB is a distributed, highly available, and low latency time-series database for Presto
Stars: ✭ 148 (+16.54%)
NakedtensorBare bone examples of machine learning in TensorFlow
Stars: ✭ 2,443 (+1823.62%)
IoTPyPython for streams
Stars: ✭ 24 (-81.1%)
transitMassively real-time city transit streaming application
Stars: ✭ 20 (-84.25%)
MementoSimple + Powerful interface to the Mnesia Distributed Database 💾
Stars: ✭ 597 (+370.08%)
LizardfsLizardFS is an Open Source Distributed File System licensed under GPLv3.
Stars: ✭ 793 (+524.41%)
Protoactor DotnetProto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Stars: ✭ 1,070 (+742.52%)
Distributedsystem Series📚 深入浅出分布式基础架构,Linux 与操作系统篇 | 分布式系统篇 | 分布式计算篇 | 数据库篇 | 网络篇 | 虚拟化与编排篇 | 大数据与云计算篇
Stars: ✭ 1,092 (+759.84%)
ParapetA purely functional library to build distributed and event-driven systems
Stars: ✭ 106 (-16.54%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+18.11%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-11.02%)
Data Science Live BookAn open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (+51.97%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-14.17%)
Selinon An advanced distributed task flow management on top of Celery
Stars: ✭ 237 (+86.61%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+85.04%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-15.75%)
SwellrtSwellRT main project. Server, JavaScript and Java clients
Stars: ✭ 205 (+61.42%)
trafficMassively real-time traffic streaming application
Stars: ✭ 25 (-80.31%)
golearn🔥 Golang basics and actual-combat (including: crawler, distributed-systems, data-analysis, redis, etcd, raft, crontab-task)
Stars: ✭ 36 (-71.65%)
protoactor-goProto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Stars: ✭ 4,138 (+3158.27%)
dislibThe Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.
Stars: ✭ 39 (-69.29%)
GleamFast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or distributedly.
Stars: ✭ 2,949 (+2222.05%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+953.54%)
CortxCORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+235.43%)
nebulaA distributed, fast open-source graph database featuring horizontal scalability and high availability
Stars: ✭ 8,196 (+6353.54%)
TitanoboaTitanoboa makes complex workflows easy. It is a low-code workflow orchestration platform for JVM - distributed, highly scalable and fault tolerant.
Stars: ✭ 787 (+519.69%)
SwimDistributed software platform for building stateful, massively real-time streaming applications.
Stars: ✭ 368 (+189.76%)
Awesome ScalabilityThe Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Stars: ✭ 36,688 (+28788.19%)
ConstructJavaScript Digital Organisms simulator
Stars: ✭ 17 (-86.61%)
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+185.04%)
QixMachine Learning、Deep Learning、PostgreSQL、Distributed System、Node.Js、Golang
Stars: ✭ 13,740 (+10718.9%)
GosirisAn actor framework for Go
Stars: ✭ 222 (+74.8%)
Scalecube ClusterScaleCube Cluster is a lightweight Java VM implementation of SWIM: Scalable Weakly-consistent Infection-style Process Group Membership Protocol. features cluster membership, failure detection, and gossip protocol library.
Stars: ✭ 119 (-6.3%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-37.8%)
DiplomatA HTTP Ruby API for Consul
Stars: ✭ 358 (+181.89%)
GenieDistributed Big Data Orchestration Service
Stars: ✭ 1,544 (+1115.75%)
pyspark-algorithmsPySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (-43.31%)
rceDistributed, workflow-driven integration environment
Stars: ✭ 42 (-66.93%)