AbrisAvro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-41.18%)
GlowAn open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-28.05%)
Improved Body PartsSimple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation
Stars: ✭ 202 (-8.6%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-28.51%)
DiasporaA privacy-aware, distributed, open source social network.
Stars: ✭ 12,937 (+5753.85%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-31.22%)
PotteryRedis for humans. 🌎🌍🌏
Stars: ✭ 204 (-7.69%)
SparkmonitorMonitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-30.32%)
RoaringbitmapA better compressed bitset in Java
Stars: ✭ 2,460 (+1013.12%)
QuillCompile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+804.07%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-9.5%)
Spark.jlJulia binding for Apache Spark
Stars: ✭ 153 (-30.77%)
DkerasDistributed Keras Engine, Make Keras faster with only one line of code.
Stars: ✭ 181 (-18.1%)
Gym FxForex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress
Stars: ✭ 151 (-31.67%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-2.26%)
Spark TsneDistributed t-SNE via Apache Spark
Stars: ✭ 151 (-31.67%)
Sparkstreaming💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (-19%)
CookimDistributed web chat application base websocket built on akka.
Stars: ✭ 198 (-10.41%)
AztkAZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-31.22%)
Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (-33.48%)
ScannerlThe modular distributed fingerprinting engine
Stars: ✭ 208 (-5.88%)
Mysterium VpnDEPRECATED version of Mysterium dVPN app. Please look at mysterium-vpn-desktop instead.
Stars: ✭ 149 (-32.58%)
SparkFirely's open source FHIR server
Stars: ✭ 174 (-21.27%)
DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-33.48%)
DsockDistributed WebSocket broker
Stars: ✭ 197 (-10.86%)
FsynthWeb-based and pixels-based collaborative synthesizer
Stars: ✭ 146 (-33.94%)
Spoon🥄 A package for building specific Proxy Pool for different Sites.
Stars: ✭ 173 (-21.72%)
MachinReinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
Stars: ✭ 145 (-34.39%)
VernemqA distributed MQTT message broker based on Erlang/OTP. Built for high quality & Industrial use cases.
Stars: ✭ 2,628 (+1089.14%)
Nile.jsServer
Stars: ✭ 1,757 (+695.02%)
Idworkeridworker 是一个基于zookeeper和snowflake算法的分布式ID生成工具,通过zookeeper自动注册机器(最多1024台),无需手动指定workerId和datacenterId
Stars: ✭ 171 (-22.62%)
EnslavismA framework to manage distributed WebRTC servers that communicate with browser clients
Stars: ✭ 143 (-35.29%)
RasterframesGeospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-35.75%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+5455.2%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-36.65%)
Spark Knnk-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (-7.24%)
OnyxDistributed, masterless, high performance, fault tolerant data processing
Stars: ✭ 2,019 (+813.57%)
HerddbA JVM-embeddable Distributed Database
Stars: ✭ 192 (-13.12%)
Isolation ForestA Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-37.1%)
QuicksqlA Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+723.98%)
PysrSimple, fast, and parallelized symbolic regression in Python/Julia via regularized evolution and simulated annealing
Stars: ✭ 213 (-3.62%)
MsgfloDistributed Flow-Based Programming via message queues
Stars: ✭ 136 (-38.46%)
HorovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+5304.07%)
Zi5bookbook.zi5.me全站kindle电子书籍爬取,按照作者书籍名分类,每本书有mobi和equb两种格式,采用分布式进行全站爬取
Stars: ✭ 191 (-13.57%)
Whylogs JavaProfile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-25.79%)
OneflowOneFlow is a performance-centered and open-source deep learning framework.
Stars: ✭ 2,868 (+1197.74%)
OpaqueAn encrypted data analytics platform
Stars: ✭ 129 (-41.63%)
ArewedistributedyetWebsite + Community effort to unlock the peer-to-peer web at arewedistributedyet.com ⚡🌐🔑
Stars: ✭ 189 (-14.48%)
DiztlShare, discover & download files in your network 💥
Stars: ✭ 162 (-26.7%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+951.13%)