All Projects → Ruby Spark → Similar Projects or Alternatives

752 Open source projects that are alternatives of or similar to Ruby Spark

Abris
Avro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-41.18%)
Mutual labels:  spark
Glow
An open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-28.05%)
Mutual labels:  spark
Improved Body Parts
Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation
Stars: ✭ 202 (-8.6%)
Mutual labels:  distributed
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-28.51%)
Mutual labels:  spark
Diaspora
A privacy-aware, distributed, open source social network.
Stars: ✭ 12,937 (+5753.85%)
Mutual labels:  distributed
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-31.22%)
Mutual labels:  spark
Pottery
Redis for humans. 🌎🌍🌏
Stars: ✭ 204 (-7.69%)
Mutual labels:  distributed
Sparkmonitor
Monitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-30.32%)
Mutual labels:  spark
Roaringbitmap
A better compressed bitset in Java
Stars: ✭ 2,460 (+1013.12%)
Mutual labels:  spark
Quill
Compile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+804.07%)
Mutual labels:  spark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-9.5%)
Mutual labels:  spark
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (-30.77%)
Mutual labels:  spark
Dkeras
Distributed Keras Engine, Make Keras faster with only one line of code.
Stars: ✭ 181 (-18.1%)
Mutual labels:  distributed
Gym Fx
Forex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress
Stars: ✭ 151 (-31.67%)
Mutual labels:  distributed
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-2.26%)
Mutual labels:  spark
Spark Tsne
Distributed t-SNE via Apache Spark
Stars: ✭ 151 (-31.67%)
Mutual labels:  spark
Sparkstreaming
💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (-19%)
Mutual labels:  spark
Spark Ml Source Analysis
spark ml 算法原理剖析以及具体的源码实现分析
Stars: ✭ 1,873 (+747.51%)
Mutual labels:  spark
Cookim
Distributed web chat application base websocket built on akka.
Stars: ✭ 198 (-10.41%)
Mutual labels:  distributed
Aztk
AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-31.22%)
Mutual labels:  spark
Spark Kafka Writer
Write your Spark data to Kafka seamlessly
Stars: ✭ 175 (-20.81%)
Mutual labels:  spark
Cc Pyspark
Process Common Crawl data with Python and Spark
Stars: ✭ 147 (-33.48%)
Mutual labels:  spark
Scannerl
The modular distributed fingerprinting engine
Stars: ✭ 208 (-5.88%)
Mutual labels:  distributed
Mysterium Vpn
DEPRECATED version of Mysterium dVPN app. Please look at mysterium-vpn-desktop instead.
Stars: ✭ 149 (-32.58%)
Mutual labels:  distributed
Spark
Firely's open source FHIR server
Stars: ✭ 174 (-21.27%)
Mutual labels:  spark
Datacompy
Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-33.48%)
Mutual labels:  spark
Dsock
Distributed WebSocket broker
Stars: ✭ 197 (-10.86%)
Mutual labels:  distributed
Fsynth
Web-based and pixels-based collaborative synthesizer
Stars: ✭ 146 (-33.94%)
Mutual labels:  distributed
Spoon
🥄 A package for building specific Proxy Pool for different Sites.
Stars: ✭ 173 (-21.72%)
Mutual labels:  distributed
Machin
Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
Stars: ✭ 145 (-34.39%)
Mutual labels:  distributed
Vernemq
A distributed MQTT message broker based on Erlang/OTP. Built for high quality & Industrial use cases.
Stars: ✭ 2,628 (+1089.14%)
Mutual labels:  distributed
Nile.js
Server
Stars: ✭ 1,757 (+695.02%)
Mutual labels:  distributed
Idworker
idworker 是一个基于zookeeper和snowflake算法的分布式ID生成工具,通过zookeeper自动注册机器(最多1024台),无需手动指定workerId和datacenterId
Stars: ✭ 171 (-22.62%)
Mutual labels:  distributed
Enslavism
A framework to manage distributed WebRTC servers that communicate with browser clients
Stars: ✭ 143 (-35.29%)
Mutual labels:  distributed
Rasterframes
Geospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-35.75%)
Mutual labels:  spark
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+5455.2%)
Mutual labels:  spark
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-36.65%)
Mutual labels:  spark
Spark Knn
k-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (-7.24%)
Mutual labels:  spark
Hazelcast Go Client
Hazelcast IMDG Go Client
Stars: ✭ 140 (-36.65%)
Mutual labels:  distributed
Onyx
Distributed, masterless, high performance, fault tolerant data processing
Stars: ✭ 2,019 (+813.57%)
Mutual labels:  distributed
Ecommercerecommendsystem
商品大数据实时推荐系统。前端:Vue + TypeScript + ElementUI,后端 Spring + Spark
Stars: ✭ 139 (-37.1%)
Mutual labels:  spark
Herddb
A JVM-embeddable Distributed Database
Stars: ✭ 192 (-13.12%)
Mutual labels:  distributed
Isolation Forest
A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-37.1%)
Mutual labels:  spark
Spark Iforest
Isolation Forest on Spark
Stars: ✭ 166 (-24.89%)
Mutual labels:  spark
Quicksql
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+723.98%)
Mutual labels:  spark
Pysr
Simple, fast, and parallelized symbolic regression in Python/Julia via regularized evolution and simulated annealing
Stars: ✭ 213 (-3.62%)
Mutual labels:  distributed
Msgflo
Distributed Flow-Based Programming via message queues
Stars: ✭ 136 (-38.46%)
Mutual labels:  distributed
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (-25.34%)
Mutual labels:  spark
Horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+5304.07%)
Mutual labels:  spark
Zi5book
book.zi5.me全站kindle电子书籍爬取,按照作者书籍名分类,每本书有mobi和equb两种格式,采用分布式进行全站爬取
Stars: ✭ 191 (-13.57%)
Mutual labels:  distributed
Aliyun Emapreduce Datasources
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Stars: ✭ 132 (-40.27%)
Mutual labels:  spark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-25.79%)
Mutual labels:  spark
Iot Traffic Monitor
Stars: ✭ 131 (-40.72%)
Mutual labels:  spark
Oneflow
OneFlow is a performance-centered and open-source deep learning framework.
Stars: ✭ 2,868 (+1197.74%)
Mutual labels:  distributed
Opaque
An encrypted data analytics platform
Stars: ✭ 129 (-41.63%)
Mutual labels:  spark
Arewedistributedyet
Website + Community effort to unlock the peer-to-peer web at arewedistributedyet.com ⚡🌐🔑
Stars: ✭ 189 (-14.48%)
Mutual labels:  distributed
Diztl
Share, discover & download files in your network 💥
Stars: ✭ 162 (-26.7%)
Mutual labels:  distributed
Linkis
Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+951.13%)
Mutual labels:  spark
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (-27.15%)
Mutual labels:  spark
Sagemaker Spark
A Spark library for Amazon SageMaker.
Stars: ✭ 219 (-0.9%)
Mutual labels:  spark
61-120 of 752 similar projects