752 open source projects that are alternatives to or similar to Ruby Spark

splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (-18.1%)
Mutual labels:  spark
Spark Lucenerdd
Spark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (-48.42%)
Mutual labels:  spark
visualize-data-with-python
A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
Stars: ✭ 60 (-72.85%)
Mutual labels:  spark
Crypto Dht
Blockchain over DHT in Go
Stars: ✭ 38 (-82.81%)
Mutual labels:  distributed
http bench
Golang HTTP stress test tool that supports single-node and distributed modes
Stars: ✭ 142 (-35.75%)
Mutual labels:  distributed
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (-30.77%)
Mutual labels:  spark
iris
Distributed streaming key-value storage
Stars: ✭ 55 (-75.11%)
Mutual labels:  distributed
Real Time Stream Processing Engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
Stars: ✭ 37 (-83.26%)
Mutual labels:  spark
litchi
A distributed Java game server framework
Stars: ✭ 97 (-56.11%)
Mutual labels:  distributed
Spark Mllib Twitter Sentiment Analysis
🌟 ✨ Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Stars: ✭ 113 (-48.87%)
Mutual labels:  spark
dtm
A distributed transaction framework that supports multiple languages, supports saga, tcc, xa, 2-phase message, outbox patterns.
Stars: ✭ 6,110 (+2664.71%)
Mutual labels:  distributed
Learning Spark
Learn Spark from scratch; big data learning
Stars: ✭ 37 (-83.26%)
Mutual labels:  spark
Dkeras
Distributed Keras Engine, Make Keras faster with only one line of code.
Stars: ✭ 181 (-18.1%)
Mutual labels:  distributed
Clearly
Clearly see and debug your celery cluster in real time!
Stars: ✭ 287 (+29.86%)
Mutual labels:  distributed
ddrt
An Elixir implementation of R-tree, optimized for fast updates.
Stars: ✭ 38 (-82.81%)
Mutual labels:  distributed
Weidentity
A blockchain-based distributed identity solution compliant with the W3C DID and Verifiable Credential specifications
Stars: ✭ 972 (+339.82%)
Mutual labels:  distributed
nova
Web framework for Erlang.
Stars: ✭ 175 (-20.81%)
Mutual labels:  distributed
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-49.32%)
Mutual labels:  spark
tensorpeers
Peer-to-peer (p2p) training of TensorFlow models
Stars: ✭ 57 (-74.21%)
Mutual labels:  distributed
Xxl Job Dotnet
xxl-job is a lightweight distributed task scheduling framework, and this package provides a dotnet executor client for it
Stars: ✭ 31 (-85.97%)
Mutual labels:  distributed
money
Dapper Style Distributed Tracing Instrumentation Libraries
Stars: ✭ 65 (-70.59%)
Mutual labels:  distributed
Gym Fx
Forex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress
Stars: ✭ 151 (-31.67%)
Mutual labels:  distributed
agent
hashtopolis.org
Stars: ✭ 19 (-91.4%)
Mutual labels:  distributed
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+331.67%)
Mutual labels:  spark
PeARS-orchard
This is the decentralised version of PeARS, the people's search engine, to be taken as Phase 1 of the fully distributed system.
Stars: ✭ 34 (-84.62%)
Mutual labels:  distributed
Archivespark
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Stars: ✭ 111 (-49.77%)
Mutual labels:  spark
Gringofts
Gringofts makes it easy to build a replicated, fault-tolerant, high throughput and distributed event-sourced system.
Stars: ✭ 84 (-61.99%)
Mutual labels:  distributed
Pucket
Bucketing and partitioning system for Parquet
Stars: ✭ 29 (-86.88%)
Mutual labels:  spark
pyrsia
Decentralized Package Network
Stars: ✭ 103 (-53.39%)
Mutual labels:  distributed
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-2.26%)
Mutual labels:  spark
oceanbase
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
Stars: ✭ 4,466 (+1920.81%)
Mutual labels:  distributed
Lethean Vpn
Lethean Virtual Private Network (VPN)
Stars: ✭ 29 (-86.88%)
Mutual labels:  distributed
Cherry-Node
Cherry Network's node implemented in Rust
Stars: ✭ 72 (-67.42%)
Mutual labels:  distributed
Memo
The memo elastic and resilient key-value store.
Stars: ✭ 111 (-49.77%)
Mutual labels:  distributed
semagrow
A SPARQL query federator of heterogeneous data sources
Stars: ✭ 27 (-87.78%)
Mutual labels:  distributed
Spark
Apache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+14206.79%)
Mutual labels:  spark
flowgraph
Flowgraph package for scalable asynchronous system development
Stars: ✭ 51 (-76.92%)
Mutual labels:  distributed
Spark Tsne
Distributed t-SNE via Apache Spark
Stars: ✭ 151 (-31.67%)
Mutual labels:  spark
dnr-editor
Distributed Data-Flow Coordination Platform Based on Node-RED
Stars: ✭ 72 (-67.42%)
Mutual labels:  distributed
Interview Questions Collection
Interview questions organized by knowledge area, including C++, Java, Hadoop, machine learning, and more
Stars: ✭ 21 (-90.5%)
Mutual labels:  spark
insightedge
InsightEdge Core
Stars: ✭ 22 (-90.05%)
Mutual labels:  distributed
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+739.82%)
Mutual labels:  spark
monitor-merlin
Module for Effortless Redundancy and Loadbalancing In Naemon
Stars: ✭ 21 (-90.5%)
Mutual labels:  distributed
Flint
A Time Series Library for Apache Spark
Stars: ✭ 878 (+297.29%)
Mutual labels:  spark
NScrapy
NScrapy is a .NET Core cross-platform distributed spider framework that provides an easy way to write your own spider
Stars: ✭ 88 (-60.18%)
Mutual labels:  distributed
Sparkstreaming
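💥 🚀 Wraps sparkstreaming to adjust batch time dynamically (computation runs whenever data is available); 🚀 supports adding and removing topics at runtime; 🚀 wraps sparkstreaming 1.6 - kafka 010 to support SSL.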
Stars: ✭ 179 (-19%)
Mutual labels:  spark
docs
Documentation repo of nebula orchestration system
Stars: ✭ 16 (-92.76%)
Mutual labels:  distributed
Tedsds
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-93.67%)
Mutual labels:  spark
Raft-Paxos-Sample
Implementation of the distributed consensus algorithms from MIT 6.824: Raft & Paxos
Stars: ✭ 37 (-83.26%)
Mutual labels:  distributed
Bigdataclass
Two-day workshop that covers how to use R to interact with databases and Spark
Stars: ✭ 110 (-50.23%)
Mutual labels:  spark
Urhox
Urho3D extension library
Stars: ✭ 13 (-94.12%)
Mutual labels:  spark
Sagemaker Spark
A Spark library for Amazon SageMaker.
Stars: ✭ 219 (-0.9%)
Mutual labels:  spark
Spark Excel
A Spark plugin for reading Excel files via Apache POI
Stars: ✭ 216 (-2.26%)
Mutual labels:  spark
Hydro Serving
MLOps Platform
Stars: ✭ 213 (-3.62%)
Mutual labels:  spark
Javaorbigdata Interview
Interview knowledge points compiled for Java developers and big data developers
Stars: ✭ 203 (-8.14%)
Mutual labels:  spark
Gateway
🚀 Building distributed instant messaging and push notification systems.
Stars: ✭ 188 (-14.93%)
Mutual labels:  distributed
9volt
A modern, distributed monitoring system written in Go
Stars: ✭ 160 (-27.6%)
Mutual labels:  distributed
Openuba
A robust and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-42.53%)
Mutual labels:  spark
Meissa
Cross-platform Distributed Test Runner. Executes tests in parallel, time balanced on multiple machines.
Stars: ✭ 66 (-70.14%)
Mutual labels:  distributed
Phoenix
Peace of mind from prototype to production
Stars: ✭ 17,476 (+7807.69%)
Mutual labels:  distributed