All Projects → lsds → StreamBench

lsds / StreamBench

Licence: Apache-2.0 license
Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark

Programming Languages

C++
36643 projects - #6 most used programming language
scala
5932 projects
CMake
9771 projects
shell
77523 projects

Projects that are alternatives of or similar to StreamBench

chatnoir-resiliparse
A robust web archive analytics toolkit
Stars: ✭ 26 (-50%)
Mutual labels:  bigdata
dt-sql-parser
SQL Parsers for BigData, built with antlr4.
Stars: ✭ 135 (+159.62%)
Mutual labels:  bigdata
amas
Amas is recursive acronym for “Amas, monitor alert system”.
Stars: ✭ 77 (+48.08%)
Mutual labels:  bigdata
TiBigData
TiDB connectors for Flink/Hive/Presto
Stars: ✭ 192 (+269.23%)
Mutual labels:  bigdata
greycat
GreyCat - Data Analytics, Temporal data, What-if, Live machine learning
Stars: ✭ 104 (+100%)
Mutual labels:  bigdata
dnstrace
Command-line DNS benchmark
Stars: ✭ 68 (+30.77%)
Mutual labels:  benchmark
PersonNotes
个人笔记集中营,快糙猛的形式记录技术性Notes .. 📚☕️⌨️🎧
Stars: ✭ 61 (+17.31%)
Mutual labels:  bigdata
zdh web
大数据采集,抽取平台
Stars: ✭ 292 (+461.54%)
Mutual labels:  bigdata
163-bigdate-note
bigdata note
Stars: ✭ 38 (-26.92%)
Mutual labels:  bigdata
QASMBench
QASMBench is an OpenQASM benchmark suite running on IBM Quantum-Experience backends.
Stars: ✭ 22 (-57.69%)
Mutual labels:  benchmark
bigdata-doc
大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (-28.85%)
Mutual labels:  bigdata
2019 egu workshop jupyter notebooks
Short course on interactive analysis of Big Earth Data with Jupyter Notebooks
Stars: ✭ 29 (-44.23%)
Mutual labels:  bigdata
compbench
⌛ Benchmark and visualization of various compression algorithms
Stars: ✭ 21 (-59.62%)
Mutual labels:  benchmark
awesome-coder-resources
编程路上加油站!------【持续更新中...欢迎star,欢迎常回来看看......】【内容:编程/学习/阅读资源,开源项目,面试题,网站,书,博客,教程等等】
Stars: ✭ 54 (+3.85%)
Mutual labels:  bigdata
jhdf
A pure Java HDF5 library
Stars: ✭ 83 (+59.62%)
Mutual labels:  bigdata
Clustering4Ever
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (+142.31%)
Mutual labels:  bigdata
utils
⚡ A collection of common functions but with better performance, less allocations and less dependencies created for Fiber.
Stars: ✭ 21 (-59.62%)
Mutual labels:  benchmark
the-apache-ignite-book
All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (+25%)
Mutual labels:  bigdata
scRNAseq cell cluster labeling
Scripts to run and benchmark scRNA-seq cell cluster labeling methods
Stars: ✭ 41 (-21.15%)
Mutual labels:  benchmark
moros
A modern http(s) benchmark tool
Stars: ✭ 14 (-73.08%)
Mutual labels:  benchmark

StreamBench

StreamBench is a project to measure the performance of popular streaming engines using Yahoo Streaming Benchmark.

Overview

We compare the performance of an efficient stream processing engine designed for single servers, SABER, with that achieved by popular distributed stream processing systems, Apache Spark and Apache Flink. We also compare the results to that by StreamBox, another recently proposed single-server design that emphases out-of-order processing of data. Based on our results, we argue that a single multicore server can provide better throughput than a multi-node cluster for many streaming applications. This opens an opportunity to cut down system complexity and operational costs by replacing cluster-based stream processing systems with (potentially replicated) single server deployments.

This repository contains code for running the Yahoo Streaming Benchmark in SABER, Spark Streaming, Apache Flink and StreamBox. For Spark and Flink, we follow the approach from previous blogposts by Databricks and DataArtisans. We provide a script for each of these engines to setup and run the benchmark on a single node. The code can be configured to run on a distributed deployment as well.

Benchmark Outline

The Yahoo Streaming Benchmark was designed to emulate an advertisement streaming application. It has a streaming query with four operators: filter, project, join (with relational data) and aggregate (a windowed count).

How to run the code

For every engine, the script provided installs, builds and runs the engines as well as the streaming query.

Credits

StreamBench is brought to you by George Theodorakis, Panagiotis Garefalakis, Alexandros Koliousis, Holger Pirk, Peter Pietzuch

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].