All Projects → bytefish → FlinkExperiments

bytefish / FlinkExperiments

Licence: MIT license
Experiments with Apache Flink.

Programming Languages

java
68154 projects - #9 most used programming language
Batchfile
5799 projects
shell
77523 projects
powershell
5483 projects

Projects that are alternatives of or similar to FlinkExperiments

flink-connectors
Apache Flink connectors for Pravega.
Stars: ✭ 84 (+2700%)
Mutual labels:  stream-processing, flink
Flink Sql Cookbook
The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.
Stars: ✭ 189 (+6200%)
Mutual labels:  stream-processing, flink
Flink Learning
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+379166.67%)
Mutual labels:  stream-processing, flink
open-stream-processing-benchmark
This repository contains the code base for the Open Stream Processing Benchmark.
Stars: ✭ 37 (+1133.33%)
Mutual labels:  stream-processing, flink
Streaming Readings
Streaming System 相关的论文读物
Stars: ✭ 554 (+18366.67%)
Mutual labels:  stream-processing, flink
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (+1733.33%)
Mutual labels:  stream-processing, flink
Pulsar Flink
Elastic data processing with Apache Pulsar and Apache Flink
Stars: ✭ 126 (+4100%)
Mutual labels:  stream-processing, flink
Media Stream Library Js
JavaScript library to handle media streams on the command line (Node.js) and in the browser.
Stars: ✭ 192 (+6300%)
Mutual labels:  stream-processing
Amadeus
Harmonious distributed data analysis in Rust.
Stars: ✭ 240 (+7900%)
Mutual labels:  stream-processing
Hstream
The streaming database built for IoT data storage and real-time processing in the 5G Era
Stars: ✭ 166 (+5433.33%)
Mutual labels:  stream-processing
Akka Stream Contrib
Add-ons to Akka Stream
Stars: ✭ 173 (+5666.67%)
Mutual labels:  stream-processing
fdp-modelserver
An umbrella project for multiple implementations of model serving
Stars: ✭ 47 (+1466.67%)
Mutual labels:  flink
xlstream
Turns XLSX into a readable stream.
Stars: ✭ 148 (+4833.33%)
Mutual labels:  stream-processing
Fluent Bit
Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX and Windows
Stars: ✭ 3,223 (+107333.33%)
Mutual labels:  stream-processing
Dpark
Python clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+88833.33%)
Mutual labels:  stream-processing
Logrange
High performance data aggregating storage
Stars: ✭ 181 (+5933.33%)
Mutual labels:  stream-processing
Lidea
大型分布式系统实时监控平台
Stars: ✭ 28 (+833.33%)
Mutual labels:  flink
Fero
light, fast, scalable, streaming microservices made easy
Stars: ✭ 175 (+5733.33%)
Mutual labels:  stream-processing
Core
Build platforms that flexibly mix SQL, batch, and stream processing paradigms
Stars: ✭ 231 (+7600%)
Mutual labels:  stream-processing
frizzle
The magic message bus
Stars: ✭ 14 (+366.67%)
Mutual labels:  stream-processing

FlinkExperiments

Project

This project is a sample project for Apache Flink. The application parses the Quality Controlled Local Climatological Data (QCLCD) of March 2015, calculates the maximum daily temperature of the stream by using Apache Flink and writes the results back into an Elasticsearch and PostgreSQL database.

Dataset

The data is the Quality Controlled Local Climatological Data (QCLCD):

Quality Controlled Local Climatological Data (QCLCD) consist of hourly, daily, and monthly summaries for approximately 1,600 U.S. locations. Daily Summary forms are not available for all stations. Data are available beginning January 1, 2005 and continue to the present. Please note, there may be a 48-hour lag in the availability of the most recent data.

The data is available at:

Result

Kibana Visualization of the Average Temperature

Running the Examples

The records in the Quality Controlled Local Climatological Data (QCLCD) dataset are not sorted by the timestamp. The dataset needs to be prepared first, so that all records are sorted ascending by the time of measurement.

I have written a small application, that sorts the original CSV data by the measurement time:

The result is a sorted CSV file, which can be used to run the examples.

Further Reading

I have written several blog posts on Apache Flink:

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].