All Projects → datafibers-community → df_data_service

datafibers-community / df_data_service

Licence: Apache-2.0 license
DataFibers Data Service

Programming Languages

java
68154 projects - #9 most used programming language
javascript
184084 projects - #8 most used programming language
CSS
56736 projects
HTML
75241 projects

Projects that are alternatives of or similar to df data service

Bigdata Notebook
Stars: ✭ 100 (+222.58%)
Mutual labels:  streaming, hadoop, flink
Streamline
StreamLine - Streaming Analytics
Stars: ✭ 151 (+387.1%)
Mutual labels:  streaming, flink
Flink Learning
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+36603.23%)
Mutual labels:  streaming, flink
TIL
Today I Learned
Stars: ✭ 43 (+38.71%)
Mutual labels:  mongo, hadoop
Athenax
SQL-based streaming analytics platform at scale
Stars: ✭ 1,178 (+3700%)
Mutual labels:  streaming, flink
Fiflow
flink-sql 在 flink 上运行 sql 和 构建数据流的平台 基于 apache flink 1.10.0
Stars: ✭ 100 (+222.58%)
Mutual labels:  streaming, flink
vertx-mongo-client
Mongo Client for Eclipse Vert.x
Stars: ✭ 54 (+74.19%)
Mutual labels:  mongo, vertx
Big Whale
Spark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (+425.81%)
Mutual labels:  hadoop, flink
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (+419.35%)
Mutual labels:  mongo, hadoop
dpkb
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (+296.77%)
Mutual labels:  hadoop, flink
bigdata-doc
大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (+19.35%)
Mutual labels:  hadoop, flink
Streaming Readings
Streaming System 相关的论文读物
Stars: ✭ 554 (+1687.1%)
Mutual labels:  streaming, flink
Kafka Connect Hdfs
Kafka Connect HDFS connector
Stars: ✭ 400 (+1190.32%)
Mutual labels:  streaming, hadoop
Flink Training Course
Flink 中文视频课程(持续更新...)
Stars: ✭ 3,963 (+12683.87%)
Mutual labels:  streaming, flink
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-6.45%)
Mutual labels:  hadoop, flink
Flink Spector
Framework for Apache Flink unit tests
Stars: ✭ 190 (+512.9%)
Mutual labels:  streaming, flink
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+5887.1%)
Mutual labels:  hadoop, flink
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (+306.45%)
Mutual labels:  hadoop, flink
Android Nosql
Lightweight, simple structured NoSQL database for Android
Stars: ✭ 284 (+816.13%)
Mutual labels:  mongo, hadoop
Websockets-Vertx-Flink-Kafka
A simple request response cycle using Websockets, Eclipse Vert-x server, Apache Kafka, Apache Flink.
Stars: ✭ 14 (-54.84%)
Mutual labels:  vertx, flink

DataFibers Data Services

DataFibers (DF) - A pure streaming processing application on Kafka and Flink. The DF processor has two components defined to deal with stream ETL (Extract, Transform, and Load).

  • Connects is to leverage Kafka Connect REST API on Confluent to landing or publishing data in or out of Apache Kafka.
  • Transforms is to leverage streaming processing engine, such as Apache Flink, for data transformation.

Building

You build the project using:

mvn clean package

Testing

The application is tested using vertx-unit.

Packaging

The application is packaged as a fat jar, using the Maven Shade Plugin.

Running

Once packaged, just launch the fat jar as follows ways

  • Default with no parameters to launch standalone mode with web ui.
java -jar df-data-service-<version>-SNAPSHOT-fat.jar
  • For more running features checking help option
java -jar df-data-service-<version>-SNAPSHOT-fat.jar -h

Web UI

http://localhost:8000/ or http://localhost:8000/dfa/

Manual

https://datafibers-community.gitbooks.io/datafibers-complete-guide/content/

Demo

DataFibers Demo

Todo

  • Fetch all installed connectors/plugins in regularly frequency
  • Need to report connector or job status
  • Need an initial method to import all available|paused|running connectors from kafka connect
  • Add Flink Table API engine
  • Add memory LKP
  • Add Connects, Transforms Logging URL
  • Add to generic function to do connector validation before creation
  • Add submit other job actions, such as start, hold, etc
  • Add Spark Structure Streaming
  • Topic visualization
  • Launch 3rd party jar
  • Job level control, schedule, and metrics
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].