All Projects β†’ apache β†’ incubator-inlong

apache / incubator-inlong

Licence: Apache-2.0 license
Apache InLong - a one-stop integration framework for massive data

Programming Languages

java
68154 projects - #9 most used programming language
javascript
184084 projects - #8 most used programming language
C++
36643 projects - #6 most used programming language
typescript
32286 projects
go
31211 projects - #10 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to incubator-inlong

tgip
TGIP (TGI Pulsar) is a weekly live video streaming about Apache Pulsar and its ecosystem.
Stars: ✭ 17 (-98.44%)
Mutual labels:  event-streaming
walrus
πŸ•‘ Real-time event streaming platform built on top of gRPC streams
Stars: ✭ 15 (-98.62%)
Mutual labels:  event-streaming
Strimzi Kafka Operator
Apache Kafka running on Kubernetes
Stars: ✭ 2,833 (+160.39%)
Mutual labels:  data-streaming
pravega-samples
Sample Applications for Pravega.
Stars: ✭ 43 (-96.05%)
Mutual labels:  data-streaming
wax-ml
A Python library for machine-learning and feedback loops on streaming data
Stars: ✭ 36 (-96.69%)
Mutual labels:  data-streaming
Kafdrop
Kafka Web UI
Stars: ✭ 3,158 (+190.26%)
Mutual labels:  event-streaming
Pulsar
Apache Pulsar - distributed pub-sub messaging system
Stars: ✭ 10,118 (+829.96%)
Mutual labels:  event-streaming
Debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Stars: ✭ 5,937 (+445.68%)
Mutual labels:  event-streaming
pulsar-client-node
Apache Pulsar NodeJS Client
Stars: ✭ 88 (-91.91%)
Mutual labels:  event-streaming
pulsar-helm-chart
Official Apache Pulsar Helm Chart
Stars: ✭ 122 (-88.79%)
Mutual labels:  event-streaming
pulsar-adapters
Apache Pulsar Adapters
Stars: ✭ 18 (-98.35%)
Mutual labels:  event-streaming
scylla-cdc-source-connector
A Kafka source connector capturing Scylla CDC changes
Stars: ✭ 19 (-98.25%)
Mutual labels:  event-streaming
redis-connect-dist
Real-Time Event Streaming & Change Data Capture
Stars: ✭ 21 (-98.07%)
Mutual labels:  event-streaming
incubator-eventmesh
EventMesh is a dynamic event-driven application runtime used to decouple the application and backend middleware layer, which supports a wide range of use cases that encompass complex multi-cloud, widely distributed topologies using diverse technology stacks.
Stars: ✭ 939 (-13.69%)
Mutual labels:  event-streaming
commander
Build event-driven and event streaming applications with ease
Stars: ✭ 60 (-94.49%)
Mutual labels:  event-streaming
pulsar-io-kafka
Pulsar IO Kafka Connector
Stars: ✭ 24 (-97.79%)
Mutual labels:  event-streaming

A one-stop integration framework for massive data

Build Status CodeCov Maven Central GitHub release License Twitter Slack

What is Apache InLong?

Stargazers Over Time Contributors Over Time
Stargazers over time Contributor Over Time

Apache InLong is a one-stop integration framework for massive data that provides automatic, secure and reliable data transmission capabilities. InLong supports both batch and stream data processing at the same time, which offers great power to build data analysis, modeling and other real-time applications based on streaming data.

InLong (εΊ”ιΎ™) is a divine beast in Chinese mythology who guides the river into the sea, and it is regarded as a metaphor of the InLong system for reporting data streams.

InLong was originally built at Tencent, which has served online businesses for more than 8 years, to support massive data (data scale of more than 80 trillion pieces of data per day) reporting services in big data scenarios. The entire platform has integrated 5 modules: Ingestion, Convergence, Caching, Sorting, and Management, so that the business only needs to provide data sources, data service quality, data landing clusters and data landing formats, that is, the data can be continuously pushed from the source to the target cluster, which greatly meets the data reporting service requirements in the business big data scenario.

For getting more information, please visit our project documentation at https://inlong.apache.org/. inlong-structure-en.png

Features

Apache InLong offers a variety of features:

  • Ease of Use: a SaaS-based service platform. Users can easily and quickly report, transfer, and distribute data by publishing and subscribing to data based on topics.
  • Stability & Reliability: derived from the actual online production environment. It delivers high-performance processing capabilities for 10 trillion-level data streams and highly reliable services for 100 billion-level data streams.
  • Comprehensive Features: supports various types of data access methods and can be integrated with different types of Message Queue (MQ). It also provides real-time data extract, transform, and load (ETL) and sorting capabilities based on rules. InLong also allows users to plug features to extend system capabilities.
  • Service Integration: provides unified system monitoring and alert services. It provides fine-grained metrics to facilitate data visualization. Users can view the running status of queues and topic-based data statistics in a unified data metric platform. Users can also configure the alert service based on their business requirements so that users can be alerted when errors occur.
  • Scalability: adopts a pluggable architecture that allows you to plug modules into the system based on specific protocols. Users can replace components and add features based on their business requirements.

When should I use InLong?

InLong is based on MQ and aims to provide a one-stop, practice-tested module pluggable integration framework for massive data, based on this system, users can easily build stream-based data applications. It is suitable for environments that need to quickly build a data reporting platform, as well as an ultra-large-scale data reporting environment that InLong is very suitable for, and an environment that needs to automatically sort and land the reported data.

You can use InLong in the following ways:

Supported Data Nodes (Updating)

Type Name Version Architecture
Extract Node Auto Push None Standard
File None Standard
Kafka 2.x Lightweight, Standard
MongoDB >= 3.6 Lightweight, Standard
MQTT >= 3.1 Standard
MySQL 5.6, 5.7, 8.0.x Lightweight, Standard
Oracle 11,12,19 Lightweight
PostgreSQL 9.6, 10, 11, 12 Lightweight, Standard
Pulsar 2.8.x Lightweight
Redis 2.6.x Standard
SQLServer 2012, 2014, 2016, 2017, 2019 Lightweight, Standard
Load Node Auto Consumption None Standard
ClickHouse 20.7+ Lightweight, Standard
Elasticsearch 6.x, 7.x Lightweight, Standard
Greenplum 4.x, 5.x, 6.x Lightweight, Standard
HBase 2.2.x Lightweight, Standard
HDFS 2.x, 3.x Lightweight, Standard
Hive 1.x, 2.x, 3.x Lightweight, Standard
Iceberg 0.12.x Lightweight, Standard
Hudi 0.12.x Lightweight, Standard
Kafka 2.x Lightweight, Standard
MySQL 5.6, 5.7, 8.0.x Lightweight, Standard
Oracle 11, 12, 19 Lightweight, Standard
PostgreSQL 9.6, 10, 11, 12 Lightweight, Standard
SQLServer 2012, 2014, 2016, 2017, 2019 Lightweight, Standard
TDSQL-PostgreSQL 10.17 Lightweight, Standard
Doris >= 0.13 Lightweight, Standard
StarRocks >= 2.0 Lightweight, Standard

Build InLong

More detailed instructions can be found at Quick Start section in the documentation.

Requirements:

Compile and install:

mvn clean install -DskipTests

(Optional) Compile using docker image:

docker pull maven:3.6-openjdk-8
docker run -v `pwd`:/inlong  -w /inlong maven:3.6-openjdk-8 mvn clean install -DskipTests

after compile successfully, you could find distribution file at inlong-distribution/target.

Deploy InLong

Develop InLong

Contribute to InLong

Contact Us

Documentation

License

Β© Contributors Licensed under an Apache-2.0 license.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].