All Projects → behindthescenes-group → oesophagus

behindthescenes-group / oesophagus

Licence: AGPL-3.0 License
Enterprise Grade Single-Step Streaming Data Infrastructure Setup. (Under Development)

Programming Languages

shell
77523 projects
python
139335 projects - #7 most used programming language
HTML
75241 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to oesophagus

Realtime
Listen to your to PostgreSQL database in realtime via websockets. Built with Elixir.
Stars: ✭ 4,278 (+35550%)
Mutual labels:  postgres, change-data-capture
Target Postgres
A Singer.io Target for Postgres
Stars: ✭ 70 (+483.33%)
Mutual labels:  postgres, etl
lighthouse
Easy clojure relational database queries, migrations and connection pooling
Stars: ✭ 19 (+58.33%)
Mutual labels:  postgres, relational-databases
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+558.33%)
Mutual labels:  postgres, etl
teanjs
🔥 TypeORM - Express - Angular 8 - NestJS Server Side Rendering (SSR) 😺
Stars: ✭ 62 (+416.67%)
Mutual labels:  postgres
relay-starter-kit
💥 Monorepo template (seed project) pre-configured with GraphQL API, PostgreSQL, React, Relay, Material UI.
Stars: ✭ 3,513 (+29175%)
Mutual labels:  postgres
next-postgres
A minimal example web application using NextJS 12.0.7, Postgres 11, Google OAuth2 and other useful libraries.
Stars: ✭ 72 (+500%)
Mutual labels:  postgres
kafka-connect-datagen
A Kafka Connect source connector that generates data for tests
Stars: ✭ 27 (+125%)
Mutual labels:  etl
docker-postgres-windows
No description or website provided.
Stars: ✭ 19 (+58.33%)
Mutual labels:  postgres
postgres exporter
Postgres exporter
Stars: ✭ 14 (+16.67%)
Mutual labels:  postgres
mathesar
Web application providing an intuitive user experience to databases.
Stars: ✭ 95 (+691.67%)
Mutual labels:  postgres
benchmark
Parse Server Continuous Benchmark
Stars: ✭ 21 (+75%)
Mutual labels:  postgres
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+3850%)
Mutual labels:  postgres
MySqlCdc
MySQL/MariaDB binlog replication client for .NET
Stars: ✭ 71 (+491.67%)
Mutual labels:  change-data-capture
pg-audit-json
Simple, easily customised trigger-based auditing for PostgreSQL (Postgres). See also pgaudit.
Stars: ✭ 34 (+183.33%)
Mutual labels:  postgres
restria
Entria's REST API boilerplate
Stars: ✭ 25 (+108.33%)
Mutual labels:  postgres
DataXServer
为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
Stars: ✭ 130 (+983.33%)
Mutual labels:  etl
lineage
Generate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (+33.33%)
Mutual labels:  etl
Serilog.Sinks.Postgresql.Alternative
Serilog.Sinks.Postgresql.Alternative is a library to save logging information from https://github.com/serilog/serilog to https://www.postgresql.org/.
Stars: ✭ 29 (+141.67%)
Mutual labels:  postgres
amcheck
contrib/amcheck from Postgres v11 backported to earlier Postgres versions
Stars: ✭ 74 (+516.67%)
Mutual labels:  postgres

Share your use-case: Google Form

Will help in shaping initial features.


Read more about it on my Blog at Towards Data Science: https://tinyurl.com/yyqr79dh

Oesophagus Ecosystem

Oesophagus enables you to deploy an entirely plug-n-play Data Infrastructure to advance your organisation's data capability.

The architecture consists of:

  • Data Producers, services to fetch data from Relational Databases, 3rd Party APIs, etc.
  • Stream Processors, like Kafka Streams and KSQL.
  • Data Consumers, services to load data into Columnar or Document-Oriented Databases, Search Indices, or other downstream databases and services.

Data Pipeline

Example: Postgres to Elasticsearch Real-Time ETL Setup:

Requirements

Deployment

# Start kafka, connect, schema-registry, ksqldb, ksqlcli, postgres, elasticsearch and automation-scripts
$ docker-compose up -d

Testing Services

# GET Request on Elasticsearch server to test availability
$ curl -f 'localhost:9200'

# Search all indices in Elasticsearch
$ curl -f 'localhost:9200/_search'

Why use Oesophagus's Postgres CDC?

Oesophagus Postges CDC Producer is built to Extract, Transform and Load Relation Databases' data to Downstream databases/services.

It uses Change-Data-Capture Pattern to read changes from the WAL (Write-Ahead-Logs) of the source database.

Change Data Capture (CDC), as its name suggests, is a Database Design Pattern that captures individual data changes instead of dealing with the entire data. Instead of dumping your entire database, using CDC, you would capture just the data changes made to the master database and apply them to the BI databases to keep both of your databases in sync. This is much more scalable because it only deals with data changes. Also, the replication can be done much faster, often in near real-time.

Information Source: FlyData

Functionality

Note: Before starting the service, wal2json plugin should be installed on your postgres container to fetch database logs.

  1. As the service starts, it will first make a Full Table Replication for all the listed table name keys in producer.json.
  2. After the full table migration, the service starts listening to database logs using the replication slot that is created automatically before Full Table Migration starts.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].