
postgrespro / Zson

License: MIT
ZSON is a PostgreSQL extension for transparent JSONB compression

Programming Languages

c
50402 projects - #5 most used programming language

Projects that are alternatives of or similar to Zson

Mobility
Pluggable Ruby translation framework
Stars: ✭ 644 (+67.27%)
Mutual labels:  json, postgresql
Pg variables
Session wide variables for PostgreSQL
Stars: ✭ 44 (-88.57%)
Mutual labels:  json, postgresql
Pmacct
pmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].
Stars: ✭ 677 (+75.84%)
Mutual labels:  json, postgresql
Lithium
Easy to use C++17 HTTP Server with no compromise on performances. https://matt-42.github.io/lithium
Stars: ✭ 523 (+35.84%)
Mutual labels:  json, postgresql
Octosql
OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.
Stars: ✭ 2,579 (+569.87%)
Mutual labels:  json, postgresql
Full Stack Fastapi Postgresql
Full stack, modern web application generator. Using FastAPI, PostgreSQL as database, Docker, automatic HTTPS and more.
Stars: ✭ 7,635 (+1883.12%)
Mutual labels:  json, postgresql
Treefrog Framework
TreeFrog Framework : High-speed C++ MVC Framework for Web Application
Stars: ✭ 885 (+129.87%)
Mutual labels:  json, postgresql
Bareos
Main repository with the code for the libraries and daemons
Stars: ✭ 651 (+69.09%)
Mutual labels:  postgresql, compression
Qxorm
QxOrm library - C++ Qt ORM (Object Relational Mapping) and ODM (Object Document Mapper) library - Official repository
Stars: ✭ 176 (-54.29%)
Mutual labels:  json, postgresql
Jsonschema Key Compression
Compress json-data based on its json-schema while still having valid json
Stars: ✭ 59 (-84.68%)
Mutual labels:  json, compression
Doctrine Json Odm
An object document mapper for Doctrine ORM using JSON types of modern RDBMS.
Stars: ✭ 420 (+9.09%)
Mutual labels:  json, postgresql
Zipson
JSON parse and stringify with compression
Stars: ✭ 229 (-40.52%)
Mutual labels:  json, compression
Cstore fdw
Columnar storage extension for Postgres built as a foreign data wrapper. Check out https://github.com/citusdata/citus for a modernized columnar storage implementation built as a table access method.
Stars: ✭ 1,642 (+326.49%)
Mutual labels:  postgresql, compression
Trdsql
CLI tool that can execute SQL queries on CSV, LTSV, JSON and TBLN. Can output to various formats.
Stars: ✭ 593 (+54.03%)
Mutual labels:  json, postgresql
Laravel Pg Extensions
Laravel extensions for Postgres
Stars: ✭ 33 (-91.43%)
Mutual labels:  postgresql, extensions
Lib
single header libraries for C/C++
Stars: ✭ 866 (+124.94%)
Mutual labels:  json, compression
Jsoncrush
Compress JSON into URL friendly strings
Stars: ✭ 1,071 (+178.18%)
Mutual labels:  json, compression
Datoji
A tiny JSON storage service. Create, Read, Update, Delete and Search JSON data.
Stars: ✭ 222 (-42.34%)
Mutual labels:  json, postgresql
Storagetapper
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (-39.74%)
Mutual labels:  json, postgresql
Redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+5132.99%)
Mutual labels:  postgresql

ZSON

ZSON Logo

About

ZSON is a PostgreSQL extension for transparent JSONB compression. Compression is based on a shared dictionary of the strings most frequently used in specific JSONB documents (not only keys, but also values, array elements, etc.).

In some cases ZSON can save half of your disk space and give you about 10% more TPS. Memory is saved as well; see docs/benchmark.md. Everything depends on your data and workload, though, so don't take any benchmark on faith: re-check everything on your own data, configuration, hardware, workload and PostgreSQL version.
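One quick way to sanity-check the savings on your own data is to copy real documents into a zson column and compare table sizes. This is only a sketch: the table and column names are illustrative, it assumes ZSON has already been trained (see Usage below), and it assumes the extension's jsonb-to-zson cast is available.

```sql
-- Illustrative check; docs_jsonb is a hypothetical table with a jsonb column.
create table docs_zson as select doc::zson as doc from docs_jsonb;

select pg_total_relation_size('docs_jsonb') as jsonb_bytes,
       pg_total_relation_size('docs_zson')  as zson_bytes;
```

If `zson_bytes` is not noticeably smaller, the dictionary probably was not trained on data representative of these documents.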

ZSON was originally created in 2016 by the Postgres Professional team: research and code by Aleksander Alekseev; ideas, code review, testing, etc. by Alexander Korotkov and Teodor Sigaev.

See also discussions on [email protected], Hacker News, Reddit and HabraHabr.

Install

Build and install ZSON:

cd /path/to/zson/source/code
make
sudo make install

Run tests:

make installcheck

Connect to PostgreSQL:

psql my_database

Enable extension:

create extension zson;

Uninstall

Disable extension:

drop extension zson;

Uninstall ZSON:

cd /path/to/zson/source/code
sudo make uninstall

Usage

First, ZSON should be trained on common data using the zson_learn procedure:

zson_learn(
    tables_and_columns text[][],
    max_examples int default 10000,
    min_length int default 2,
    max_length int default 128,
    min_count int default 2
)

Example:

select zson_learn('{{"table1", "col1"}, {"table2", "col2"}}');

You can create a temporary table and write some typical JSONB documents into it manually, or use existing tables. The idea is to provide a representative subset of real data: if one document type is twice as frequent as another, ZSON expects the learning set to contain twice as many documents of the first type.
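For instance, a learning set reflecting those proportions could be assembled like this (the table name and documents are purely illustrative):

```sql
create table zson_learning_set (doc jsonb);

-- "click" documents are twice as frequent as "purchase" documents
-- in real data, so the learning set keeps the same 2:1 ratio
insert into zson_learning_set values
  ('{"type": "click", "page": "/home"}'),
  ('{"type": "click", "page": "/about"}'),
  ('{"type": "purchase", "item_id": 42}');

select zson_learn('{{"zson_learning_set", "doc"}}');
```

The table can be dropped after training; the dictionary lives in zson_dict.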

The resulting dictionary can be examined using this query:

select * from zson_dict;

Now the zson type can be used as a complete and transparent replacement for the jsonb type:

zson_test=# create table zson_example(x zson);
CREATE TABLE

zson_test=# insert into zson_example values ('{"aaa": 123}');
INSERT 0 1

zson_test=# select x -> 'aaa' from zson_example;
-[ RECORD 1 ]-
?column? | 123

Migrating to a new dictionary

When the schema of JSONB documents evolves, ZSON can be re-trained:

select zson_learn('{{"table1", "col1"}, {"table2", "col2"}}');

This time a second dictionary will be created. Dictionaries are cached in memory, so it will take about a minute before ZSON notices the new one. After that, old documents will be decompressed using the old dictionary, while new documents will be compressed and decompressed using the new dictionary.
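Old rows are not rewritten automatically. If you want to migrate them to the new dictionary, one approach is to force a rewrite of the column. The round-trip through jsonb below is an assumption on my part (it relies on the extension's jsonb/zson casts and on re-compression happening on write); verify it on your version before running it on real data:

```sql
-- sketch: force re-compression of existing rows
-- with the currently active dictionary
update table1 set col1 = col1::jsonb::zson;
```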

To find out which dictionary is used for a given ZSON document, use the zson_info procedure:

zson_test=# select zson_info(x) from test_compress where id = 1;
-[ RECORD 1 ]---------------------------------------------------
zson_info | zson version = 0, dict version = 1, ...

zson_test=# select zson_info(x) from test_compress where id = 2;
-[ RECORD 1 ]---------------------------------------------------
zson_info | zson version = 0, dict version = 0, ...

Once all ZSON documents have been migrated to the new dictionary, the old one can be safely removed:

delete from zson_dict where dict_id = 0;
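Before deleting, it is worth confirming that no rows still reference the old dictionary. Assuming zson_info returns text in the format shown above (an assumption; check the actual output on your installation), a check could look like:

```sql
-- should return 0 before the old dictionary is dropped
select count(*)
from test_compress
where zson_info(x) like '%dict version = 0%';
```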

In general, it's safer to keep old dictionaries just in case. Gaining a few KB of disk space is not worth the risk of losing data.

When is it time to re-learn?

Unfortunately, it's hard to recommend a general approach.

A good heuristic could be:

select pg_table_size('tt') / (select count(*) from tt)

... i.e. the average document size. When it suddenly starts to grow, it's time to re-learn.
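To make such growth visible, the heuristic can be sampled periodically (e.g. from cron) into a small log table. The names below are illustrative, and 'tt' is the hypothetical table from the query above; greatest() guards against division by zero on an empty table:

```sql
create table if not exists zson_size_log (
    ts        timestamptz default now(),
    avg_bytes bigint
);

-- run periodically and watch avg_bytes over time
insert into zson_size_log (avg_bytes)
select pg_table_size('tt') / greatest(count(*), 1) from tt;
```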

However, developers usually know when they change a schema significantly. It's also easy to check whether the current schema differs a lot from the original one using the zson_dict table.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].