All Projects → MaterializeInc → Materialize

MaterializeInc / Materialize

Licence: other
Materialize lets you ask questions of your live data, which it answers and then maintains for you as your data continue to change. The moment you need a refreshed answer, you can get it in milliseconds. Materialize is designed to help you interactively explore your streaming data, perform data warehousing analytics against live relational data, or just increase the freshness and reduce the load of your dashboard and monitoring tasks.

Programming Languages

rust
11053 projects
python
139335 projects - #7 most used programming language
shell
77523 projects
Dockerfile
14818 projects
javascript
184084 projects - #8 most used programming language
HTML
75241 projects

Projects that are alternatives of or similar to Materialize

Corfudb
A cluster consistency platform
Stars: ✭ 539 (-83.87%)
Mutual labels:  database, streaming, distributed-systems
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+37.11%)
Mutual labels:  sql, database, distributed-systems
Rqlite
The lightweight, distributed relational database built on SQLite
Stars: ✭ 9,147 (+173.78%)
Mutual labels:  sql, database, distributed-systems
Jocko
Kafka implemented in Golang with built-in coordination (No ZK dep, single binary install, Cloud Native)
Stars: ✭ 4,445 (+33.04%)
Mutual labels:  kafka, streaming, distributed-systems
Eventql
Distributed "massively parallel" SQL query engine
Stars: ✭ 1,121 (-66.45%)
Mutual labels:  sql, database, streaming
Omniscidb
OmniSciDB (formerly MapD Core)
Stars: ✭ 2,601 (-22.15%)
Mutual labels:  sql, database
Herddb
A JVM-embeddable Distributed Database
Stars: ✭ 192 (-94.25%)
Mutual labels:  sql, database
Trilogy
TypeScript SQLite layer with support for both native C++ & pure JavaScript drivers.
Stars: ✭ 195 (-94.16%)
Mutual labels:  sql, database
Voik
♒︎ [WIP] An experimental ~distributed~ commit-log
Stars: ✭ 200 (-94.01%)
Mutual labels:  kafka, streaming
Swiftdb
A modern database abstraction layer, batteries included.
Stars: ✭ 183 (-94.52%)
Mutual labels:  sql, database
Better Sqlite3
The fastest and simplest library for SQLite3 in Node.js.
Stars: ✭ 2,778 (-16.85%)
Mutual labels:  sql, database
Db
Data access layer for PostgreSQL, CockroachDB, MySQL, SQLite and MongoDB with ORM-like features.
Stars: ✭ 2,832 (-15.23%)
Mutual labels:  sql, database
Pdo
Connecting to MySQL in PHP using PDO.
Stars: ✭ 187 (-94.4%)
Mutual labels:  sql, database
Snowflake Connector Python
Snowflake Connector for Python
Stars: ✭ 185 (-94.46%)
Mutual labels:  sql, database
Fluentmigrator
Fluent migrations framework for .NET
Stars: ✭ 2,636 (-21.1%)
Mutual labels:  sql, database
Cubrid
CUBRID is a comprehensive open source relational database management system highly optimized for Web Applications.
Stars: ✭ 184 (-94.49%)
Mutual labels:  sql, database
Npgsql
Npgsql is the .NET data provider for PostgreSQL.
Stars: ✭ 2,415 (-27.72%)
Mutual labels:  sql, database
Dbtester
Distributed database benchmark tester
Stars: ✭ 214 (-93.59%)
Mutual labels:  database, distributed-systems
Shardingsphere
Build criterion and ecosystem above multi-model databases
Stars: ✭ 14,989 (+348.64%)
Mutual labels:  sql, database
Java Persistence Frameworks Comparison
Comparison of non-JPA SQL mapping frameworks for Java (Jooq, Spring JDBCTemplate, MyBatis, EBean, JDBI, Speedment, sql2o)
Stars: ✭ 213 (-93.62%)
Mutual labels:  sql, database

Build status Doc reference Chat on Slack

Materialize is a streaming database for real-time applications.

Get started

Check out our getting started guide.

About

Materialize lets you ask questions of your live data, which it answers and then maintains for you as your data continue to change. The moment you need a refreshed answer, you can get it in milliseconds. Materialize is designed to help you interactively explore your streaming data, perform data warehousing analytics against live relational data, or just increase the freshness and reduce the load of your dashboard and monitoring tasks.

Materialize focuses on providing correct and consistent answers with minimal latency. It does not ask you to accept either approximate answers or eventual consistency. Whenever Materialize answers a query, that answer is the correct result on some specific (and recent) version of your data. Materialize does all of this by recasting your SQL92 queries as dataflows, which can react efficiently to changes in your data as they happen. Materialize is powered by timely dataflow, which connects the times at which your inputs change with the times of answers reported back to you.

We support a large fraction of PostgreSQL, and are actively working on supporting more builtin PostgreSQL functions. Please file an issue if there's something that you expected to work that didn't!

Get data in

Materialize reads Avro, Protobuf, JSON, and newline-delimited text. Need something else? Just ask.

Materialize can read data from Kafka topics, Kinesis streams (in preview), or tail local files.

Transform, manipulate, and read your data

Once you've got the data in, define views and perform reads via the PostgreSQL protocol. Use your favorite PostgreSQL CLI, including the psql you probably already have on your system.

Materialize supports a comprehensive variety of SQL features, all using the PostgreSQL dialect and protocol:

  • Joins, Joins, Joins! Materialize supports multi-column join conditions, multi-way joins, self-joins, cross-joins, inner joins, outer joins, etc.
  • Delta-joins avoid intermediate state blowup compared to systems that can only plan nested binary joins - tested on joins of up to 64 relations.
  • Support for subqueries. Materialize's SQL optimizer performs subquery decorrelation out-of-the-box, avoiding the need to manually rewrite subqueries into joins.
  • Materialize supports streams that contain CDC data (currently supporting the Debezium format). Materialize can incrementally maintain views in the presence of arbitrary inserts, updates, and deletes. No asterisks.
  • All the aggregations. GROUP BY , MIN, MAX, COUNT, SUM, STDDEV, HAVING, etc.
  • ORDER BY
  • LIMIT
  • DISTINCT
  • JSON support in the PostgreSQL dialect including operators and functions like ->, ->>, @>, ?, jsonb_array_element, jsonb_each. Materialize automatically plans lateral joins for efficient jsonb_each support.
  • Nest views on views on views!
  • Multiple views that have overlapping subplans can share underlying indices for space and compute efficiency, so just declaratively define what you want, and we'll worry about how to efficiently maintain them.

Just show us what it can do!

Here's an example join query that works fine in Materialize, TPC-H query 15:

-- Views define commonly reused subqueries.
CREATE VIEW revenue (supplier_no, total_revenue) AS
    SELECT
        l_suppkey,
        SUM(l_extendedprice * (1 - l_discount))
    FROM
        lineitem
    WHERE
        l_shipdate >= DATE '1996-01-01'
        AND l_shipdate < DATE '1996-01-01' + INTERVAL '3' month
    GROUP BY
        l_suppkey;

-- Materialized views are maintained automatically, and can depend on non-materialized views.
CREATE MATERIALIZED VIEW tpch_q15 AS
  SELECT
    s_suppkey,
    s_name,
    s_address,
    s_phone,
    total_revenue
FROM
    supplier,
    revenue
WHERE
    s_suppkey = supplier_no
    AND total_revenue = (
        SELECT
            max(total_revenue)
        FROM
            revenue
    )
ORDER BY
    s_suppkey

Stream inserts, updates, and deletes on the underlying tables (lineitem and supplier), and Materialize keeps the materialized view incrementally updated. You can type SELECT * FROM tpch_q15 and expect to see the current results immediately!

Get data out

Pull based: Use any PostgreSQL-compatible driver in any language/environment to make SELECT queries against your views. Tell them they're talking to a PostgreSQL database, they don't ever need to know otherwise.

Push based: Or configure Materialize to stream results to a Kafka topic as soon as the views change.

If you want to use an ORM, chat with us. They're surprisingly tricky.

Documentation

Check out our documentation.

License

Materialize is source-available and licensed under the BSL 1.1, converting to the open-source Apache 2.0 license after 4 years. As stated in the BSL, Materialize is free forever on a single node.

Materialize is also available as a paid cloud service with additional features such as high availability via multi-active replication.

How does it work?

Materialize is built on top of differential dataflow and timely dataflow, and builds on a decade of cutting-edge stream processing research.

For developers

Materialize is written entirely in Rust.

Developers can find docs at doc/developer, and Rust API documentation is hosted at https://dev.materialize.com/api/rust/. The Materialize development roadmap is divided up into roughly month-long milestones, and managed in GitHub.

Contributions are welcome. Prospective code contributors might find the good first issue tag useful. We value all contributions equally, but bug reports are more equal.

Credits

Materialize is lovingly crafted by a team of developers and one bot. Join us.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].