Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-94.01%)
autopivotAutoPivot automatically creates in-memory OLAP cubes from CSV files, that you can explore from Excel, Tableau or using the embedded ActiveUI web frontend
Stars: ✭ 23 (-98.5%)
CrateCrateDB is a distributed SQL database that makes it simple to store and analyze
massive amounts of data in real-time.
Stars: ✭ 3,254 (+111.99%)
CboardAn easy to use, self-service open BI reporting and BI dashboard platform.
Stars: ✭ 2,795 (+82.08%)
vinumVinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.
Stars: ✭ 57 (-96.29%)
Pivot4jPivot4J provides a common API for OLAP servers which can be used to build an analytical service frontend with pivot style GUI.
Stars: ✭ 113 (-92.64%)
olap睿思BI-OLAP开源多维分析系统
Stars: ✭ 101 (-93.42%)
HTAPBenchBenchmark suite to evaluate HTAP database engines
Stars: ✭ 15 (-99.02%)
CubesviewerExplore and visualize analytical datasets
Stars: ✭ 416 (-72.9%)
ClickhouseClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+1273.88%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-96.48%)
intelli-swift-coreDistributed, Column-oriented storage, Realtime analysis, High performance Database
Stars: ✭ 17 (-98.89%)
duckdbDuckDB is an in-process SQL OLAP Database Management System
Stars: ✭ 4,707 (+206.64%)
VectorsqlVectorSQL is a free analytics DBMS for IoT & Big Data, compatible with ClickHouse.
Stars: ✭ 171 (-88.86%)
matrixoneHyperconverged cloud-edge native database
Stars: ✭ 1,057 (-31.14%)
TiBigDataTiDB connectors for Flink/Hive/Presto
Stars: ✭ 192 (-87.49%)
arrow-datafusionApache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+53.75%)
fdp-modelserverAn umbrella project for multiple implementations of model serving
Stars: ✭ 47 (-96.94%)
KuiBaDBAnother OLAP database
Stars: ✭ 297 (-80.65%)
Flink Boot懒松鼠Flink-Boot 脚手架让Flink全面拥抱Spring生态体系,使得开发者可以以Java WEB开发模式开发出分布式运行的流处理程序,懒松鼠让跨界变得更加简单。懒松鼠旨在让开发者以更底上手成本(不需要理解分布式计算的理论知识和Flink框架的细节)便可以快速编写业务代码实现。为了进一步提升开发者使用懒松鼠脚手架开发大型项目的敏捷的度,该脚手架默认集成Spring框架进行Bean管理,同时将微服务以及WEB开发领域中经常用到的框架集成进来,进一步提升开发速度。比如集成Mybatis ORM框架,Hibernate Validator校验框架,Spring Retry重试框架等,具体见下面的脚手架特性。
Stars: ✭ 209 (-86.38%)
Flink Sql CookbookThe Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.
Stars: ✭ 189 (-87.69%)
IndexrAn open-source columnar data format designed for fast & realtime analytic with big data.
Stars: ✭ 433 (-71.79%)
DatafuseDatafuse is a free Cloud-Native Analytics DBMS(Inspired by ClickHouse) implemented in Rust
Stars: ✭ 327 (-78.7%)
flink-deployerA tool that help automate deployment to an Apache Flink cluster
Stars: ✭ 143 (-90.68%)
Sybilcolumnar storage + NoSQL OLAP engine | https://logv.org
Stars: ✭ 270 (-82.41%)
DuckdbDuckDB is an in-process SQL OLAP Database Management System
Stars: ✭ 4,014 (+161.5%)
GuitarA Simple and Efficient Distributed Multidimensional BI Analysis Engine.
Stars: ✭ 86 (-94.4%)
OmniscidbOmniSciDB (formerly MapD Core)
Stars: ✭ 2,601 (+69.45%)
MondrianMondrian is an Online Analytical Processing (OLAP) server that enables business users to analyze large quantities of data in real-time.
Stars: ✭ 947 (-38.31%)
RegistrySchema Registry
Stars: ✭ 184 (-88.01%)
flink-clientJava library for managing Apache Flink via the Monitoring REST API
Stars: ✭ 48 (-96.87%)
OLAP-cubeis an hypercube of data
Stars: ✭ 23 (-98.5%)
Bats面向 OLTP、OLAP、批处理、流处理场景的大一统 SQL 引擎
Stars: ✭ 152 (-90.1%)
cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-98.63%)
SANSA-StackBig Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/
Stars: ✭ 130 (-91.53%)
quickstepQuickstep project
Stars: ✭ 22 (-98.57%)
RadonRadonDB is an open source, cloud-native MySQL database for building global, scalable cloud services
Stars: ✭ 1,584 (+3.19%)
Papers4DataAchitectCollect papers for data engineering such as OLTP/OLAP/ETL/DistributedStorage.
Stars: ✭ 17 (-98.89%)
flockFlock: A Low-Cost Streaming Query Engine on FaaS Platforms
Stars: ✭ 232 (-84.89%)
CubesLight-weight Python OLAP framework for multi-dimensional data analysis
Stars: ✭ 1,393 (-9.25%)
dpkb大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (-91.99%)
Flink Recommandsystem Demo🚁🚀基于Flink实现的商品实时推荐系统。flink统计商品热度,放入redis缓存,分析日志信息,将画像标签和实时记录放入Hbase。在用户发起推荐请求后,根据用户画像重排序热度榜,并结合协同过滤和标签两个推荐模块为新生成的榜单的每一个产品添加关联产品,最后返回新的用户列表。
Stars: ✭ 3,115 (+102.93%)
MiningBusiness Intelligence (BI) in Python, OLAP
Stars: ✭ 1,128 (-26.51%)
Flink SpectorFramework for Apache Flink unit tests
Stars: ✭ 190 (-87.62%)
DIRECTDIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-98.7%)
NussknackerProcess authoring tool for Apache Flink
Stars: ✭ 182 (-88.14%)
Awesome GraphA curated list of resources for graph databases and graph computing tools
Stars: ✭ 717 (-53.29%)
flink-connector-kudu基于Apache-bahir-kudu-connector的flink-connector-kudu,支持Flink1.11.x DynamicTableSource/Sink,支持Range分区等
Stars: ✭ 40 (-97.39%)
metriqlThe metrics layer for your data. Join us at https://metriql.com/slack
Stars: ✭ 227 (-85.21%)
Lidea大型分布式系统实时监控平台
Stars: ✭ 28 (-98.18%)