All Projects → huangfox → dpkb

huangfox / dpkb

Licence: other
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse

Projects that are alternatives of or similar to dpkb

Bigdataguide
大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
Stars: ✭ 817 (+564.23%)
Mutual labels:  hive, hadoop, hbase, flink
Szt Bigdata
深圳地铁大数据客流分析系统🚇🚄🌟
Stars: ✭ 826 (+571.54%)
Mutual labels:  hive, hadoop, hbase, flink
Haproxy Configs
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (-13.82%)
Mutual labels:  presto, hive, hadoop, hbase
God Of Bigdata
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+4784.55%)
Mutual labels:  hive, hadoop, hbase, flink
Repository
个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-25.2%)
Mutual labels:  hive, hadoop, hbase, flink
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-76.42%)
Mutual labels:  hive, hadoop, hbase, flink
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (+30.89%)
Mutual labels:  presto, hive, hadoop, hbase
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+3624.39%)
Mutual labels:  presto, hive, hadoop
Bigdata
💎🔥大数据学习笔记
Stars: ✭ 488 (+296.75%)
Mutual labels:  hive, hadoop, hbase
Dockerfiles
50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu
Stars: ✭ 847 (+588.62%)
Mutual labels:  presto, hadoop, hbase
BigData-News
基于Spark2.2新闻网大数据实时系统项目
Stars: ✭ 36 (-70.73%)
Mutual labels:  hive, hadoop, hbase
Dataspherestudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+871.54%)
Mutual labels:  hive, hadoop, flink
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-33.33%)
Mutual labels:  hive, hadoop, hbase
Bdp Dataplatform
大数据生态解决方案数据平台:基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。
Stars: ✭ 456 (+270.73%)
Mutual labels:  hive, hbase, flink
Wedatasphere
WeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+202.44%)
Mutual labels:  hive, hadoop, hbase
Presto
The official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+10434.15%)
Mutual labels:  presto, hive, hadoop
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+8835.77%)
Mutual labels:  hive, hadoop, hbase
litemall-dw
基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
Stars: ✭ 36 (-70.73%)
Mutual labels:  hive, hbase, flink
swordfish
Open-source distribute workflow schedule tools, also support streaming task.
Stars: ✭ 35 (-71.54%)
Mutual labels:  hive, hadoop, hbase
Wifi
基于wifi抓取信息的大数据查询分析系统
Stars: ✭ 93 (-24.39%)
Mutual labels:  hive, hadoop, hbase

DPKB

大数据相关文章汇总(知识库) 持续更新中(2023-01)

一、开源组件

Hadoop

1)官网、社区、博客

Hive

1)官网、社区、博客

2)专栏

3)大厂实践

Presto、Trino

1)官网、社区、博客

2)专栏

3)大厂实践

Spark

1)官网、社区、博客

2)专栏

3)大厂实践

Flink

1)官网、社区、博客

2)专栏

教程

3)大厂实践

Kudu

1)官网、社区、博客

2)专栏

3)大厂实践

4)其他

HBase

1)官网、社区、博客

2)专栏

3)大厂实践

4)其他

ClickHouse

1)官网、社区、博客

2)专栏

3)大厂实践

4)其他

Doris

1)官网、社区、博客

2)专栏

3)案例实践

StarRocks

1)官网、社区、博客

2) 专栏

Iceberg

1)官网、社区、博客

2)应用

Hudi

1)官网、社区、博客

2)应用

Calcite

1)官网、社区、博客

2)应用

DolphinScheduler

二、大数据应用

大数据架构

数仓相关

数据治理、数据资产、元数据管理

元数据管理

数据湖

推荐系统

基础

技术博客

三、资源汇总

大厂技术博客

大数据相关网站

相关开源项目

相关论文

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].