collabH / Repository
Licence: MIT
A personal learning knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92
Programming Languages
shell
Projects that are alternatives of or similar to Repository
God Of Bigdata
Focused on big data learning and interviews; start your road to big data mastery. Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+6430.43%)
Mutual labels: zookeeper, kafka, spark, hadoop, flink, hive, hbase, hdfs
Bigdata Notes
A beginner's guide to big data ⭐
Stars: ✭ 10,991 (+11846.74%)
Mutual labels: zookeeper, kafka, spark, hadoop, hive, mapreduce, hbase, hdfs
Bigdata Interview
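🎯 🌟[Big data interview questions] Big data interview questions I collected online, together with summaries of my own answers. Currently covers interview knowledge summaries for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks.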
Stars: ✭ 857 (+831.52%)
Mutual labels: kafka, spark, hadoop, flink, mapreduce, hbase, hdfs
Szt Bigdata
Shenzhen Metro big data passenger flow analysis system🚇🚄🌟
Stars: ✭ 826 (+797.83%)
Mutual labels: zookeeper, kafka, spark, hadoop, flink, hive, hbase
Bigdataguide
Learn big data from scratch; includes learning videos and interview materials for each stage of big data study.
Stars: ✭ 817 (+788.04%)
Mutual labels: zookeeper, kafka, spark, hadoop, flink, hive, hbase
Bigdata
💎🔥Big data study notes
Stars: ✭ 488 (+430.43%)
Mutual labels: zookeeper, hadoop, hive, mapreduce, hbase, hdfs
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (+75%)
Mutual labels: zookeeper, spark, hadoop, hive, hbase, hdfs
Dockerfiles
50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu
Stars: ✭ 847 (+820.65%)
Mutual labels: zookeeper, kafka, spark, hadoop, hbase
Wedatasphere
WeDataSphere is a financial-grade, one-stop open-source suite for big data platforms. The source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+304.35%)
Mutual labels: kafka, spark, hadoop, hive, hbase
bigdata-doc
Big data study notes, learning roadmaps, and a collection of technical case studies.
Stars: ✭ 37 (-59.78%)
Mutual labels: hive, hadoop, hdfs, mapreduce, flink
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-68.48%)
Mutual labels: hive, hadoop, hbase, zookeeper, flink
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-10.87%)
Mutual labels: zookeeper, spark, hadoop, hive, hbase
Bdp Dataplatform
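A data platform for big data ecosystem solutions: a big data solution built on big data, data platforms, microservices, machine learning, e-commerce, automated operations, DevOps, a container deployment platform, and data platform collection, storage, computing, development, and application building.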
Stars: ✭ 456 (+395.65%)
Mutual labels: spark, flink, hive, mapreduce, hbase
yuzhouwan
Code Library for My Blog
Stars: ✭ 39 (-57.61%)
Mutual labels: spark, hadoop, hbase, zookeeper
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+457.61%)
Mutual labels: kafka, spark, olap, hdfs
swordfish
Open-source distributed workflow scheduling tool that also supports streaming tasks.
Stars: ✭ 35 (-61.96%)
Mutual labels: spark, hive, hadoop, hbase
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-84.78%)
Mutual labels: spark, hadoop, hbase, hdfs
Nagios Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (+986.96%)
Mutual labels: zookeeper, kafka, hadoop, hbase
repository
Overview
- A personal learning knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
RoadMap
Fundamentals
Data Structures
Distributed Systems Theory
Computer Science Theory
Scala
JVM
Java
JDK Source Code
todo
Algorithms
BigData
datalake
iceberg
rocksDB
Hadoop
- Study notes on the Hadoop ecosystem in the broad sense, mainly reading notes and source code analysis for HDFS, MapReduce, and Yarn.
HDFS
MapReduce
Yarn
High Availability Configuration
Canal
Debezium
Hive
Spark
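- Mainly covers reading notes on Spark books, analysis of Spark core components, hands-on practice with Spark APIs, and pitfalls encountered running Spark in production.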
Spark Core
Spark SQL
Spark Streaming
Source Code Analysis
Zookeeper
Flume
Kafka
HBase
Sqoop
DolphinScheduler
Flink
- Mainly covers summaries of reading the Flink documentation, reading of the related Flink source code, and notes on new Flink features.
Core
- FlinkOverView
- CheckPoint Mechanism
- TableSQLOverview
- DataStream API
- ProcessFunction API
- Data Source
- Table API
- Flink SQL
- Flink Hive
- Flink CEP
- Flink Function
- DataSource API
SourceCode
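- FlinkSQL Source Code Analysis
- In-Depth Look at the TaskExecutor Memory Model
- Flink Window Implementation and Usage
- Flink Runtime Environment Source Code Analysis
- FlinkTimerService Mechanism Analysis
- StreamSource Source Analysis
- Flink State Management and Checkpoint Mechanism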
Feature
Practice
Connector
monitor
olap
- Mainly covers the Kudu and Impala OLAP engines, along with production practice and paper notes.
Presto
clickhouse
Druid
Kylin
Kudu
paper
Impala
Data Warehouse
Reading Notes
devops
maven
Service Monitoring
mac