Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

WeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!

Stars: ✭ 372 (-56.59%)

Mutual labels: kafka, spark, hadoop, hbase

bigdata-fun

A complete (distributed) BigData stack, running in containers

Stars: ✭ 14 (-98.37%)

Mutual labels: spark, hadoop, hbase, hdfs

Devops Python Tools

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

Stars: ✭ 406 (-52.63%)

Mutual labels: spark, hadoop, hbase, hdfs

Javaorbigdata Interview

Java开发者或者大数据开发者面试知识点整理

Stars: ✭ 203 (-76.31%)

Mutual labels: spark, hadoop, bigdata, interview

Sparkstreaming

💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算)；🚀 支持运行过程中增删topic；🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。

Stars: ✭ 179 (-79.11%)

Mutual labels: kafka, spark, flink, hbase

dockerfiles

Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )

Stars: ✭ 29 (-96.62%)

Mutual labels: hadoop, bigdata, hbase, flink

Flink Learning

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例，还有 Flink 落地应用的大型项目案例（PVUV、日志存储、百亿数据实时去重、监控告警）分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》

Stars: ✭ 11,378 (+1227.65%)

Mutual labels: kafka, spark, flink, hbase

Bigdata

💎🔥大数据学习笔记

Stars: ✭ 488 (-43.06%)

Mutual labels: hadoop, mapreduce, hbase, hdfs

Bdp Dataplatform

大数据生态解决方案数据平台：基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。

Stars: ✭ 456 (-46.79%)

Mutual labels: spark, flink, mapreduce, hbase

leaflet heatmap

简单的可视化湖州通话数据假设数据量很大，没法用浏览器直接绘制热力图，把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后，再使用Apache Spark绘制热力图，然后用leafletjs加载OpenStreetMap图层和热力图图层，以达到良好的交互效果。现在使用Apache Spark实现绘制，可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法，并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .

Stars: ✭ 13 (-98.48%)

Mutual labels: spark, hadoop, bigdata, hdfs

wasp

WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.

Stars: ✭ 19 (-97.78%)

Mutual labels: yarn, hadoop, hbase, hdfs

yuzhouwan

Code Library for My Blog

Stars: ✭ 39 (-95.45%)

Mutual labels: spark, hadoop, bigdata, hbase

View All Similar Projects ➔

大数据面试题汇总与答案分享


Hadoop	Hive	Spark	Flink	HBase	Kafka	Zookeeper

一、Hadoop

二、Hive

三、Spark

四、Flink

五、HBase

六、Kafka

七、Zookeeper

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 857

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗