All Projects → zunzhuowei → qs-hadoop

zunzhuowei / qs-hadoop

Licence: other
大数据生态圈学习

Programming Languages

java
68154 projects - #9 most used programming language
scala
5932 projects

Projects that are alternatives of or similar to qs-hadoop

Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+60961.11%)
Mutual labels:  hadoop, storm, bigdata, mapreduce
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+5061.11%)
Mutual labels:  bigdata, spark-streaming, mapreduce
xxhadoop
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (+105.56%)
Mutual labels:  hadoop, storm, spark-streaming
Javaorbigdata Interview
Java开发者或者大数据开发者面试知识点整理
Stars: ✭ 203 (+1027.78%)
Mutual labels:  hadoop, storm, bigdata
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (+88.89%)
Mutual labels:  hadoop, bigdata, mapreduce
Bigdata Interview
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+4661.11%)
Mutual labels:  hadoop, bigdata, mapreduce
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (+88.89%)
Mutual labels:  hadoop, bigdata, spark-streaming
Bigdata Notebook
Stars: ✭ 100 (+455.56%)
Mutual labels:  hadoop, storm, bigdata
bigdata-doc
大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (+105.56%)
Mutual labels:  hadoop, bigdata, mapreduce
interview-refresh-java-bigdata
a one-stop repo to lookup for code snippets of core java concepts, sql, data structures as well as big data. It also consists of interview questions asked in real-life.
Stars: ✭ 25 (+38.89%)
Mutual labels:  spark-streaming, mapreduce
Awesome Learning
实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (+994.44%)
Mutual labels:  hadoop, bigdata
Data-pipeline-project
Data pipeline project
Stars: ✭ 18 (+0%)
Mutual labels:  hadoop, mapreduce
Bigdata Playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+883.33%)
Mutual labels:  hadoop, spark-streaming
Movie recommend
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+11522.22%)
Mutual labels:  hadoop, spark-streaming
Recommendsys
推荐项目(实时推荐和离线推荐)
Stars: ✭ 198 (+1000%)
Mutual labels:  hadoop, storm
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (+600%)
Mutual labels:  hadoop, bigdata
Asakusafw
Asakusa Framework
Stars: ✭ 114 (+533.33%)
Mutual labels:  hadoop, mapreduce
Shifu
An end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+1050%)
Mutual labels:  hadoop, bigdata
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+1094.44%)
Mutual labels:  hadoop, bigdata
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (+61.11%)
Mutual labels:  hadoop, bigdata

qs-hadoop

此项目用于学习大数据hadoop生态圈以及storm、spark生态圈

hadoop

hadoop-Ecosphere

spark-Ecosphere.jpg

组织结构

qs-hadoop
├
├── qs-hadoop-elasticsearch-springboot -- elasticsearch集成springboot以及基本使用
|
├── qs-hadoop-elasticsearch-starter -- elasticsearch开始使用
|
├── qs-hadoop-file -- 学习相关的资料文件、以及遇到的坑点笔记
|
├── qs-hadoop-flink -- 分布式数据流处理和批量数据处理框架 flink 初次见面
|
├── qs-hadoop-hdfs -- hadoop核心框架-分布式文件系统的java api的基本使用
|
├── qs-hadoop-ipParser -- 一个ip地址数据库,用于解析ip获取地址
|
├── qs-hadoop-kafka -- kafka生产者、消费者使用
|
├── qs-hadoop-logger -- 用于产生类似nginx访问日志的产生
|
├── qs-hadoop-mapreduce -- hadoop核心框架-分布式计算框架mapreduce java api编程实现
|
├── qs-hadoop-spark -- 基于sparkContext的wordcount Demo
|
├── qs-hadoop-sparkSQL -- 基于scala编程的spark核心dataset、dataframe的使用以及hive on spark等使用
|
├── qs-hadoop-sparkStream -- 基于scala编程的sparkstreaming是基本使用,分别集成flume、kafka日志收集
|
├── qs-hadoop-sparkStream-action -- 基于scala编程的sparkstreaming实战,集成flume、kafka做实时流处理的日志分析项目实战
|
├── qs-hadoop-streaming-action-webUi -- 基于java springboot编程的sparkstreaming实战数据图像化展示web ui界面(echarts)
|
├── qs-hadoop-userBehaviorLog -- 基于java编程的Mapreduce nginx日志用户行为离线处理分析,统计nginx用户访问日志的os个数
|
├── qs-hadoop-userLog-scala -- 基于scala编程的spark nginx日志用户行为离线处理分析
|
├── qs-hadoop-webUi -- 基于java servlet编程的spark nginx日志用户行为离线处理分析的数据图像化展示web ui界面(echarts)
|
├── qs-hadoop-spring -- HDFS集成java spring开源框架的基本使用
|
├── qs-hadoop-springboot -- HDFS集成java springboot开源框架的基本使用
|
├── qs-hadoop-storm -- storm实时流计算的使用

涉及到的编程语言及技术框架

编程语言

  • java 1.8.0_161
  • scala 2.11.12

技术框架及选型

  • hadoop 2.6.0 (cdh5.7.0)

  • spark 2.1.0 (cdh5.7.0)

  • flink 1.5.0 (cdh5.7.0)

  • flume 1.6.0 (cdh5.7.0)

  • kafka kafka_2.11-0.9.0.0

  • hbase 1.2.0 (cdh5.7.0)

  • elasticsearch 5.2.0

  • hive 1.1.0 (cdh5.7.0)

  • spring boot 2.0.3.RELEASE

  • echarts 3.3.1

  • zookeeper-3.4.5 (cdh5.7.0)

  • storm 1.1.2

结束语

本项目仅用于学习大数据生态圈使用,很多地方都是仅仅是demo形式, 还有很多东西需要改进。 项目随着学习进度持续更新...

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].