learning-hadoop-and-sparkCompanion to Learning Hadoop and Learning Spark courses on Linked In Learning
Stars: ✭ 146 (+274.36%)
Mutual labels: hadoop, mapreduce, dataproc
BehemothBehemoth is an open source platform for large scale document analysis based on Apache Hadoop.
Stars: ✭ 286 (+633.33%)
Mutual labels: hadoop, mapreduce
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-12.82%)
Mutual labels: hadoop, mapreduce
Bigdata💎🔥大数据学习笔记
Stars: ✭ 488 (+1151.28%)
Mutual labels: hadoop, mapreduce
qs-hadoop大数据生态圈学习
Stars: ✭ 18 (-53.85%)
Mutual labels: hadoop, mapreduce
web-click-flow网站点击流离线日志分析
Stars: ✭ 14 (-64.1%)
Mutual labels: hadoop, mapreduce
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+56433.33%)
Mutual labels: hadoop, mapreduce
CascadingCascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster. See https://github.com/Cascading/cascading for the release repository.
Stars: ✭ 318 (+715.38%)
Mutual labels: hadoop, mapreduce
Data Algorithms Book MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+2333.33%)
Mutual labels: hadoop, mapreduce
SrcA light-weight distributed stream computing framework for Golang
Stars: ✭ 67 (+71.79%)
Mutual labels: hadoop, mapreduce
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (+135.9%)
Mutual labels: hadoop, mapreduce
Avro Hadoop StarterExample MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (+182.05%)
Mutual labels: hadoop, mapreduce
GooglePlay-Web-CrawlerMapreduce project by Hadoop, Nutch, AWS EMR, Pig, Tez, Hive
Stars: ✭ 18 (-53.85%)
Mutual labels: hadoop, mapreduce
bigdata-doc大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (-5.13%)
Mutual labels: hadoop, mapreduce
Bigdata Interview🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+2097.44%)
Mutual labels: hadoop, mapreduce
Bigdata Notes大数据入门指南 ⭐
Stars: ✭ 10,991 (+28082.05%)
Mutual labels: hadoop, mapreduce
AsakusafwAsakusa Framework
Stars: ✭ 114 (+192.31%)
Mutual labels: hadoop, mapreduce
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+451.28%)
Mutual labels: hadoop
kafka-connect-fsKafka Connect FileSystem Connector
Stars: ✭ 107 (+174.36%)
Mutual labels: hadoop