All Projects → kairen → learning-spark

kairen / learning-spark

Licence: other
Tidy up Spark and Hadoop tutorials.

Programming Languages

java
68154 projects - #9 most used programming language
shell
77523 projects
r
7636 projects

Projects that are alternatives of or similar to learning-spark

Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+39153.57%)
Mutual labels:  hadoop, bigdata
Awesome Learning
实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (+603.57%)
Mutual labels:  hadoop, bigdata
Bigdata Notebook
Stars: ✭ 100 (+257.14%)
Mutual labels:  hadoop, bigdata
Bigdata Interview
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+2960.71%)
Mutual labels:  hadoop, bigdata
Hadoop Attack Library
A collection of pentest tools and resources targeting Hadoop environments
Stars: ✭ 228 (+714.29%)
Mutual labels:  hadoop, bigdata
Apache Spark Hands On
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (+164.29%)
Mutual labels:  hadoop, bigdata
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (+3.57%)
Mutual labels:  hadoop, bigdata
Big data architect skills
一个大数据架构师应该掌握的技能
Stars: ✭ 400 (+1328.57%)
Mutual labels:  hadoop, bigdata
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+667.86%)
Mutual labels:  hadoop, bigdata
Shifu
An end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+639.29%)
Mutual labels:  hadoop, bigdata
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-82.14%)
Mutual labels:  hadoop, bigdata
bigdata-doc
大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (+32.14%)
Mutual labels:  hadoop, bigdata
Bigdataguide
大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
Stars: ✭ 817 (+2817.86%)
Mutual labels:  hadoop, bigdata
qs-hadoop
大数据生态圈学习
Stars: ✭ 18 (-35.71%)
Mutual labels:  hadoop, bigdata
God Of Bigdata
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+21357.14%)
Mutual labels:  hadoop, bigdata
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (+350%)
Mutual labels:  hadoop, bigdata
leaflet heatmap
简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-53.57%)
Mutual labels:  hadoop, bigdata
Spline
Data Lineage Tracking And Visualization Solution
Stars: ✭ 306 (+992.86%)
Mutual labels:  hadoop, bigdata
Javaorbigdata Interview
Java开发者或者大数据开发者面试知识点整理
Stars: ✭ 203 (+625%)
Mutual labels:  hadoop, bigdata
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (+21.43%)
Mutual labels:  hadoop, bigdata

Spark training

本項目將所有於分享會以及課程上,所接觸的系統建置、Spark API 撰寫、HDFS 操作...等教學與整理,主要授課人員為 NUTC imac 內部團隊自我訓練。

主要包含項目

  1. Spark 概念、部署與基本範例
  2. Hadoop 概念、部署與基本範例
  3. Spark 與 Hadoop 相關系統建置整理

以上內容我們會逐一整理,並寫成文件來分享給大家。

參與貢獻

任何團隊成員都可以對該 git 做貢獻,未來也會請大家針對不一樣的作業進行提交,一個基本的貢獻流程如下所示:

  1. Githubfork 到自己的 Repository,例如:<User>/learning-spark.git,然後 clone到 local 端,並設定 Git 使用者資訊。
git clone https://github.com/kairen/learning-spark.git
cd spark-tutorial
git config user.name "User"
git config user.email [email protected]
  1. 修改程式碼或頁面後,透過 commit 來提交到自己的 Repository:
git commit -am "Fix issue #1: change helo to hello"
git push

若新增採用一般文字訊息,如Add Spark MLlib example ...

  1. 在 GitHub 上提交一個 Pull Request。
  2. 持續的針對 Project Repository 進行更新內容:
git remote add upstream https://github.com/kairen/learning-spark.git
git fetch upstream
git checkout master
git rebase upstream/master
git push -f origin master
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].