Bigdata Notes大数据入门指南 ⭐
Stars: ✭ 10,991 (+78407.14%)
Mutual labels: hive, hadoop, mapreduce, flume, sqoop
cloud云计算之hadoop、hive、hue、oozie、sqoop、hbase、zookeeper环境搭建及配置文件
Stars: ✭ 48 (+242.86%)
Mutual labels: hive, hadoop, flume, sqoop
bigdata-doc大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (+164.29%)
Mutual labels: hive, hadoop, mapreduce
WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+2557.14%)
Mutual labels: hive, hadoop, etl
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+8435.71%)
Mutual labels: hive, hadoop, etl
GooglePlay-Web-CrawlerMapreduce project by Hadoop, Nutch, AWS EMR, Pig, Tez, Hive
Stars: ✭ 18 (+28.57%)
Mutual labels: hive, hadoop, mapreduce
AddaxAddax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (+4292.86%)
Mutual labels: hive, hadoop, etl
Bigdata💎🔥大数据学习笔记
Stars: ✭ 488 (+3385.71%)
Mutual labels: hive, hadoop, mapreduce
God Of Bigdata专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+42814.29%)
Mutual labels: hive, hadoop, flume
Avro Hadoop StarterExample MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (+685.71%)
Mutual labels: hive, hadoop, mapreduce
DataxDataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (+728.57%)
Mutual labels: hive, hadoop, etl
TitanDataOperationSystem最好的大数据项目。《Titan数据运营系统》,本项目是一个全栈闭环系统,我们有用作数据可视化的web系统,然后用flume-kafaka-flume进行日志的读取,在hive设计数仓,编写spark代码进行数仓表之间的转化以及ads层表到mysql的迁移,使用azkaban进行定时任务的调度,使用技术:Java/Scala语言,Hadoop、Spark、Hive、Kafka、Flume、Azkaban、SpringBoot,Bootstrap, Echart等;
Stars: ✭ 62 (+342.86%)
Mutual labels: hive, hadoop, flume
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (+71.43%)
Mutual labels: hive, hadoop, etl
BigData-News基于Spark2.2新闻网大数据实时系统项目
Stars: ✭ 36 (+157.14%)
Mutual labels: hive, hadoop, flume
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (+557.14%)
Mutual labels: hive, hadoop, mapreduce
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (+900%)
Mutual labels: hive, hadoop, etl
aaocp一个对用户行为日志进行分析的大数据项目
Stars: ✭ 53 (+278.57%)
Mutual labels: hive, hadoop, flume
smart-data-lakeSmart Automation Tool for building modern Data Lakes and Data Pipelines
Stars: ✭ 79 (+464.29%)
Mutual labels: hive, hadoop
learning-hadoop-and-sparkCompanion to Learning Hadoop and Learning Spark courses on Linked In Learning
Stars: ✭ 146 (+942.86%)
Mutual labels: hadoop, mapreduce
hive to es同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (+50%)
Mutual labels: hive, hadoop