sunnyandgood / Bigdata
💎🔥 Big data study notes
Stars: ✭ 488
Projects that are alternatives to or similar to Bigdata
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (-67.01%)
Mutual labels: zookeeper, hadoop, hive, mysql, hbase, hdfs
Repository
A personal study knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-81.15%)
Mutual labels: zookeeper, hadoop, hive, mapreduce, hbase, hdfs
Bigdata Notes
A beginner's guide to big data ⭐
Stars: ✭ 10,991 (+2152.25%)
Mutual labels: zookeeper, hadoop, hive, mapreduce, hbase, hdfs
Haproxy Configs
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (-78.28%)
Mutual labels: zookeeper, hadoop, hive, mysql, hbase
God Of Bigdata
Focused on big data study and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+1131.15%)
Mutual labels: zookeeper, hadoop, hive, hbase, hdfs
Szt Bigdata
Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟
Stars: ✭ 826 (+69.26%)
Mutual labels: zookeeper, hadoop, hive, mysql, hbase
Bigdata Interview
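🎯 🌟 [Big data interview questions] Big data interview questions collected online, with the author's own answer summaries. Currently covers interview topics for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks.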
Stars: ✭ 857 (+75.61%)
Mutual labels: hadoop, mapreduce, hbase, hdfs
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-94.06%)
Mutual labels: hive, hadoop, hbase, zookeeper
Nagios Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (+104.92%)
Mutual labels: zookeeper, hadoop, mysql, hbase
Bigdataguide
Big data study: learn big data from scratch, with study videos and interview materials for each stage of learning.
Stars: ✭ 817 (+67.42%)
Mutual labels: zookeeper, hadoop, hive, hbase
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-83.2%)
Mutual labels: zookeeper, hadoop, hive, hbase
xxhadoop
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (-92.42%)
Mutual labels: hive, hadoop, hbase, zookeeper
cloud
Cloud computing: environment setup and configuration files for hadoop, hive, hue, oozie, sqoop, hbase, and zookeeper.
Stars: ✭ 48 (-90.16%)
Mutual labels: hive, hadoop, hbase, zookeeper
BigInsights-on-Apache-Hadoop
Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix
Stars: ✭ 21 (-95.7%)
Mutual labels: hive, hadoop, hbase
wasp
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-96.11%)
Mutual labels: hadoop, hbase, hdfs
Datafaker
Datafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts it into various data sources. A test data generation tool.
Stars: ✭ 327 (-32.99%)
Mutual labels: hive, mysql, hbase
Wedatasphere
WeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (-23.77%)
Mutual labels: hadoop, hive, hbase
Bigdata
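HBase is a database; Hive is a data warehouse
Setting up Hadoop 2.2.0 in pseudo-distributed mode
HDFS distributed file system
- Overview of the Hadoop distributed data analysis system
- Hadoop explained in simple terms
- HDFS fs commands
- HDFS architecture
- RPC (Remote Procedure Call) and the HDFS read/write process
- Error when running hadoop or spark programs on Windows: Could not locate executable null\bin\winutils.exe in the Hadoop binaries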
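MapReduce
MapReduce principles
- MapReduce execution process
- Data types and formats
- The Writable interface and the serialization mechanism
- Partitioner programming
- Custom sort programming
- Combiner programming
- Common MapReduce algorithms
- Inverted index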
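As an illustration of the inverted index topic listed above, here is a minimal sketch of the map, shuffle, and reduce steps in plain Python. It does not use Hadoop; the function names (`map_phase`, `shuffle`, `reduce_phase`, `build_inverted_index`) and the sample documents are hypothetical and only mimic what the framework does.

```python
from collections import defaultdict


def map_phase(doc_id, text):
    """Emit (word, doc_id) pairs, similar to a Mapper's output."""
    for word in text.lower().split():
        yield word, doc_id


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, doc_ids):
    """Collapse the grouped values into a sorted, de-duplicated posting list."""
    return word, sorted(set(doc_ids))


def build_inverted_index(docs):
    """Run map -> shuffle -> reduce over a dict of {doc_id: text}."""
    pairs = [pair for doc_id, text in docs.items()
             for pair in map_phase(doc_id, text)]
    return dict(reduce_phase(word, ids) for word, ids in shuffle(pairs).items())


# Hypothetical sample input
docs = {"a.txt": "hello hadoop", "b.txt": "hello hive hello"}
index = build_inverted_index(docs)
print(index)
```

In a real job the shuffle step is performed by the framework, and a Combiner could de-duplicate document ids on the map side before they cross the network.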
Zookeeper
Hadoop cluster setup
Sqoop
HBase
Hive
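Using Hive (table descriptions are stored in the TBLS table of the hive database, the table's columns in the COLUMNS_V2 table, the table's id in the CDS table, and the HDFS storage path in the SDS table)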
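To show how the metadata tables named above relate to each other, the following sketch builds a heavily simplified stand-in for them in an in-memory SQLite database and joins them. It is an assumption-laden mock, not the real metastore: the actual tables have many more columns, the column names used here (`TBL_ID`, `TBL_NAME`, `SD_ID`, `CD_ID`, `LOCATION`, `COLUMN_NAME`, `TYPE_NAME`, `INTEGER_IDX`) are assumed to match the metastore schema, and the `people` table, its HDFS path, and the `describe_table` helper are hypothetical.

```python
import sqlite3

# In-memory mock of four metastore tables (simplified, assumed column names).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE CDS (CD_ID INTEGER PRIMARY KEY);
CREATE TABLE SDS (SD_ID INTEGER PRIMARY KEY, CD_ID INTEGER, LOCATION TEXT);
CREATE TABLE TBLS (TBL_ID INTEGER PRIMARY KEY, TBL_NAME TEXT, SD_ID INTEGER);
CREATE TABLE COLUMNS_V2 (CD_ID INTEGER, COLUMN_NAME TEXT,
                         TYPE_NAME TEXT, INTEGER_IDX INTEGER);

-- Hypothetical sample rows for one table named 'people'
INSERT INTO CDS VALUES (1);
INSERT INTO SDS VALUES (1, 1, 'hdfs://namenode:9000/user/hive/warehouse/people');
INSERT INTO TBLS VALUES (1, 'people', 1);
INSERT INTO COLUMNS_V2 VALUES (1, 'id', 'bigint', 0);
INSERT INTO COLUMNS_V2 VALUES (1, 'name', 'string', 1);
""")


def describe_table(conn, table_name):
    """Return (table, HDFS location, column, type) rows in column order."""
    return conn.execute(
        """
        SELECT t.TBL_NAME, s.LOCATION, c.COLUMN_NAME, c.TYPE_NAME
        FROM TBLS t
        JOIN SDS s ON s.SD_ID = t.SD_ID          -- storage descriptor: HDFS path
        JOIN COLUMNS_V2 c ON c.CD_ID = s.CD_ID   -- column definitions
        WHERE t.TBL_NAME = ?
        ORDER BY c.INTEGER_IDX
        """,
        (table_name,),
    ).fetchall()


rows = describe_table(conn, "people")
for row in rows:
    print(row)
```

The same join, run against the actual database that backs the metastore, is one way to check where a table's data lives on HDFS, provided the schema in use matches these assumptions.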
Flume (log collection system)
Scripts: timers
Linux
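Linux file/directory management commands
- Linux cd command (changes the current directory in the file system)
- Linux ls command (displays file and directory information)
- Linux touch command (creates an empty file)
- Linux cp command (copies files)
- Linux mv command (moves or renames files)
- Linux rm command (deletes files)
- Linux ln command (creates a synchronized link to a file at another location)
- Linux pwd command (displays the working directory)
- Linux scp command (copies files and directories between hosts)
- Linux mkdir command (creates directories)
- Linux rmdir command (removes empty directories)
- Linux tree command (lists all files under a given directory, including files in subdirectories)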
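Linux file editing commands
- Linux cat command (displays file contents)
- Linux more command (displays content one page at a time)
- Linux less command (browses a file freely, forward and backward)
- Linux tail command (writes a file to standard output starting from a specified point)
- Linux head command (displays the beginning of a file)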
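Linux disk management commands
- Linux df command (displays available space and other information for the specified file systems)
- Linux du command (displays the size of directories or files)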
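Linux system management commands
A process is an instance of a program in execution. When a program is run, the kernel first loads the program code into virtual memory, allocates memory for the program's variables, and sets up bookkeeping data structures for the process. These record information about the process, such as its process ID, user ID, group ID, and its state (for example, running or terminated).
- Linux ps command (lists the processes currently running on the system)
- Linux kill command (terminates a running program or job)
- Linux top command (displays live, continuously updated information about Linux processes)
- Linux free command (displays memory status)
- Linux clear command (clears the console)
- Linux wc command (counts the bytes, words, and lines in the specified files and prints the result)
- Linux stat command (displays inode contents)
- Linux which command (locates a command's executable in PATH)
- Linux whoami command (displays the current user name)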
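The per-process bookkeeping described in this section (process ID, user ID, group ID, running or terminated state) can be observed from user space. The following is a minimal Python sketch, assuming a Unix-like system (`os.getuid` and `os.getgid` are not available on Windows); the short-lived child process is purely illustrative.

```python
import os
import subprocess
import sys

# Identity the kernel records for the current process.
print("pid:", os.getpid(), "uid:", os.getuid(), "gid:", os.getgid())

# Start a child process; the kernel gives it its own PID and bookkeeping entry.
child = subprocess.Popen(
    [sys.executable, "-c", "import time; time.sleep(0.2)"]
)

# poll() returns None while the child is still running.
state_while_running = child.poll()

# wait() blocks until the child terminates and returns its exit code.
exit_code = child.wait()

print("child pid:", child.pid)
print("was running when polled:", state_while_running is None)
print("exit code after termination:", exit_code)
```

The same information is what `ps` and `top` report for every process on the system, and `kill` acts on the PID shown here.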
Linux network commands
- Linux scp command (copies files and directories between hosts)
- Linux netstat command (checks the network connections on a host's ports)