Multiple Docker container images for the main Big Data tools (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ...)

Stars: ✭ 29 (-3.33%)

Mutual labels: hadoop, bigdata

bigdata-doc

Big data study notes, a learning roadmap, and a collection of technical case studies.

Stars: ✭ 37 (+23.33%)

Mutual labels: hadoop, bigdata

Hadoop Attack Library

A collection of pentest tools and resources targeting Hadoop environments

Stars: ✭ 228 (+660%)

Mutual labels: hadoop, bigdata

Spline

Data Lineage Tracking And Visualization Solution

Stars: ✭ 306 (+920%)

Mutual labels: hadoop, bigdata

Sparkrdma

RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark

Stars: ✭ 215 (+616.67%)

Mutual labels: hadoop, bigdata

Bigdata Notes

A beginner's guide to big data ⭐

Stars: ✭ 10,991 (+36536.67%)

Mutual labels: hadoop, bigdata

Awesome Learning

Hands-on source code repository: https://github.com/jast90/bigdata . Search for Jast on WeChat and follow the official account to get the latest technical posts 😯.

Stars: ✭ 197 (+556.67%)

Mutual labels: hadoop, bigdata

Hadoop For Geoevent

ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.

Stars: ✭ 5 (-83.33%)

Mutual labels: hadoop, bigdata

God Of Bigdata

Focused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

Stars: ✭ 6,008 (+19926.67%)

Mutual labels: hadoop, bigdata

yuzhouwan

Code Library for My Blog

Stars: ✭ 39 (+30%)

Mutual labels: hadoop, bigdata

leaflet heatmap

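A simple visualization of phone-call data for Huzhou. Assuming the data volume is too large to draw the heatmap directly in the browser, the heatmap rendering step is moved to offline computation and analysis. The data is first processed in parallel with Apache Spark, the heatmap is then drawn with Apache Spark as well, and leafletjs loads the OpenStreetMap layer and the heatmap layer to provide good interactivity. With the current Apache Spark implementation, the parallel computation is slower than a single-machine computation, possibly because Apache Spark is not well suited to this kind of work or because I did not design the algorithm well. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .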

Stars: ✭ 13 (-56.67%)

Mutual labels: hadoop, bigdata

Bigdata Interview

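🎯 🌟 [Big data interview questions] A collection of big data interview questions I gathered online, together with my own summarized answers. Currently covers interview knowledge summaries for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks.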

Stars: ✭ 857 (+2756.67%)

Mutual labels: hadoop, bigdata

Hadoopcryptoledger

Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive

Stars: ✭ 126 (+320%)

Mutual labels: hadoop, bigdata

the-apache-ignite-book

All code samples, scripts, and more in-depth examples for The Apache Ignite Book. Includes Apache Ignite 2.6 or above.

Stars: ✭ 65 (+116.67%)

Mutual labels: hadoop, bigdata

qs-hadoop

Learning the big data ecosystem

Stars: ✭ 18 (-40%)

Mutual labels: hadoop, bigdata

deadman-check

Monitoring companion for Nomad periodic jobs and Cron

Stars: ✭ 49 (+63.33%)

Mutual labels: nomad

flink-learn

Learning Flink: Flink CEP, Flink Core, Flink SQL

Stars: ✭ 70 (+133.33%)

Mutual labels: bigdata

Movies-Analytics-in-Spark-and-Scala

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

Stars: ✭ 47 (+56.67%)

Mutual labels: hadoop

schier.co

🏡 My personal website and blog powered by Go, Tailwind, Postgres

Stars: ✭ 19 (-36.67%)

Mutual labels: nomad

presto

Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data

Stars: ✭ 91 (+203.33%)

Mutual labels: hadoop

cds

Data syncing in golang for ClickHouse.

Stars: ✭ 839 (+2696.67%)

Mutual labels: bigdata

hadoop-ecosystem

Visualizations of the Hadoop Ecosystem

Stars: ✭ 20 (-33.33%)

Mutual labels: hadoop

liquibase-impala

Liquibase extension to add Impala Database support

Stars: ✭ 23 (-23.33%)

Mutual labels: hadoop

nomad

Dockerized Nomad

Stars: ✭ 33 (+10%)

Mutual labels: nomad

hadoop-etl-udfs

The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL

Stars: ✭ 17 (-43.33%)

Mutual labels: hadoop

memex-gate

General Architecture for Text Engineering

Stars: ✭ 47 (+56.67%)

Mutual labels: hadoop

web-click-flow

Offline log analysis of website clickstreams

Stars: ✭ 14 (-53.33%)

Mutual labels: hadoop

hashidays-london

Code used for the demo of Going Multi-Cloud with Terraform and Nomad

Stars: ✭ 20 (-33.33%)

Mutual labels: nomad

awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

Stars: ✭ 11,093 (+36876.67%)

Mutual labels: bigdata

sparkucx

A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer

Stars: ✭ 32 (+6.67%)

Mutual labels: hadoop

aaocp

A big data project that analyzes user behavior logs

Stars: ✭ 53 (+76.67%)

Mutual labels: hadoop

coolplayflink

Flink: Stateful Computations over Data Streams

Stars: ✭ 14 (-53.33%)

Mutual labels: bigdata

BigDataTools

Tools for big data

Stars: ✭ 36 (+20%)

Mutual labels: bigdata

meetups-archivos

Slides, code, and videos from the meetups, data science days, video calls, and workshops. Data Science Research is a non-profit organization that seeks to spread and decentralize knowledge of Data Science and Artificial Intelligence in Peru, giving opportunities to new talent through MeetUps, Workshops, and Semilleros …

Stars: ✭ 60 (+100%)

Mutual labels: bigdata

rastercube

rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)

Stars: ✭ 15 (-50%)

Mutual labels: hadoop

damon

Supervisor program to constrain Windows executables running under Nomad's raw_exec driver

Stars: ✭ 83 (+176.67%)

Mutual labels: nomad

nomad-demo

Vagrant-based demo setup for running HashiCorp Nomad