big data: A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (+13.33%)
Bigdataguide: Big data learning from scratch, including study videos and interview materials for each stage of learning big data
Stars: ✭ 817 (+2623.33%)
Shifu: An end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+590%)
hadoopoffice: HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (+86.67%)
learning-spark: Tidy up Spark and Hadoop tutorials.
Stars: ✭ 28 (-6.67%)
Apache Spark Hands On: Educational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (+146.67%)
dockerfiles: Multi-container Docker images for the main big data tools (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ...)
Stars: ✭ 29 (-3.33%)
Hadoop Attack Library: A collection of pentest tools and resources targeting Hadoop environments
Stars: ✭ 228 (+660%)
Spline: Data Lineage Tracking And Visualization Solution
Stars: ✭ 306 (+920%)
Sparkrdma: RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+616.67%)
Awesome Learning: Practice source code repository: https://github.com/jast90/bigdata . Search for Jast on WeChat and follow the official account to get the latest technical posts 😯.
Stars: ✭ 197 (+556.67%)
Hadoop For Geoevent: ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-83.33%)
God Of Bigdata: Focused on big data learning and interviews; the road to becoming a big data god starts here. Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+19926.67%)
yuzhouwan: Code Library for My Blog
Stars: ✭ 39 (+30%)
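leaflet heatmap: A simple visualization of Huzhou call data. Assuming the data volume is too large to draw the heatmap directly in the browser, the heatmap rendering step is moved to offline computation. Apache Spark first computes over the data in parallel and is then also used to render the heatmap; leafletjs loads the OpenStreetMap layer and the heatmap layer to give good interactivity. With rendering currently implemented in Apache Spark, the parallel computation is slower than single-machine computation, possibly because Apache Spark is not well suited to this kind of computation or because I did not design the algorithm well. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .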
Stars: ✭ 13 (-56.67%)
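Bigdata Interview: 🎯 🌟[Big data interview questions] Big data interview questions I collected online, together with my own summarized answers. Currently covers interview-question summaries for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks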
Stars: ✭ 857 (+2756.67%)
Hadoopcryptoledger: Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (+320%)
the-apache-ignite-book: All code samples, scripts and more in-depth examples for The Apache Ignite Book. Includes Apache Ignite 2.6 or above
Stars: ✭ 65 (+116.67%)
deadman-check: Monitoring companion for Nomad periodic jobs and Cron
Stars: ✭ 49 (+63.33%)
flink-learn: Learning Flink: Flink CEP, Flink Core, Flink SQL
Stars: ✭ 70 (+133.33%)
schier.co: 🏡 My personal website and blog powered by Go, Tailwind, Postgres
Stars: ✭ 19 (-36.67%)
presto: Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data
Stars: ✭ 91 (+203.33%)
cds: Data syncing in golang for ClickHouse.
Stars: ✭ 839 (+2696.67%)
liquibase-impala: Liquibase extension to add Impala Database support
Stars: ✭ 23 (-23.33%)
nomad: Dockerized Nomad
Stars: ✭ 33 (+10%)
hadoop-etl-udfs: The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Stars: ✭ 17 (-43.33%)
memex-gate: General Architecture for Text Engineering
Stars: ✭ 47 (+56.67%)
hashidays-london: Code used for the demo of Going Multi-Cloud with Terraform and Nomad
Stars: ✭ 20 (-33.33%)
awesome-bigdata: A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 11,093 (+36876.67%)
sparkucx: A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer
Stars: ✭ 32 (+6.67%)
aaocp: A big data project that analyzes user behavior logs
Stars: ✭ 53 (+76.67%)
coolplayflink: Flink: Stateful Computations over Data Streams
Stars: ✭ 14 (-53.33%)
meetups-archivos: Slides, code, and videos from the meetups, data science days, video calls, and workshops. Data Science Research is a non-profit organization that seeks to spread and decentralize knowledge of Data Science and Artificial Intelligence in Peru, giving opportunities to new talent through MeetUps, Workshops, and Semilleros …
Stars: ✭ 60 (+100%)
rastercube: rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-50%)
damon: Supervisor program to constrain Windows executables running under Nomad's raw_exec driver
Stars: ✭ 83 (+176.67%)
nomad-demo: Vagrant based demo setup for running Hashicorp Nomad
Stars: ✭ 88 (+193.33%)
fsbrowser: Fast desktop client for Hadoop Distributed File System
Stars: ✭ 27 (-10%)
gocast: GoCast is a tool for controlled BGP route announcements from a host
Stars: ✭ 55 (+83.33%)
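Exposure: Exposure is a library that helps with exposure (impression) statistics requirements. It makes it easy to instrument exposure events, with only minimal intrusion into existing code. Supports RV linear, grid, and staggered (waterfall) layouts, horizontally scrolling RV, ScrollView, and other scrolling layouts. Supports configuring the effective exposure area of an item.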
Stars: ✭ 51 (+70%)
darwin: Avro Schema Evolution made easy
Stars: ✭ 26 (-13.33%)
oci-cloudera: Terraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)
Stars: ✭ 20 (-33.33%)
UBA: UEBA Solution for Insider Security. This repo is archived. Thanks!
Stars: ✭ 36 (+20%)
skein: A tool and library for easily deploying applications on Apache YARN
Stars: ✭ 128 (+326.67%)
UnROOT.jl: Native Julia I/O package to work with CERN ROOT files
Stars: ✭ 52 (+73.33%)
hive-jdbc-driver: An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (+3.33%)