Bigdata Interview🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+1352.54%)
waspWASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-67.8%)
fastdata-clusterFast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-66.1%)
Wifi基于wifi抓取信息的大数据查询分析系统
Stars: ✭ 93 (+57.63%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (+55.93%)
beanszooDistributed Java micro-services using ZooKeeper
Stars: ✭ 12 (-79.66%)
datasqueezeHadoop utility to compact small files
Stars: ✭ 18 (-69.49%)
aaocp一个对用户行为日志进行分析的大数据项目
Stars: ✭ 53 (-10.17%)
py-hdfs-mountMount HDFS with fuse, works with kerberos!
Stars: ✭ 13 (-77.97%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-77.97%)
God Of Bigdata专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+10083.05%)
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+588.14%)
fsbrowserFast desktop client for Hadoop Distributed File System
Stars: ✭ 27 (-54.24%)
hive to es同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (-64.41%)
ros hadoopHadoop splittable InputFormat for ROS. Process rosbag with Hadoop Spark and other HDFS compatible systems.
Stars: ✭ 92 (+55.93%)
Jsr203 HadoopA Java NIO file system provider for HDFS
Stars: ✭ 35 (-40.68%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-91.53%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+2662.71%)
CamusMirror of Linkedin's Camus
Stars: ✭ 81 (+37.29%)
Hdfs ShellHDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (+98.31%)
Bigdata💎🔥大数据学习笔记
Stars: ✭ 488 (+727.12%)
knitDeprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
Stars: ✭ 53 (-10.17%)
XlearningAI on Hadoop
Stars: ✭ 1,709 (+2796.61%)
DynamometerA tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Stars: ✭ 122 (+106.78%)
JumbuneJumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (+8.47%)
terasliceScalable data processing pipelines in JavaScript
Stars: ✭ 48 (-18.64%)
skeinA tool and library for easily deploying applications on Apache YARN
Stars: ✭ 128 (+116.95%)
bigdata-funA complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-76.27%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+154.24%)
AkkeeperAn easy way to deploy your Akka services to a distributed environment.
Stars: ✭ 30 (-49.15%)
Tf YarnTrain TensorFlow models on YARN in just a few lines of code!
Stars: ✭ 76 (+28.81%)
Js Stack BoilerplateFinal boilerplate code of the JavaScript Stack from Scratch tutorial –
Stars: ✭ 145 (+145.76%)
Klapzero config, zero dependency bundler for tiny javascript packages
Stars: ✭ 143 (+142.37%)
YernaA Lerna-like tool for managing Javascript monorepos using Yarn
Stars: ✭ 140 (+137.29%)
Gulp Webpack StarterGulp Webpack Starter - fast static website builder. The starter uses gulp toolkit and webpack bundler. Download to get an awesome development experience!
Stars: ✭ 199 (+237.29%)
Ni💡 Use the right package manager
Stars: ✭ 179 (+203.39%)
FoxyA fast, reliable, and secure NPM/Yarn bridge for Composer
Stars: ✭ 137 (+132.2%)
YarnhookRun `yarn install`, `npm install` or `pnpm install` on git hooks automatically
Stars: ✭ 177 (+200%)
Vscode Deploy ReloadedRecoded version of Visual Studio Code extension 'vs-deploy', which provides commands to deploy files to one or more destinations.
Stars: ✭ 129 (+118.64%)
CorepackZero-runtime-dependency package acting as bridge between Node projects and their package managers
Stars: ✭ 196 (+232.2%)
Bolt⚡️ Super-powered JavaScript project management
Stars: ✭ 2,134 (+3516.95%)
BarebonesA barebones boilerplate for getting started on a bespoke front end.
Stars: ✭ 127 (+115.25%)
TeddySpark Streaming监控平台,支持任务部署与告警、自启动
Stars: ✭ 120 (+103.39%)