JumbuneJumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (+20.75%)
XlearningAI on Hadoop
Stars: ✭ 1,709 (+3124.53%)
AkkeeperAn easy way to deploy your Akka services to a distributed environment.
Stars: ✭ 30 (-43.4%)
docker-hadoopDocker image for main Apache Hadoop components (Yarn/Hdfs)
Stars: ✭ 59 (+11.32%)
Tf YarnTrain TensorFlow models on YARN in just a few lines of code!
Stars: ✭ 76 (+43.4%)
Bigdata Interview🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+1516.98%)
fastdata-clusterFast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-62.26%)
beanszooDistributed Java micro-services using ZooKeeper
Stars: ✭ 12 (-77.36%)
waspWASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-64.15%)
jumbo🐘 A local Hadoop cluster bootstrapper using Vagrant, Ansible, and Ambari.
Stars: ✭ 17 (-67.92%)
framequerySQL on dataframes - pandas and dask
Stars: ✭ 63 (+18.87%)
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-35.85%)
TILToday I Learned
Stars: ✭ 43 (-18.87%)
monoreact📦 React workspaces implementation
Stars: ✭ 13 (-75.47%)
AddaxAddax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (+1060.38%)
npm-yarn-benchmarkBash script for comparing NPM and Yarn performance
Stars: ✭ 42 (-20.75%)
mock-spy-module-importJavaScript import/require module testing do's and don'ts with Jest
Stars: ✭ 40 (-24.53%)
TitanDataOperationSystem最好的大数据项目。《Titan数据运营系统》,本项目是一个全栈闭环系统,我们有用作数据可视化的web系统,然后用flume-kafaka-flume进行日志的读取,在hive设计数仓,编写spark代码进行数仓表之间的转化以及ads层表到mysql的迁移,使用azkaban进行定时任务的调度,使用技术:Java/Scala语言,Hadoop、Spark、Hive、Kafka、Flume、Azkaban、SpringBoot,Bootstrap, Echart等;
Stars: ✭ 62 (+16.98%)
gulp-yarnAutomatically install node modules using Yarn. 😻
Stars: ✭ 22 (-58.49%)
node-safe🤠 Make using Node.js safe again with Deno-like permissions
Stars: ✭ 151 (+184.91%)
monopackA JavaScript bundler for node.js monorepo-codebased applications.
Stars: ✭ 52 (-1.89%)
gaiaGaia is a geospatial analysis library jointly developed by Kitware and Epidemico.
Stars: ✭ 29 (-45.28%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+109.43%)
typester✒️ A WYSIWYG that gives you predictable and clean HTML
Stars: ✭ 29 (-45.28%)
big-data-liteSamples to the Oracle Big Data Lite VM
Stars: ✭ 41 (-22.64%)
ibisIBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
Stars: ✭ 48 (-9.43%)
setup-linux-debianInstalls essential JavaScript development programs.
Stars: ✭ 16 (-69.81%)
sinonimo🇧🇷 Sinonimo é um pacote Node que traz sinônimos de palavras em português
Stars: ✭ 14 (-73.58%)
yuzhouwanCode Library for My Blog
Stars: ✭ 39 (-26.42%)
pipelinit-cliAutomatically generates pipelines for your project.
Stars: ✭ 36 (-32.08%)
spark-utillow-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-69.81%)
py-hdfs-mountMount HDFS with fuse, works with kerberos!
Stars: ✭ 13 (-75.47%)
cloud云计算之hadoop、hive、hue、oozie、sqoop、hbase、zookeeper环境搭建及配置文件
Stars: ✭ 48 (-9.43%)
XLearning-GPUqihoo360 xlearning with GPU support; AI on Hadoop
Stars: ✭ 22 (-58.49%)
Bijou.jsBijou.js: Useful JavaScript snippets in one simple library
Stars: ✭ 30 (-43.4%)
YarnGdxYarnGdx is a Libgdx Library for interactive dialogue in games! This is a port of [YarnSpinner](https://github.com/thesecretlab/YarnSpinner) by thesecretlab
Stars: ✭ 25 (-52.83%)
cleanmymacA developer friendly command line cleaner program for modern macOS systems
Stars: ✭ 35 (-33.96%)
DataXServer为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
Stars: ✭ 130 (+145.28%)
bigdata-funA complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-73.58%)
floxFast & furious GroupBy operations for dask.array
Stars: ✭ 42 (-20.75%)
cmuxA set of commands for managing CDH clusters using Cloudera Manager REST API.
Stars: ✭ 34 (-35.85%)