MoosefsMooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+831.82%)
Storm Camel ExampleReal-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.
Stars: ✭ 28 (-74.55%)
ChukwaMirror of Apache Chukwa
Stars: ✭ 77 (-30%)
Docker HadoopA Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-50.91%)
KyloKylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+732.73%)
CamusMirror of Linkedin's Camus
Stars: ✭ 81 (-26.36%)
Jsr203 HadoopA Java NIO file system provider for HDFS
Stars: ✭ 35 (-68.18%)
Wifi基于wifi抓取信息的大数据查询分析系统
Stars: ✭ 93 (-15.45%)
Dockerfiles50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu
Stars: ✭ 847 (+670%)
Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-32.73%)
LikelikeAn implementation of locality sensitive hashing with Hadoop
Stars: ✭ 58 (-47.27%)
Bigdataguide大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
Stars: ✭ 817 (+642.73%)
Hadoop cookbookCookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-25.45%)
Basehttps://www.researchgate.net/profile/Rajah_Iyer
Stars: ✭ 48 (-56.36%)
K8s Practicefollow the geekbang's k8s mooc
Stars: ✭ 94 (-14.55%)
Sagefy🔭 Learn anything, adapted for you. Free.
Stars: ✭ 80 (-27.27%)
AkkeeperAn easy way to deploy your Akka services to a distributed environment.
Stars: ✭ 30 (-72.73%)
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+986.36%)
Stormtweetssentimentd3vizComputes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.
Stars: ✭ 25 (-77.27%)
Deeplearning2020course materials for introduction to deep learning 2020
Stars: ✭ 90 (-18.18%)
AtsdAxibase Time Series Database Documentation
Stars: ✭ 68 (-38.18%)
WaimakWaimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (-45.45%)
Awesome NeuroscienceA curated list of awesome neuroscience libraries, software and any content related to the domain.
Stars: ✭ 734 (+567.27%)
Coltsteele webdevcourseCollection of coursework I've done from Colt Steele's Udemy Web Development course
Stars: ✭ 86 (-21.82%)
Yi NoteYiNote browser extension - online video note taking tool
Stars: ✭ 96 (-12.73%)
Hadoop SolrCode to index HDFS to Solr using MapReduce
Stars: ✭ 51 (-53.64%)
Play With Machine Learning AlgorithmsCode of my MOOC Course <Play with Machine Learning Algorithms>. Updated contents and practices are also included. 我在慕课网上的课程《Python3 入门机器学习》示例代码。课程的更多更新内容及辅助练习也将逐步添加进这个代码仓。
Stars: ✭ 1,037 (+842.73%)
Py4eWeb site for www.py4e.com and source to the Python 3.0 textbook
Stars: ✭ 1,387 (+1160.91%)
Nagios Plugins450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (+809.09%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-16.36%)
Stanford dbclassCollection of my solutions to the (infamous) dbclass (2014 version) offered by Stanford.
Stars: ✭ 35 (-68.18%)
Docker Spark🚢 Docker image for Apache Spark
Stars: ✭ 78 (-29.09%)
Data Algorithms Book MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+762.73%)
Tinymooc🌸 Lightweight Java Platform Online Mooc Learning Website
Stars: ✭ 110 (+0%)
Tf YarnTrain TensorFlow models on YARN in just a few lines of code!
Stars: ✭ 76 (-30.91%)
Bigdata Interview🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+679.09%)
Hadoop PotA scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.
Stars: ✭ 8 (-92.73%)
Docker HadoopApache Hadoop docker image
Stars: ✭ 1,190 (+981.82%)
Play With Algorithm InterviewCodes of my MOOC Course <Play with Algorithm Interviews>. Updated contents and practices are also included. 我在慕课网上的课程《玩儿转算法面试》示例代码。课程的更多更新内容及辅助练习也将逐步添加进这个代码仓。
Stars: ✭ 915 (+731.82%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-95.45%)
SrcA light-weight distributed stream computing framework for Golang
Stars: ✭ 67 (-39.09%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1587.27%)
Haproxy Configs80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (-3.64%)
AntsdbAntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase
Stars: ✭ 99 (-10%)
JumbuneJumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-41.82%)