All Projects → Hadoop Common → Similar Projects or Alternatives

231 Open source projects that are alternatives of or similar to Hadoop Common

Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.

Stars: ✭ 125 (-19.35%)

Mutual labels: hadoop

50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu

Stars: ✭ 847 (+446.45%)

Mutual labels: hadoop

Wifi

基于wifi抓取信息的大数据查询分析系统

Stars: ✭ 93 (-40%)

Mutual labels: hadoop

Stormtweetssentimentd3viz

Computes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.

Stars: ✭ 25 (-83.87%)

Mutual labels: hadoop

Hbaseclient

HBase客户端数据管理软件

Stars: ✭ 135 (-12.9%)

Mutual labels: hadoop

Floating Elephants

Docker containers for Hadoop.

Stars: ✭ 19 (-87.74%)

Mutual labels: hadoop

Hadoop Mapreduce

Mirror of Apache Hadoop MapReduce

Stars: ✭ 88 (-43.23%)

Mutual labels: hadoop

Hadoop For Geoevent

ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.

Stars: ✭ 5 (-96.77%)

Mutual labels: hadoop

Hdfs Shell

HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS

Stars: ✭ 117 (-24.52%)

Mutual labels: hadoop

Winutils

winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows

Stars: ✭ 657 (+323.87%)

Mutual labels: hadoop

Docker Hadoop Cluster

Multiple node cluster on Docker for self development.

Stars: ✭ 82 (-47.1%)

Mutual labels: hadoop

Tony

TonY is a framework to natively run deep learning frameworks on Apache Hadoop.

Stars: ✭ 626 (+303.87%)

Mutual labels: hadoop

Hadoop

Apache Hadoop

Stars: ✭ 12,177 (+7756.13%)

Mutual labels: hadoop

H2o 3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

Stars: ✭ 5,656 (+3549.03%)

Mutual labels: hadoop

Learn machine learning

Road to Machine Learning

Stars: ✭ 81 (-47.74%)

Mutual labels: hadoop

Alluxio

Alluxio, data orchestration for analytics and machine learning in the cloud

Stars: ✭ 5,379 (+3370.32%)

Mutual labels: hadoop

Ibis

A pandas-like deferred expression system, with first-class SQL support

Stars: ✭ 1,630 (+951.61%)

Mutual labels: hadoop

Bigdata

💎🔥大数据学习笔记

Stars: ✭ 488 (+214.84%)

Mutual labels: hadoop

Chukwa

Mirror of Apache Chukwa

Stars: ✭ 77 (-50.32%)

Mutual labels: hadoop

School Of Sre

At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.

Stars: ✭ 5,141 (+3216.77%)

Mutual labels: hadoop

Calcite Avatica

Mirror of Apache Calcite - Avatica

Stars: ✭ 130 (-16.13%)

Mutual labels: hadoop

Data Science Ipython Notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Stars: ✭ 22,048 (+14124.52%)

Mutual labels: hadoop

Dataspherestudio

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Stars: ✭ 1,195 (+670.97%)

Mutual labels: hadoop

Marmaray

Generic Data Ingestion & Dispersal Library for Hadoop

Stars: ✭ 414 (+167.1%)

Mutual labels: hadoop

Asakusafw

Asakusa Framework

Stars: ✭ 114 (-26.45%)

Mutual labels: hadoop

Kafka Connect Hdfs

Kafka Connect HDFS connector

Stars: ✭ 400 (+158.06%)

Mutual labels: hadoop

Apache Spark Hands On

Educational notes,Hands on problems w/ solutions for hadoop ecosystem

Stars: ✭ 74 (-52.26%)

Mutual labels: hadoop

Iceberg

Iceberg is a table format for large, slow-moving tabular data

Stars: ✭ 393 (+153.55%)

Mutual labels: hadoop

Hadoop Hdfs

Mirror of Apache Hadoop HDFS

Stars: ✭ 152 (-1.94%)

Mutual labels: hadoop

Ignite

Apache Ignite

Stars: ✭ 4,027 (+2498.06%)

Mutual labels: hadoop

Atsd

Axibase Time Series Database Documentation

Stars: ✭ 68 (-56.13%)

Mutual labels: hadoop

Hive

Apache Hive

Stars: ✭ 4,031 (+2500.65%)

Mutual labels: hadoop

Parquet Go

Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.

Stars: ✭ 114 (-26.45%)

Mutual labels: hadoop

Ytk Learn

Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).

Stars: ✭ 337 (+117.42%)

Mutual labels: hadoop

Jumbune

Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,

Stars: ✭ 64 (-58.71%)

Mutual labels: hadoop

Gather Deployment

Gathers scalable tensorflow and infrastructure deployment

Stars: ✭ 326 (+110.32%)

Mutual labels: hadoop

Airflow Pipeline

An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR

Stars: ✭ 128 (-17.42%)

Mutual labels: hadoop

Tez

Apache Tez

Stars: ✭ 313 (+101.94%)

Mutual labels: hadoop

Likelike

An implementation of locality sensitive hashing with Hadoop

Stars: ✭ 58 (-62.58%)

Mutual labels: hadoop

Spline

Data Lineage Tracking And Visualization Solution

Stars: ✭ 306 (+97.42%)

Mutual labels: hadoop

Avro Hadoop Starter

Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.

Stars: ✭ 110 (-29.03%)

Mutual labels: hadoop

Elasticluster

Create clusters of VMs on the cloud and configure them with Ansible.

Stars: ✭ 298 (+92.26%)

Mutual labels: hadoop

Docker Hadoop

A Docker container with a full Hadoop cluster setup with Spark and Zeppelin

Stars: ✭ 54 (-65.16%)

Mutual labels: hadoop

Android Nosql

Lightweight, simple structured NoSQL database for Android

Stars: ✭ 284 (+83.23%)

Mutual labels: hadoop

Eel Sdk

Big Data Toolkit for the JVM

Stars: ✭ 140 (-9.68%)

Mutual labels: hadoop

Hadoop Mini Clusters

hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE

Stars: ✭ 265 (+70.97%)

Mutual labels: hadoop

Base

https://www.researchgate.net/profile/Rajah_Iyer

Stars: ✭ 48 (-69.03%)

Mutual labels: hadoop

pulse

phData Pulse application log aggregation and monitoring

Stars: ✭ 13 (-91.61%)

Mutual labels: hadoop

Waterdrop

Production Ready Data Integration Product, documentation：

Stars: ✭ 1,856 (+1097.42%)

Mutual labels: hadoop

knit

Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead

Stars: ✭ 53 (-65.81%)

Mutual labels: hadoop

Nagios Plugins

450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...

Stars: ✭ 1,000 (+545.16%)

Mutual labels: hadoop

bigdata-fun

A complete (distributed) BigData stack, running in containers

Stars: ✭ 14 (-90.97%)

Mutual labels: hadoop

Griffon Vm

Griffon Data Science Virtual Machine

Stars: ✭ 128 (-17.42%)

Mutual labels: hadoop

Learning Spark

零基础学习spark，大数据学习