231 open source projects that are alternatives to or similar to Hadoop

Ibis
A pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (-86.61%)
Mutual labels:  hadoop
Floating Elephants
Docker containers for Hadoop.
Stars: ✭ 19 (-99.84%)
Mutual labels:  hadoop
Docker Hadoop Cluster
Multiple node cluster on Docker for self development.
Stars: ✭ 82 (-99.33%)
Mutual labels:  hadoop
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-99.96%)
Mutual labels:  hadoop
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-98.95%)
Mutual labels:  hadoop
Winutils
winutils.exe, hadoop.dll, and hdfs.dll binaries for Hadoop on Windows
Stars: ✭ 657 (-94.6%)
Mutual labels:  hadoop
Learn machine learning
Road to Machine Learning
Stars: ✭ 81 (-99.33%)
Mutual labels:  hadoop
Tony
TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
Stars: ✭ 626 (-94.86%)
Mutual labels:  hadoop
Asakusafw
Asakusa Framework
Stars: ✭ 114 (-99.06%)
Mutual labels:  hadoop
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (-53.55%)
Mutual labels:  hadoop
Chukwa
Mirror of Apache Chukwa
Stars: ✭ 77 (-99.37%)
Mutual labels:  hadoop
Alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
Stars: ✭ 5,379 (-55.83%)
Mutual labels:  hadoop
Hbaseclient
HBase client data management software
Stars: ✭ 135 (-98.89%)
Mutual labels:  hadoop
Bigdata
💎🔥 Big data study notes
Stars: ✭ 488 (-95.99%)
Mutual labels:  hadoop
Dataspherestudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (-90.19%)
Mutual labels:  hadoop
School Of Sre
At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.
Stars: ✭ 5,141 (-57.78%)
Mutual labels:  hadoop
Parquet Go
Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.
Stars: ✭ 114 (-99.06%)
Mutual labels:  hadoop
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+81.06%)
Mutual labels:  hadoop
Apache Spark Hands On
Educational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (-99.39%)
Mutual labels:  hadoop
Marmaray
Generic Data Ingestion & Dispersal Library for Hadoop
Stars: ✭ 414 (-96.6%)
Mutual labels:  hadoop
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-98.95%)
Mutual labels:  hadoop
Kafka Connect Hdfs
Kafka Connect HDFS connector
Stars: ✭ 400 (-96.72%)
Mutual labels:  hadoop
Atsd
Axibase Time Series Database Documentation
Stars: ✭ 68 (-99.44%)
Mutual labels:  hadoop
Iceberg
Iceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (-96.77%)
Mutual labels:  hadoop
Avro Hadoop Starter
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (-99.1%)
Mutual labels:  hadoop
Ignite
Apache Ignite
Stars: ✭ 4,027 (-66.93%)
Mutual labels:  hadoop
Jumbune
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-99.47%)
Mutual labels:  hadoop
Hive
Apache Hive
Stars: ✭ 4,031 (-66.9%)
Mutual labels:  hadoop
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (-98.85%)
Mutual labels:  hadoop
Ytk Learn
Ytk-learn is a distributed machine learning library which implements most popular machine learning algorithms (GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).
Stars: ✭ 337 (-97.23%)
Mutual labels:  hadoop
Likelike
An implementation of locality sensitive hashing with Hadoop
Stars: ✭ 58 (-99.52%)
Mutual labels:  hadoop
Gather Deployment
Gathers scalable tensorflow and infrastructure deployment
Stars: ✭ 326 (-97.32%)
Mutual labels:  hadoop
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (-84.76%)
Mutual labels:  hadoop
Tez
Apache Tez
Stars: ✭ 313 (-97.43%)
Mutual labels:  hadoop
Docker Hadoop
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-99.56%)
Mutual labels:  hadoop
Spline
Data Lineage Tracking And Visualization Solution
Stars: ✭ 306 (-97.49%)
Mutual labels:  hadoop
Parquet4s
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Stars: ✭ 125 (-98.97%)
Mutual labels:  hadoop
Elasticluster
Create clusters of VMs on the cloud and configure them with Ansible.
Stars: ✭ 298 (-97.55%)
Mutual labels:  hadoop
Base
https://www.researchgate.net/profile/Rajah_Iyer
Stars: ✭ 48 (-99.61%)
Mutual labels:  hadoop
Android Nosql
Lightweight, simple structured NoSQL database for Android
Stars: ✭ 284 (-97.67%)
Mutual labels:  hadoop
Bigdata Notebook
Stars: ✭ 100 (-99.18%)
Mutual labels:  hadoop
Hadoop Mini Clusters
hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE
Stars: ✭ 265 (-97.82%)
Mutual labels:  hadoop
Nagios Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (-91.79%)
Mutual labels:  hadoop
pulse
phData Pulse application log aggregation and monitoring
Stars: ✭ 13 (-99.89%)
Mutual labels:  hadoop
Calcite Avatica
Mirror of Apache Calcite - Avatica
Stars: ✭ 130 (-98.93%)
Mutual labels:  hadoop
knit
Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
Stars: ✭ 53 (-99.56%)
Mutual labels:  hadoop
Learning Spark
Learn Spark from scratch; big data learning
Stars: ✭ 37 (-99.7%)
Mutual labels:  hadoop
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-99.89%)
Mutual labels:  hadoop
Antsdb
AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase
Stars: ✭ 99 (-99.19%)
Mutual labels:  hadoop
XLearning-GPU
qihoo360 xlearning with GPU support; AI on Hadoop
Stars: ✭ 22 (-99.82%)
Mutual labels:  hadoop
Akkeeper
An easy way to deploy your Akka services to a distributed environment.
Stars: ✭ 30 (-99.75%)
Mutual labels:  hadoop
leaflet heatmap
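A simple visualization of Huzhou call data. Assuming the data volume is too large to render a heatmap directly in the browser, the heatmap rendering step is moved to offline computation and analysis. After the data is processed in parallel with Apache Spark, Apache Spark is also used to render the heatmap, and leafletjs then loads the OpenStreetMap layer and the heatmap layer for good interactivity. The rendering is currently implemented with Apache Spark; perhaps because Apache Spark is not well suited to this kind of computation, or because I did not design the algorithm well, the parallel computation is slower than single-machine computation. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .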
Stars: ✭ 13 (-99.89%)
Mutual labels:  hadoop
Hdfs Shell
HDFS Shell is an HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-99.04%)
Mutual labels:  hadoop
Storm Camel Example
Real-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.
Stars: ✭ 28 (-99.77%)
Mutual labels:  hadoop
Parquet Rs
Apache Parquet implementation in Rust
Stars: ✭ 144 (-98.82%)
Mutual labels:  hadoop
Xlearning
AI on Hadoop
Stars: ✭ 1,709 (-85.97%)
Mutual labels:  hadoop
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (-86.52%)
Mutual labels:  hadoop
Drill
Apache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (-86.7%)
Mutual labels:  hadoop
Hadoop Yarn Api Python Client
Python client for Hadoop® YARN API
Stars: ✭ 91 (-99.25%)
Mutual labels:  hadoop
Bigdata Interview
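🎯 🌟 [Big data interview questions] A collection of big data interview questions gathered online, with my own summarized answers. Currently covers interview knowledge summaries for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks.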
Stars: ✭ 857 (-92.96%)
Mutual labels:  hadoop