All Projects → Ambari → Similar Projects or Alternatives

375 Open source projects that are alternatives of or similar to Ambari

Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-15.1%)
Mutual labels:  big-data
K8s Ingress Claim
An admission control policy that safeguards against accidental duplicate claiming of Hosts/Domains.
Stars: ✭ 14 (-99.11%)
Mutual labels:  big-data
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-95.49%)
Mutual labels:  big-data
Dremio Oss
Dremio - the missing link in modern data
Stars: ✭ 862 (-45.3%)
Mutual labels:  big-data
Vizuka
Explore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-93.65%)
Mutual labels:  big-data
Accumulo
Apache Accumulo
Stars: ✭ 857 (-45.62%)
Mutual labels:  big-data
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-95.62%)
Mutual labels:  big-data
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (-45.81%)
Mutual labels:  big-data
Reef
Mirror of Apache REEF
Stars: ✭ 92 (-94.16%)
Mutual labels:  big-data
Pretzel
Javascript full-stack framework for Big Data visualisation and analysis
Stars: ✭ 26 (-98.35%)
Mutual labels:  big-data
Hazelcast Cpp Client
Hazelcast IMDG C++ Client
Stars: ✭ 67 (-95.75%)
Mutual labels:  big-data
Bandar Log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-98.79%)
Mutual labels:  big-data
Attic Predictionio Sdk Java
PredictionIO Java SDK
Stars: ✭ 107 (-93.21%)
Mutual labels:  big-data
Sqoop
Mirror of Apache Sqoop
Stars: ✭ 817 (-48.16%)
Mutual labels:  big-data
Rsparkling
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-95.88%)
Mutual labels:  big-data
Titanoboa
Titanoboa makes complex workflows easy. It is a low-code workflow orchestration platform for JVM - distributed, highly scalable and fault tolerant.
Stars: ✭ 787 (-50.06%)
Mutual labels:  big-data
Bitcoin Value Predictor
[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (-94.23%)
Mutual labels:  big-data
Storm
Mirror of Apache Storm
Stars: ✭ 6,297 (+299.56%)
Mutual labels:  big-data
Spark Doc Zh
Apache Spark 官方文档中文版
Stars: ✭ 1,126 (-28.55%)
Mutual labels:  big-data
Cython
The most widely used Python to C compiler
Stars: ✭ 6,588 (+318.02%)
Mutual labels:  big-data
Samza Hello Samza
Mirror of Apache Samza
Stars: ✭ 99 (-93.72%)
Mutual labels:  big-data
Samza
Mirror of Apache Samza
Stars: ✭ 676 (-57.11%)
Mutual labels:  big-data
Nabhash
An extremely fast Non-crypto-safe AES Based Hash algorithm for Big Data
Stars: ✭ 62 (-96.07%)
Mutual labels:  big-data
Sdc
Intel® Scalable Dataframe Compiler for Pandas*
Stars: ✭ 623 (-60.47%)
Mutual labels:  big-data
Parquet Mr
Apache Parquet
Stars: ✭ 1,278 (-18.91%)
Mutual labels:  big-data
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+258.88%)
Mutual labels:  big-data
Attic Lens
Mirror of Apache Lens
Stars: ✭ 58 (-96.32%)
Mutual labels:  big-data
Zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+249.81%)
Mutual labels:  big-data
Bigdataclass
Two-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-93.02%)
Mutual labels:  big-data
Scanner
Efficient video analysis at scale
Stars: ✭ 569 (-63.9%)
Mutual labels:  big-data
Docker Spark Cluster
A Spark cluster setup running on Docker containers
Stars: ✭ 57 (-96.38%)
Mutual labels:  big-data
Nipype
Workflows and interfaces for neuroimaging packages
Stars: ✭ 557 (-64.66%)
Mutual labels:  big-data
Panoptes
A Global Scale Network Telemetry Ecosystem
Stars: ✭ 80 (-94.92%)
Mutual labels:  big-data
Thrill
Thrill - An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++
Stars: ✭ 528 (-66.5%)
Mutual labels:  big-data
Lifion Kinesis
A native Node.js producer and consumer library for Amazon Kinesis Data Streams
Stars: ✭ 54 (-96.57%)
Mutual labels:  big-data
Beam
Apache Beam is a unified programming model for Batch and Streaming
Stars: ✭ 5,149 (+226.71%)
Mutual labels:  big-data
Kudu
Mirror of Apache Kudu
Stars: ✭ 1,360 (-13.71%)
Mutual labels:  big-data
Magellan
Geo Spatial Data Analytics on Spark
Stars: ✭ 507 (-67.83%)
Mutual labels:  big-data
Oodt
Mirror of Apache OODT
Stars: ✭ 52 (-96.7%)
Mutual labels:  big-data
Stream Framework
Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:
Stars: ✭ 4,576 (+190.36%)
Mutual labels:  big-data
Iotdb
Apache IoTDB
Stars: ✭ 1,221 (-22.53%)
Mutual labels:  big-data
Redislite
Redis in a python module.
Stars: ✭ 464 (-70.56%)
Mutual labels:  big-data
Trck
Query engine for TrailDB
Stars: ✭ 48 (-96.95%)
Mutual labels:  big-data
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (-71.19%)
Mutual labels:  big-data
Mysql perf analyzer
MySQL performance monitoring and analysis.
Stars: ✭ 1,423 (-9.71%)
Mutual labels:  big-data
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+1298.98%)
Mutual labels:  big-data
Moosefs
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (-34.96%)
Mutual labels:  big-data
Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-72.97%)
Mutual labels:  big-data
Attic Predictionio Template Recommender
PredictionIO Recommendation Engine Template (Scala-based parallelized engine)
Stars: ✭ 78 (-95.05%)
Mutual labels:  big-data
Datascience Ai Machinelearning Resources
Alex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (-73.73%)
Mutual labels:  big-data
Attaca
Robust, distributed version control for large files.
Stars: ✭ 41 (-97.4%)
Mutual labels:  big-data
Cogcomp Nlp
CogComp's Natural Language Processing libraries and Demos:
Stars: ✭ 410 (-73.98%)
Mutual labels:  big-data
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-93.85%)
Mutual labels:  big-data
Analysispreservation.cern.ch
Source code for the CERN Analysis Preservation portal
Stars: ✭ 37 (-97.65%)
Mutual labels:  big-data
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (-2.03%)
Mutual labels:  big-data
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-93.08%)
Mutual labels:  big-data
Maha
A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-93.59%)
Mutual labels:  big-data
Streamx
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (-93.91%)
Mutual labels:  big-data
Labs
Research on distributed system
Stars: ✭ 73 (-95.37%)
Mutual labels:  big-data
Predictionio Template Text Classifier
Text Classification Engine
Stars: ✭ 30 (-98.1%)
Mutual labels:  big-data
61-120 of 375 similar projects