All Projects → orion → Similar Projects or Alternatives

294 Open source projects that are alternatives of or similar to orion

Hadoop Connectors
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Stars: ✭ 218 (+183.12%)
Mutual labels:  hadoop
Docker Hadoop Cluster
Multiple node cluster on Docker for self development.
Stars: ✭ 82 (+6.49%)
Mutual labels:  hadoop
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+94.81%)
Mutual labels:  hadoop
Learn machine learning
Road to Machine Learning
Stars: ✭ 81 (+5.19%)
Mutual labels:  hadoop
kafka-connect-fs
Kafka Connect FileSystem Connector
Stars: ✭ 107 (+38.96%)
Mutual labels:  hadoop
Chukwa
Mirror of Apache Chukwa
Stars: ✭ 77 (+0%)
Mutual labels:  hadoop
Parquet Rs
Apache Parquet implementation in Rust
Stars: ✭ 144 (+87.01%)
Mutual labels:  hadoop
Dataspherestudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+1451.95%)
Mutual labels:  hadoop
Calcite
Apache Calcite
Stars: ✭ 2,816 (+3557.14%)
Mutual labels:  hadoop
Apache Spark Hands On
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-3.9%)
Mutual labels:  hadoop
Lidea
大型分布式系统实时监控平台
Stars: ✭ 28 (-63.64%)
Mutual labels:  hbase
Spydra
Ephemeral Hadoop clusters using Google Compute Platform
Stars: ✭ 128 (+66.23%)
Mutual labels:  hadoop
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (+66.23%)
Mutual labels:  hadoop
Jsr203 Hadoop
A Java NIO file system provider for HDFS
Stars: ✭ 35 (-54.55%)
Mutual labels:  hadoop
Jumbune
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-16.88%)
Mutual labels:  hadoop
Aliyun Emapreduce Datasources
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Stars: ✭ 132 (+71.43%)
Mutual labels:  hadoop
Likelike
An implementation of locality sensitive hashing with Hadoop
Stars: ✭ 58 (-24.68%)
Mutual labels:  hadoop
Shifu
An end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+168.83%)
Mutual labels:  hadoop
Docker Hadoop
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-29.87%)
Mutual labels:  hadoop
Base
https://www.researchgate.net/profile/Rajah_Iyer
Stars: ✭ 48 (-37.66%)
Mutual labels:  hadoop
mizo
Super-fast Spark RDD for Titan Graph Database on HBase
Stars: ✭ 24 (-68.83%)
Mutual labels:  hbase
Data Algorithms Book
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+1132.47%)
Mutual labels:  hadoop
RecommendationEngine
Source code and dataset for paper "CBMR: An optimized MapReduce for item‐based collaborative filtering recommendation algorithm with empirical analysis"
Stars: ✭ 43 (-44.16%)
Mutual labels:  hadoop
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (-55.84%)
Mutual labels:  hadoop
Awesome Learning
实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (+155.84%)
Mutual labels:  hadoop
Parquet4s
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Stars: ✭ 125 (+62.34%)
Mutual labels:  hadoop
Interview Questions Collection
按知识领域整理面试题,包括C++、Java、Hadoop、机器学习等
Stars: ✭ 21 (-72.73%)
Mutual labels:  hadoop
Recommendsys
推荐项目(实时推荐和离线推荐)
Stars: ✭ 198 (+157.14%)
Mutual labels:  hadoop
Akkeeper
An easy way to deploy your Akka services to a distributed environment.
Stars: ✭ 30 (-61.04%)
Mutual labels:  hadoop
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (+63.64%)
Mutual labels:  hadoop
Storm Camel Example
Real-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.
Stars: ✭ 28 (-63.64%)
Mutual labels:  hadoop
hadoop-ansible
Install hadoop cluster with ansible
Stars: ✭ 35 (-54.55%)
Mutual labels:  hadoop
Dynamometer
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Stars: ✭ 122 (+58.44%)
Mutual labels:  hadoop
Cdc Kafka Hadoop
MySQL to NoSQL real time dataflow
Stars: ✭ 13 (-83.12%)
Mutual labels:  hadoop
Nutch
Apache Nutch is an extensible and scalable web crawler
Stars: ✭ 2,277 (+2857.14%)
Mutual labels:  hadoop
Hdfs Shell
HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (+51.95%)
Mutual labels:  hadoop
Hadoop Pot
A scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.
Stars: ✭ 8 (-89.61%)
Mutual labels:  hadoop
Stormtweetssentimentd3viz
Computes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.
Stars: ✭ 25 (-67.53%)
Mutual labels:  hadoop
Ibis
A pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+2016.88%)
Mutual labels:  hadoop
Kylo
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+1089.61%)
Mutual labels:  hadoop
docker-hadoop
Docker image for main Apache Hadoop components (Yarn/Hdfs)
Stars: ✭ 59 (-23.38%)
Mutual labels:  hadoop
Hive Jdbc Uber Jar
Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Stars: ✭ 188 (+144.16%)
Mutual labels:  hadoop
Datax
DataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (+50.65%)
Mutual labels:  hadoop
Floating Elephants
Docker containers for Hadoop.
Stars: ✭ 19 (-75.32%)
Mutual labels:  hadoop
Asakusafw
Asakusa Framework
Stars: ✭ 114 (+48.05%)
Mutual labels:  hadoop
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-93.51%)
Mutual labels:  hadoop
Winutils
winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows
Stars: ✭ 657 (+753.25%)
Mutual labels:  hadoop
Tensorflowonyarn
Support TensorFlow on YARN
Stars: ✭ 114 (+48.05%)
Mutual labels:  hadoop
Useractionanalyzeplatform
电商用户行为分析大数据平台
Stars: ✭ 645 (+737.66%)
Mutual labels:  hadoop
Tony
TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
Stars: ✭ 626 (+712.99%)
Mutual labels:  hadoop
Parquet Go
Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.
Stars: ✭ 114 (+48.05%)
Mutual labels:  hadoop
Javapdf
🍣100本 Java电子书 技术书籍PDF(以下载阅读为荣,以点赞收藏为耻)
Stars: ✭ 609 (+690.91%)
Mutual labels:  hadoop
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+7245.45%)
Mutual labels:  hadoop
ambari-hdp-docker
Dockerfiles and Docker Compose for HDP 2.6 with Blueprints
Stars: ✭ 23 (-70.13%)
Mutual labels:  hadoop
Devops Bash Tools
550+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Kafka, Docker, APIs, Hadoop, SQL, PostgreSQL, MySQL, Hive, Impala, Travis CI, Jenkins, Concourse, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, .tmux.conf, .psqlrc ...
Stars: ✭ 226 (+193.51%)
Mutual labels:  hadoop
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+15844.16%)
Mutual labels:  hadoop
Xlearning Xdml
extremely distributed machine learning
Stars: ✭ 113 (+46.75%)
Mutual labels:  hadoop
Dist Keras
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (+696.1%)
Mutual labels:  hadoop
Alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
Stars: ✭ 5,379 (+6885.71%)
Mutual labels:  hadoop
Avro Hadoop Starter
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (+42.86%)
Mutual labels:  hadoop
61-120 of 294 similar projects