All Projects → wasp → Similar Projects or Alternatives

1114 Open source projects that are alternatives of or similar to wasp

Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+2036.84%)
Mutual labels:  hadoop, solr, hbase, hdfs, parquet
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+57747.37%)
Mutual labels:  yarn, hadoop, hbase, hdfs
Bigdata Playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+831.58%)
Mutual labels:  hadoop, hbase, spark-streaming, parquet
Bigdata Interview
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+4410.53%)
Mutual labels:  yarn, hadoop, hbase, hdfs
Learning Spark
零基础学习spark,大数据学习
Stars: ✭ 37 (+94.74%)
Mutual labels:  hadoop, hbase, spark-streaming, hdfs
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-26.32%)
Mutual labels:  hadoop, solr, hbase, hdfs
God Of Bigdata
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+31521.05%)
Mutual labels:  hadoop, hbase, hdfs
Wifi
基于wifi抓取信息的大数据查询分析系统
Stars: ✭ 93 (+389.47%)
Mutual labels:  hadoop, hbase, hdfs
Repository
个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (+384.21%)
Mutual labels:  hadoop, hbase, hdfs
fastdata-cluster
Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (+5.26%)
Mutual labels:  yarn, hadoop, hdfs
Bigdata
💎🔥大数据学习笔记
Stars: ✭ 488 (+2468.42%)
Mutual labels:  hadoop, hbase, hdfs
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+8542.11%)
Mutual labels:  hadoop, hbase, parquet
xxhadoop
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (+94.74%)
Mutual labels:  hadoop, hbase, spark-streaming
Drill
Apache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+8421.05%)
Mutual labels:  hadoop, jdbc, parquet
docker-hadoop
Docker image for main Apache Hadoop components (Yarn/Hdfs)
Stars: ✭ 59 (+210.53%)
Mutual labels:  yarn, hadoop, hdfs
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (+747.37%)
Mutual labels:  hadoop, hbase, hdfs
aaocp
一个对用户行为日志进行分析的大数据项目
Stars: ✭ 53 (+178.95%)
Mutual labels:  hadoop, hbase, hdfs
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+1036.84%)
Mutual labels:  jdbc, hbase, spark-streaming
Dockerfiles
50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu
Stars: ✭ 847 (+4357.89%)
Mutual labels:  hadoop, solr, hbase
Akkeeper
An easy way to deploy your Akka services to a distributed environment.
Stars: ✭ 30 (+57.89%)
Mutual labels:  akka, yarn, hadoop
Parquet4s
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Stars: ✭ 125 (+557.89%)
Mutual labels:  akka, hadoop, parquet
litemall-dw
基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
Stars: ✭ 36 (+89.47%)
Mutual labels:  solr, hbase, spark-streaming
Nagios Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (+5163.16%)
Mutual labels:  hadoop, solr, hbase
BigInsights-on-Apache-Hadoop
Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix
Stars: ✭ 21 (+10.53%)
Mutual labels:  hadoop, hbase, spark-streaming
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (+331.58%)
Mutual labels:  hadoop, hbase
Antsdb
AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase
Stars: ✭ 99 (+421.05%)
Mutual labels:  hadoop, hbase
hadoop-etl-udfs
The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Stars: ✭ 17 (-10.53%)
Mutual labels:  hadoop, parquet
Hdfs Shell
HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (+515.79%)
Mutual labels:  hadoop, hdfs
Haproxy Configs
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (+457.89%)
Mutual labels:  hadoop, hbase
Camus
Mirror of Linkedin's Camus
Stars: ✭ 81 (+326.32%)
Mutual labels:  hadoop, hdfs
Ibis
A pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+8478.95%)
Mutual labels:  hadoop, hdfs
Parquet Go
Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.
Stars: ✭ 114 (+500%)
Mutual labels:  hadoop, parquet
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+9668.42%)
Mutual labels:  hadoop, spark-streaming
Dynamometer
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Stars: ✭ 122 (+542.11%)
Mutual labels:  hadoop, hdfs
Hbaseclient
HBase客户端数据管理软件
Stars: ✭ 135 (+610.53%)
Mutual labels:  hadoop, hbase
Atsd
Axibase Time Series Database Documentation
Stars: ✭ 68 (+257.89%)
Mutual labels:  hadoop, hbase
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+689.47%)
Mutual labels:  hadoop, hdfs
Movie recommend
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+10910.53%)
Mutual labels:  hadoop, spark-streaming
Parquet Rs
Apache Parquet implementation in Rust
Stars: ✭ 144 (+657.89%)
Mutual labels:  hadoop, parquet
Hive Jdbc Uber Jar
Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Stars: ✭ 188 (+889.47%)
Mutual labels:  hadoop, jdbc
Kyuubi
Kyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
Stars: ✭ 363 (+1810.53%)
Mutual labels:  yarn, jdbc
skein
A tool and library for easily deploying applications on Apache YARN
Stars: ✭ 128 (+573.68%)
Mutual labels:  hadoop, hdfs
Hadoop Solr
Code to index HDFS to Solr using MapReduce
Stars: ✭ 51 (+168.42%)
Mutual labels:  hadoop, solr
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (+636.84%)
Mutual labels:  hadoop, parquet
knit
Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
Stars: ✭ 53 (+178.95%)
Mutual labels:  yarn, hadoop
Jumbune
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (+236.84%)
Mutual labels:  yarn, hadoop
Tf Yarn
Train TensorFlow models on YARN in just a few lines of code!
Stars: ✭ 76 (+300%)
Mutual labels:  yarn, hadoop
Xlearning
AI on Hadoop
Stars: ✭ 1,709 (+8894.74%)
Mutual labels:  yarn, hadoop
cassandra.realtime
Different ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (+31.58%)
Mutual labels:  akka, spark-streaming
Tensorflowonyarn
Support TensorFlow on YARN
Stars: ✭ 114 (+500%)
Mutual labels:  yarn, hadoop
Utils4s
scala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+5531.58%)
Mutual labels:  akka, spark-streaming
kafka-connect-fs
Kafka Connect FileSystem Connector
Stars: ✭ 107 (+463.16%)
Mutual labels:  hadoop, hdfs
phoenix
Apache Phoenix / Hbase Spring Boot Microservices
Stars: ✭ 23 (+21.05%)
Mutual labels:  hadoop, hbase
orion
Management and automation platform for Stateful Distributed Systems
Stars: ✭ 77 (+305.26%)
Mutual labels:  hadoop, hbase
teraslice
Scalable data processing pipelines in JavaScript
Stars: ✭ 48 (+152.63%)
Mutual labels:  hadoop, hdfs
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (+78.95%)
Mutual labels:  hadoop, spark-streaming
beanszoo
Distributed Java micro-services using ZooKeeper
Stars: ✭ 12 (-36.84%)
Mutual labels:  yarn, hadoop
hbase-meta-repair
Repair hbase metadata table from hdfs.
Stars: ✭ 36 (+89.47%)
Mutual labels:  hbase, hdfs
dpkb
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (+547.37%)
Mutual labels:  hadoop, hbase
yarn-prometheus-exporter
Export Hadoop YARN (resource-manager) metrics in prometheus format
Stars: ✭ 44 (+131.58%)
Mutual labels:  yarn, hadoop
1-60 of 1114 similar projects