All Projects → Bigdata Playground → Similar Projects or Alternatives

3071 Open source projects that are alternatives of or similar to Bigdata Playground

⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more

Stars: ✭ 58 (-67.23%)

Mutual labels: avro, parquet

Hazelcast Jet

Distributed Stream and Batch Processing

Stars: ✭ 855 (+383.05%)

Mutual labels: kafka, big-data

Spring Examples

SpringBoot Examples

Stars: ✭ 67 (-62.15%)

Mutual labels: graphql, mongodb

Atsd

Axibase Time Series Database Documentation

Stars: ✭ 68 (-61.58%)

Mutual labels: hadoop, hbase

Spark States

Custom state store providers for Apache Spark

Stars: ✭ 83 (-53.11%)

Mutual labels: apache-spark, spark-streaming

Camus

Mirror of Linkedin's Camus

Stars: ✭ 81 (-54.24%)

Mutual labels: kafka, hadoop

Boilerplate Vue Apollo Graphql Mongodb

Start your magical stack journey!

Stars: ✭ 85 (-51.98%)

Mutual labels: graphql, mongodb

Kaufmann ex

Kafka backed service library.

Stars: ✭ 86 (-51.41%)

Mutual labels: kafka, avro

Wifi

基于wifi抓取信息的大数据查询分析系统

Stars: ✭ 93 (-47.46%)

Mutual labels: hadoop, hbase

Moosefs

MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)

Stars: ✭ 1,025 (+479.1%)

Mutual labels: big-data, hadoop

Community

a community based on Node.js

Stars: ✭ 44 (-75.14%)

Mutual labels: graphql, mongodb

Awesome Recommendation Engine

The purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.

Stars: ✭ 47 (-73.45%)

Mutual labels: kafka, mongodb

Streamx

kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)

Stars: ✭ 96 (-45.76%)

Mutual labels: kafka, big-data

Logisland

Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.

Stars: ✭ 97 (-45.2%)

Mutual labels: kafka, big-data

Gcs Tools

GCS support for avro-tools, parquet-tools and protobuf

Stars: ✭ 57 (-67.8%)

Mutual labels: avro, parquet

Docker Spark Cluster

A Spark cluster setup running on Docker containers

Stars: ✭ 57 (-67.8%)

Mutual labels: big-data, hadoop

Event Sourcing Castanha

An Event Sourcing service template with DDD, TDD and SOLID. It has High Cohesion and Loose Coupling, it's a good start for your next Microservice application.

Stars: ✭ 68 (-61.58%)

Mutual labels: kafka, mongodb

Zaneperfor

前端性能监控系统,消息队列,高可用,集群等相关架构

Stars: ✭ 1,085 (+512.99%)

Mutual labels: kafka, mongodb

Hadoop cookbook

Cookbook to install Hadoop 2.0+ using Chef

Stars: ✭ 82 (-53.67%)

Mutual labels: hadoop, hbase

Bigdata File Viewer

A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.

Stars: ✭ 86 (-51.41%)

Mutual labels: avro, parquet

Wertik Js

💪 A library that powers your app with GraphQL + Rest API

Stars: ✭ 56 (-68.36%)

Mutual labels: graphql, mongodb

Unchained

Headless & open-source e-commerce toolkit. The Unchained Engine is our core product and is written in Node.js ES6

Stars: ✭ 92 (-48.02%)

Mutual labels: graphql, mongodb

Bigdata Notebook

Stars: ✭ 100 (-43.5%)

Mutual labels: kafka, hadoop

Schemer

Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.

Stars: ✭ 97 (-45.2%)

Mutual labels: avro, parquet

Parquet Mr

Apache Parquet

Stars: ✭ 1,278 (+622.03%)

Mutual labels: big-data, parquet

Pmacct

pmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].

Stars: ✭ 677 (+282.49%)

Mutual labels: kafka, avro

Springboot

SpringBoot 整合各类框架和应用

Stars: ✭ 54 (-69.49%)

Mutual labels: kafka, mongodb

Antsdb

AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase

Stars: ✭ 99 (-44.07%)

Mutual labels: hadoop, hbase

Production Ready Expressjs Server

Express.js server that implements production-ready error handling and logging following latest best practices.

Stars: ✭ 101 (-42.94%)

Mutual labels: graphql, mongodb

Rsyslog

a Rocket-fast SYStem for LOG processing

Stars: ✭ 1,385 (+682.49%)

Mutual labels: kafka, mongodb

Bigdata docker

Big Data Ecosystem Docker

Stars: ✭ 161 (-9.04%)

Mutual labels: hadoop, hbase

Schema Registry

Confluent Schema Registry for Kafka

Stars: ✭ 1,647 (+830.51%)

Mutual labels: kafka, avro

Avro Hadoop Starter

Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.

Stars: ✭ 110 (-37.85%)

Mutual labels: hadoop, avro

Parquet Go

Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.

Stars: ✭ 114 (-35.59%)

Mutual labels: hadoop, parquet

Waterdrop

Production Ready Data Integration Product, documentation：

Stars: ✭ 1,856 (+948.59%)

Mutual labels: hadoop, spark-streaming

Kkbinlog

支持mysql、MongoDB数据变更订阅分发

Stars: ✭ 112 (-36.72%)

Mutual labels: kafka, mongodb

Asakusafw

Asakusa Framework

Stars: ✭ 114 (-35.59%)

Mutual labels: big-data, hadoop

Graphql Nodejs Hapi Api

How to set-up a powerful API with Nodejs, GraphQL, MongoDB, Hapi, and Swagger

Stars: ✭ 116 (-34.46%)

Mutual labels: graphql, mongodb

Cmak

CMAK is a tool for managing Apache Kafka clusters

Stars: ✭ 10,544 (+5857.06%)

Mutual labels: kafka, big-data

Example Spark Kafka

Apache Spark and Apache Kafka integration example

Stars: ✭ 120 (-32.2%)

Mutual labels: kafka, spark-streaming

Haproxy Configs

80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.

Stars: ✭ 106 (-40.11%)

Mutual labels: hadoop, hbase

Amazon S3 Find And Forget

Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)

Stars: ✭ 115 (-35.03%)

Mutual labels: big-data, parquet

Hdfs Shell

HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS

Stars: ✭ 117 (-33.9%)

Mutual labels: big-data, hadoop

Scala Spark Tutorial

Project for James' Apache Spark with Scala course

Stars: ✭ 121 (-31.64%)

Mutual labels: big-data, apache-spark

Parquet4s

Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.

Stars: ✭ 125 (-29.38%)

Mutual labels: hadoop, parquet

Scrapy demo

all kinds of scrapy demo

Stars: ✭ 128 (-27.68%)

Mutual labels: kafka, mongodb

Apollo2 Subscriptions How To

Apollo Server 2 how to setup subscriptions

Stars: ✭ 125 (-29.38%)

Mutual labels: graphql, mongodb

Spark

.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.

Stars: ✭ 1,721 (+872.32%)

Mutual labels: apache-spark, spark-streaming

Calcite Avatica

Mirror of Apache Calcite - Avatica

Stars: ✭ 130 (-26.55%)

Mutual labels: big-data, hadoop

Frisky

🍿 Open Source GraphQL API for Online Shows

Stars: ✭ 161 (-9.04%)

Mutual labels: graphql, mongodb

Aliyun Emapreduce Datasources

Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.

Stars: ✭ 132 (-25.42%)

Mutual labels: kafka, hadoop

Api.xiaoduyu.com

🐟小度鱼 - 年轻人的交流社区 https://www.xiaoduyu.com

Stars: ✭ 168 (-5.08%)

Mutual labels: graphql, mongodb

Spark On Lambda

Apache Spark on AWS Lambda

Stars: ✭ 137 (-22.6%)

Mutual labels: big-data, apache-spark

Flink Learning

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例，还有 Flink 落地应用的大型项目案例（PVUV、日志存储、百亿数据实时去重、监控告警）分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》

Stars: ✭ 11,378 (+6328.25%)

Mutual labels: kafka, hbase

Slimmessagebus

Lightweight message bus interface for .NET (pub/sub and request-response) with transport plugins for popular message brokers.

Stars: ✭ 120 (-32.2%)

Mutual labels: kafka, avro

Abris

Avro SerDe for Apache Spark structured APIs.

Stars: ✭ 130 (-26.55%)

Mutual labels: kafka, avro

Hbaseclient

HBase客户端数据管理软件