All Projects → Wedatasphere → Similar Projects or Alternatives

1957 Open source projects that are alternatives of or similar to Wedatasphere

Thingsboard
Open-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+2729.57%)
Mutual labels:  kafka, spark
Camus
Mirror of Linkedin's Camus
Stars: ✭ 81 (-78.23%)
Mutual labels:  kafka, hadoop
Cdc Kafka Hadoop
MySQL to NoSQL real time dataflow
Stars: ✭ 13 (-96.51%)
Mutual labels:  kafka, hadoop
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+37.9%)
Mutual labels:  kafka, spark
Elasticluster
Create clusters of VMs on the cloud and configure them with Ansible.
Stars: ✭ 298 (-19.89%)
Mutual labels:  spark, hadoop
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-73.92%)
Mutual labels:  kafka, spark
Spark Hbase Connector
Connect Spark to HBase for reading and writing data with ease
Stars: ✭ 299 (-19.62%)
Mutual labels:  spark, hbase
Kafka Connect
equivalent to kafka-connect 🔧 for nodejs ✨🐢🚀✨
Stars: ✭ 102 (-72.58%)
Mutual labels:  kafka, etl
Awesome Recommendation Engine
The purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.
Stars: ✭ 47 (-87.37%)
Mutual labels:  kafka, spark
Delta Architecture
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Stars: ✭ 43 (-88.44%)
Mutual labels:  kafka, spark
Connection Pool Client
💥 A simple multi-purpose connection pool client (Kafka & Hbase & Redis & RMDB & Socket & Http)
Stars: ✭ 40 (-89.25%)
Mutual labels:  kafka, hbase
Iot Traffic Monitor
Stars: ✭ 131 (-64.78%)
Mutual labels:  kafka, spark
Wirbelsturm
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Stars: ✭ 332 (-10.75%)
Mutual labels:  kafka, spark
Abris
Avro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-65.05%)
Mutual labels:  kafka, spark
Example Spark Kafka
Apache Spark and Apache Kafka integration example
Stars: ✭ 120 (-67.74%)
Mutual labels:  kafka, spark
Spark Structured Streaming Examples
Spark Structured Streaming / Kafka / Cassandra / Elastic
Stars: ✭ 168 (-54.84%)
Mutual labels:  kafka, spark
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-62.37%)
Mutual labels:  kafka, spark
Spark Kafka Writer
Write your Spark data to Kafka seamlessly
Stars: ✭ 175 (-52.96%)
Mutual labels:  kafka, spark
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-2.96%)
Mutual labels:  spark, etl
Bigdata practice
大数据分析可视化实践
Stars: ✭ 166 (-55.38%)
Mutual labels:  kafka, hive
Dagster
An orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+1001.88%)
Mutual labels:  scheduler, etl
Seldon Server
Machine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+285.75%)
Mutual labels:  kafka, spark
Spark Streaming With Kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
Stars: ✭ 180 (-51.61%)
Mutual labels:  kafka, spark
Recommendsys
推荐项目(实时推荐和离线推荐)
Stars: ✭ 198 (-46.77%)
Mutual labels:  kafka, hadoop
Devops Bash Tools
550+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Kafka, Docker, APIs, Hadoop, SQL, PostgreSQL, MySQL, Hive, Impala, Travis CI, Jenkins, Concourse, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, .tmux.conf, .psqlrc ...
Stars: ✭ 226 (-39.25%)
Mutual labels:  kafka, hadoop
Storagetapper
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (-37.63%)
Mutual labels:  kafka, etl
Every Single Day I Tldr
A daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (-33.06%)
Mutual labels:  kafka, spark
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-33.6%)
Mutual labels:  kafka, spark
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+113.17%)
Mutual labels:  scheduler, spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+11.02%)
Mutual labels:  kafka, spark
Spring Boot 2.x Examples
Spring Boot 2.x code examples
Stars: ✭ 104 (-72.04%)
Mutual labels:  kafka, hbase
phoenix
Apache Phoenix / Hbase Spring Boot Microservices
Stars: ✭ 23 (-93.82%)
Mutual labels:  hadoop, hbase
Video Stream Analytics
Stars: ✭ 240 (-35.48%)
Mutual labels:  kafka, spark
thain
Thain is a distributed flow schedule platform.
Stars: ✭ 81 (-78.23%)
Mutual labels:  etl, scheduler
Gather Deployment
Gathers scalable tensorflow and infrastructure deployment
Stars: ✭ 326 (-12.37%)
Mutual labels:  kafka, hadoop
smart-data-lake
Smart Automation Tool for building modern Data Lakes and Data Pipelines
Stars: ✭ 79 (-78.76%)
Mutual labels:  hive, hadoop
bigdata-doc
大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (-90.05%)
Mutual labels:  hive, hadoop
hive to es
同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (-94.35%)
Mutual labels:  hive, hadoop
zdh web
大数据采集,抽取平台
Stars: ✭ 292 (-21.51%)
Mutual labels:  etl, scheduler
hive-bigquery-storage-handler
Hive Storage Handler for interoperability between BigQuery and Apache Hive
Stars: ✭ 16 (-95.7%)
Mutual labels:  hive, hadoop
the-apache-ignite-book
All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (-82.53%)
Mutual labels:  hive, hadoop
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+895.97%)
Mutual labels:  kafka, etl
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-89.52%)
Mutual labels:  hadoop, etl
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+1131.45%)
Mutual labels:  hadoop, hive
disk
基于hadoop+hbase+springboot实现分布式网盘系统
Stars: ✭ 53 (-85.75%)
Mutual labels:  hadoop, hbase
wasp
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-94.89%)
Mutual labels:  hadoop, hbase
liquibase-impala
Liquibase extension to add Impala Database support
Stars: ✭ 23 (-93.82%)
Mutual labels:  hive, hadoop
hive-jdbc-driver
An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (-91.67%)
Mutual labels:  hive, hadoop
hadoop-etl-udfs
The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Stars: ✭ 17 (-95.43%)
Mutual labels:  hive, hadoop
darwin
Avro Schema Evolution made easy
Stars: ✭ 26 (-93.01%)
Mutual labels:  hadoop, hbase
Hbase Rdd
Spark RDD to read, write and delete from HBase
Stars: ✭ 277 (-25.54%)
Mutual labels:  spark, hbase
BigDataTools
tools for bigData
Stars: ✭ 36 (-90.32%)
Mutual labels:  hive, hbase
cobra-policytool
Manage Apache Atlas and Ranger configuration for your Hadoop environment.
Stars: ✭ 16 (-95.7%)
Mutual labels:  hive, hadoop
orion
Management and automation platform for Stateful Distributed Systems
Stars: ✭ 77 (-79.3%)
Mutual labels:  hadoop, hbase
hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (-84.95%)
Mutual labels:  hive, hadoop
cmux
A set of commands for managing CDH clusters using Cloudera Manager REST API.
Stars: ✭ 34 (-90.86%)
Mutual labels:  hadoop, hbase
EngineeringTeam
와이빅타 엔지니어링팀의 자료를 정리해두는 곳입니다.
Stars: ✭ 41 (-88.98%)
Mutual labels:  hive, hadoop
litemall-dw
基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
Stars: ✭ 36 (-90.32%)
Mutual labels:  hive, hbase
fastdata-cluster
Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-94.62%)
Mutual labels:  spark, hadoop
TIL
Today I Learned
Stars: ✭ 43 (-88.44%)
Mutual labels:  hive, hadoop
61-120 of 1957 similar projects