318 open source projects that are alternatives to or similar to Hudi

Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (-33.45%)
Mutual labels:  bigdata
Reddit sse stream
A Server Side Event stream to deliver Reddit comments and submissions in near real-time to a client.
Stars: ✭ 39 (-98.49%)
Mutual labels:  bigdata
Flink Learning
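Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, and Table API & SQL, plus large-scale production case studies (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for the author's column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization" is welcome.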
Stars: ✭ 11,378 (+339.98%)
Mutual labels:  stream-processing
Saber
Window-Based Hybrid CPU/GPU Stream Processing Engine
Stars: ✭ 35 (-98.65%)
Mutual labels:  stream-processing
Ecommercerecommendsystem
Real-time big data product recommendation system. Front end: Vue + TypeScript + ElementUI; back end: Spring + Spark
Stars: ✭ 139 (-94.62%)
Mutual labels:  bigdata
Streamsx.messaging
This toolkit is focused on interacting with popular messaging systems such as Kafka, JMS, XMS, and MQTT. After release v5.4.2 the complete toolkit will be deprecated. See the README.md file for hints to alternative toolkits.
Stars: ✭ 31 (-98.8%)
Mutual labels:  stream-processing
Gsf
Grid Solutions Framework
Stars: ✭ 106 (-95.9%)
Mutual labels:  stream-processing
Aws Auto Terminate Idle Emr
AWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS-based solution that uses AWS CloudWatch and AWS Lambda, with a Python script built on Boto3, to terminate AWS EMR clusters that have been idle for a specified period of time.
Stars: ✭ 21 (-99.19%)
Mutual labels:  bigdata
Fpart
Sort files and pack them into partitions
Stars: ✭ 127 (-95.09%)
Mutual labels:  bigdata
Spark Streaming Monitoring With Lightning
Plot live stats as graphs from an Apache Spark application using Lightning-viz
Stars: ✭ 15 (-99.42%)
Mutual labels:  bigdata
Flink Notes
Flink study notes
Stars: ✭ 106 (-95.9%)
Mutual labels:  bigdata
Streamsx.inet
This toolkit supports common internet protocols, such as HTTP and WebSockets
Stars: ✭ 11 (-99.57%)
Mutual labels:  stream-processing
Mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
Stars: ✭ 15,338 (+493.12%)
Mutual labels:  stream-processing
Tuna
🐟 A streaming ETL for fish
Stars: ✭ 11 (-99.57%)
Mutual labels:  stream-processing
Leofs
The LeoFS Storage System
Stars: ✭ 1,439 (-44.35%)
Mutual labels:  datalake
Hazelcast Jet
Distributed Stream and Batch Processing
Stars: ✭ 855 (-66.94%)
Mutual labels:  stream-processing
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-95.13%)
Mutual labels:  bigdata
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (-64.08%)
Mutual labels:  bigdata
Bigdata Notebook
Stars: ✭ 100 (-96.13%)
Mutual labels:  bigdata
10 Weeks
10-weeks of technology exploration
Stars: ✭ 22 (-99.15%)
Mutual labels:  bigdata
Wayeb
Wayeb is a Complex Event Processing and Forecasting (CEP/F) engine written in Scala.
Stars: ✭ 138 (-94.66%)
Mutual labels:  stream-processing
Bigdataguide
Big data learning: learn big data from scratch, including study videos for each learning stage and interview materials
Stars: ✭ 817 (-68.41%)
Mutual labels:  bigdata
Logisland
Scalable stream processing platform for advanced real-time analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink is on the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of ready-to-use processors, data sources, and sinks is available.
Stars: ✭ 97 (-96.25%)
Mutual labels:  stream-processing
Spring Cloud Dataflow
A microservices-based Streaming and Batch data processing in Cloud Foundry and Kubernetes
Stars: ✭ 753 (-70.88%)
Mutual labels:  stream-processing
Pulsar Flink
Elastic data processing with Apache Pulsar and Apache Flink
Stars: ✭ 126 (-95.13%)
Mutual labels:  stream-processing
Json Machine
Efficient, easy-to-use, and fast PHP JSON stream parser
Stars: ✭ 376 (-85.46%)
Mutual labels:  stream-processing
Covid19 Market Waiting Times
A project to help people stand in line at the market as little as possible
Stars: ✭ 95 (-96.33%)
Mutual labels:  bigdata
Vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Stars: ✭ 6,793 (+162.68%)
Mutual labels:  bigdata
Avro
Apache Avro is a data serialization system.
Stars: ✭ 2,005 (-22.47%)
Mutual labels:  bigdata
Spring Cloud Stream
Framework for building Event-Driven Microservices
Stars: ✭ 662 (-74.4%)
Mutual labels:  stream-processing
Biglasso
biglasso: Extending Lasso Model Fitting to Big Data in R
Stars: ✭ 87 (-96.64%)
Mutual labels:  bigdata
Automi
A stream processing API for Go (alpha)
Stars: ✭ 617 (-76.14%)
Mutual labels:  stream-processing
Liteflow
liteflow is a distributed task-flow scheduling system built on task versions
Stars: ✭ 112 (-95.67%)
Mutual labels:  bigdata
Kafka Streams
equivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (-76.3%)
Mutual labels:  stream-processing
Bigdata File Viewer
A cross-platform (Windows, Mac, Linux) desktop application to view common big data binary formats such as Parquet, ORC, and AVRO. Supports the local file system, HDFS, AWS S3, Azure Blob Storage, etc.
Stars: ✭ 86 (-96.67%)
Mutual labels:  bigdata
Streaming Readings
Papers and reading material related to streaming systems
Stars: ✭ 554 (-78.58%)
Mutual labels:  stream-processing
Mara Pipelines
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (-28.81%)
Mutual labels:  data-integration
Bigslice
A serverless cluster computing system for the Go programming language
Stars: ✭ 469 (-81.86%)
Mutual labels:  bigdata
Athena Cli
Presto-like CLI tool for AWS Athena
Stars: ✭ 85 (-96.71%)
Mutual labels:  bigdata
Lambda Arch
Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-95.71%)
Mutual labels:  bigdata
Flinkstreamsql
Extends the real-time SQL of open source Flink; mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax
Stars: ✭ 1,682 (-34.96%)
Mutual labels:  bigdata
Apache Spark Hands On
Educational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (-97.14%)
Mutual labels:  bigdata
Jigsaw
Jigsaw七巧板 provides a set of web components based on Angular5/8/9+. Its main purpose is to help application developers build complex, interaction-intensive, user-friendly web pages. Jigsaw supports the development of all applications of ZTE's Big Data product.
Stars: ✭ 354 (-86.31%)
Mutual labels:  bigdata
Bigdataie
A collection of big data blog posts, written test questions, tutorials, projects, and interview experiences
Stars: ✭ 445 (-82.79%)
Mutual labels:  bigdata
Kspp
A high-performance, real-time C++ Kafka streams framework (C++17)
Stars: ✭ 80 (-96.91%)
Mutual labels:  stream-processing
Ksql
The database purpose-built for stream processing applications.
Stars: ✭ 4,668 (+80.51%)
Mutual labels:  stream-processing
Big Data Study
🐳 big data study
Stars: ✭ 141 (-94.55%)
Mutual labels:  bigdata
Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-83.53%)
Mutual labels:  bigdata
Machine
Machine is a workflow/pipeline library for processing data
Stars: ✭ 78 (-96.98%)
Mutual labels:  stream-processing
Awesome Kafka
A list about Apache Kafka
Stars: ✭ 397 (-84.65%)
Mutual labels:  stream-processing
Wally
Distributed Stream Processing
Stars: ✭ 1,461 (-43.5%)
Mutual labels:  stream-processing
Awesome System Design
A curated list of awesome System Design (A.K.A. Distributed Systems) resources.
Stars: ✭ 4,999 (+93.31%)
Mutual labels:  stream-processing
Cleanframes
type-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-97.1%)
Mutual labels:  bigdata
Sidekick
High Performance HTTP Sidecar Load Balancer
Stars: ✭ 366 (-85.85%)
Mutual labels:  bigdata
Samsara
Samsara is a real-time analytics platform
Stars: ✭ 132 (-94.9%)
Mutual labels:  stream-processing
Datawave
DataWave is an ingest/query framework that leverages Apache Accumulo to provide fast, secure data access.
Stars: ✭ 347 (-86.58%)
Mutual labels:  bigdata
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-95.78%)
Mutual labels:  bigdata
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-97.25%)
Mutual labels:  bigdata
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-97.33%)
Mutual labels:  bigdata
Siddhi
Stream Processing and Complex Event Processing Engine
Stars: ✭ 1,185 (-54.18%)
Mutual labels:  stream-processing
61-120 of 318 similar projects