Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (-33.45%)
Reddit sse streamA Server Side Event stream to deliver Reddit comments and submissions in near real-time to a client.
Stars: ✭ 39 (-98.49%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+339.98%)
SaberWindow-Based Hybrid CPU/GPU Stream Processing Engine
Stars: ✭ 35 (-98.65%)
Streamsx.messagingThis toolkit is focused on interacting with popular messaging systems such as Kafka, JMS, XMS, and MQTT. After release v5.4.2 the complete toolkit will be deprecated. See the README.md file for hints to alternative toolkits.
Stars: ✭ 31 (-98.8%)
GsfGrid Solutions Framework
Stars: ✭ 106 (-95.9%)
Aws Auto Terminate Idle EmrAWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS based solution using AWS CloudWatch and AWS Lambda using a Python script that is using Boto3 to terminate AWS EMR clusters that have been idle for a specified period of time.
Stars: ✭ 21 (-99.19%)
FpartSort files and pack them into partitions
Stars: ✭ 127 (-95.09%)
Streamsx.inetThis toolkit supports common internet protocols, such as HTTP and WebSockets
Stars: ✭ 11 (-99.57%)
MediapipeCross-platform, customizable ML solutions for live and streaming media.
Stars: ✭ 15,338 (+493.12%)
Tuna🐟 A streaming ETL for fish
Stars: ✭ 11 (-99.57%)
LeofsThe LeoFS Storage System
Stars: ✭ 1,439 (-44.35%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (-66.94%)
HadoopcryptoledgerHadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-95.13%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (-64.08%)
10 Weeks10-weeks of technology exploration
Stars: ✭ 22 (-99.15%)
WayebWayeb is a Complex Event Processing and Forecasting (CEP/F) engine written in Scala.
Stars: ✭ 138 (-94.66%)
Bigdataguide大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
Stars: ✭ 817 (-68.41%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-96.25%)
Spring Cloud DataflowA microservices-based Streaming and Batch data processing in Cloud Foundry and Kubernetes
Stars: ✭ 753 (-70.88%)
Pulsar FlinkElastic data processing with Apache Pulsar and Apache Flink
Stars: ✭ 126 (-95.13%)
Json MachineEfficient, easy-to-use, and fast PHP JSON stream parser
Stars: ✭ 376 (-85.46%)
VaexOut-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Stars: ✭ 6,793 (+162.68%)
AvroApache Avro is a data serialization system.
Stars: ✭ 2,005 (-22.47%)
Biglassobiglasso: Extending Lasso Model Fitting to Big Data in R
Stars: ✭ 87 (-96.64%)
AutomiA stream processing API for Go (alpha)
Stars: ✭ 617 (-76.14%)
Liteflowliteflow是一个基于任务版本来实现的分布式任务流调度系统
Stars: ✭ 112 (-95.67%)
Kafka Streamsequivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (-76.3%)
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (-96.67%)
Mara PipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (-28.81%)
BigsliceA serverless cluster computing system for the Go programming language
Stars: ✭ 469 (-81.86%)
Athena CliPresto-like CLI tool for AWS Athena
Stars: ✭ 85 (-96.71%)
Lambda ArchApplying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-95.71%)
Flinkstreamsql基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
Stars: ✭ 1,682 (-34.96%)
Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-97.14%)
JigsawJigsaw七巧板 provides a set of web components based on Angular5/8/9+. The main purpose of Jigsaw is to help the application developers to construct complex & intensive interacting & user friendly web pages. Jigsaw is supporting the development of all applications of Big Data Product of ZTE.
Stars: ✭ 354 (-86.31%)
Bigdataie大数据博客、笔试题、教程、项目、面经的整理
Stars: ✭ 445 (-82.79%)
KsppA high performance/ real-time C++ Kafka streams framework (C++17)
Stars: ✭ 80 (-96.91%)
KsqlThe database purpose-built for stream processing applications.
Stars: ✭ 4,668 (+80.51%)
CortxCORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-83.53%)
MachineMachine is a workflow/pipeline library for processing data
Stars: ✭ 78 (-96.98%)
WallyDistributed Stream Processing
Stars: ✭ 1,461 (-43.5%)
Awesome System DesignA curated list of awesome System Design (A.K.A. Distributed Systems) resources.
Stars: ✭ 4,999 (+93.31%)
Cleanframestype-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-97.1%)
SidekickHigh Performance HTTP Sidecar Load Balancer
Stars: ✭ 366 (-85.85%)
SamsaraSamsara is a real-time analytics platform
Stars: ✭ 132 (-94.9%)
DatawaveDataWave is an ingest/query framework that leverages Apache Accumulo to provide fast, secure data access.
Stars: ✭ 347 (-86.58%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-95.78%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-97.33%)
SiddhiStream Processing and Complex Event Processing Engine
Stars: ✭ 1,185 (-54.18%)