Yomo🦖 Streaming-Serverless Framework for Low-latency Edge Computing applications, running atop QUIC protocol, engaging 5G technology.
Stars: ✭ 279 (-97.48%)
zdh web大数据采集,抽取平台
Stars: ✭ 292 (-97.37%)
Flink Sql CookbookThe Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.
Stars: ✭ 189 (-98.3%)
Biglassobiglasso: Extending Lasso Model Fitting to Big Data in R
Stars: ✭ 87 (-99.22%)
keralaDistributed KV Streams
Stars: ✭ 16 (-99.86%)
Clustering4EverC4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (-98.86%)
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (-99.22%)
plexusPlexus - Interactive Emotion Visualization based on Social Media
Stars: ✭ 27 (-99.76%)
Javainterview最全的Java技术知识点,以及Java源码分析。为开源贡献自己的一份力。
Stars: ✭ 154 (-98.61%)
Media Stream Library JsJavaScript library to handle media streams on the command line (Node.js) and in the browser.
Stars: ✭ 192 (-98.27%)
ottlaAn opinionated clojure framework for writing kafka machines
Stars: ✭ 14 (-99.87%)
Athena CliPresto-like CLI tool for AWS Athena
Stars: ✭ 85 (-99.23%)
PersonNotes个人笔记集中营,快糙猛的形式记录技术性Notes .. 📚☕️⌨️🎧
Stars: ✭ 61 (-99.45%)
stream-registryStream Discovery and Stream Orchestration
Stars: ✭ 105 (-99.05%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-99.28%)
storm-mlan online learning algorithm library for Storm
Stars: ✭ 18 (-99.84%)
cdcA library for performing Content-Defined Chunking (CDC) on data streams.
Stars: ✭ 18 (-99.84%)
Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-99.33%)
talariaTalariaDB is a distributed, highly available, and low latency time-series database for Presto
Stars: ✭ 148 (-98.67%)
intersect一道面试题的思考 - 6000万数据包和300万数据包在50M内存使用环境中求交集
Stars: ✭ 54 (-99.51%)
Gspread PandasA package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (-97.96%)
LograngeHigh performance data aggregating storage
Stars: ✭ 181 (-98.37%)
HstreamThe streaming database built for IoT data storage and real-time processing in the 5G Era
Stars: ✭ 166 (-98.5%)
KoolreportThis is an Open Source PHP Reporting Framework which you can use to write perfect data reports or to construct awesome dashboards using PHP
Stars: ✭ 204 (-98.16%)
go-riversCollection of stream processing / multiplexing / networking libs in Go
Stars: ✭ 35 (-99.68%)
RanalyticshheRepository for Online Classes
Stars: ✭ 183 (-98.35%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (-91.11%)
Countly Sdk WebCountly Product Analytics SDK for websites and web applications
Stars: ✭ 165 (-98.51%)
datapackage-mPower Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
Stars: ✭ 26 (-99.77%)
Big DipperA block explorer for Cosmos
Stars: ✭ 119 (-98.93%)
Aws Auto Terminate Idle EmrAWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS based solution using AWS CloudWatch and AWS Lambda using a Python script that is using Boto3 to terminate AWS EMR clusters that have been idle for a specified period of time.
Stars: ✭ 21 (-99.81%)
SupersetApache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+284.33%)
jhdfA pure Java HDF5 library
Stars: ✭ 83 (-99.25%)
Danfojsdanfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
Stars: ✭ 1,304 (-88.24%)
Basketball analyticsRepository which contains various scripts and work with various basketball statistics
Stars: ✭ 88 (-99.21%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (-91.63%)
TrckQuery engine for TrailDB
Stars: ✭ 48 (-99.57%)
kafka-workersKafka Workers is a client library which unifies records consuming from Kafka and processing them by user-defined WorkerTasks.
Stars: ✭ 30 (-99.73%)
ArconRuntime for Writing Streaming Applications in Rust.
Stars: ✭ 44 (-99.6%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-99.95%)
InsightsOpen Source Self-Hosted Business Intelligence Platform
Stars: ✭ 917 (-91.73%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (-94.65%)
Kube BatchA batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC
Stars: ✭ 804 (-92.75%)
Dataanalysisinaction(已完结)《极客时间数据分析实战45讲-详细笔记》包含markdown、图片、思维导图、代码 、数据。 可直接阅读代码、测试!
Stars: ✭ 482 (-95.65%)
Ferolight, fast, scalable, streaming microservices made easy
Stars: ✭ 175 (-98.42%)
gostreamStream Processing Library for Go
Stars: ✭ 51 (-99.54%)
cnosdbAn Open Source Distributed Time Series Database with high performance, high compression ratio and high usability.
Stars: ✭ 858 (-92.27%)
ramenA stream processing language and compiler for small-scale monitoring
Stars: ✭ 14 (-99.87%)
learning-sparkTidy up Spark and Hadoop tutorials.
Stars: ✭ 28 (-99.75%)
anovosAnovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (-99.31%)
AthenacliAthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.
Stars: ✭ 151 (-98.64%)