TdengineAn open-source big data platform designed and optimized for the Internet of Things (IoT).
Stars: ✭ 17,434 (+553.45%)
PluckPluck text in a fast and intuitive way 🐓
Stars: ✭ 202 (-92.43%)
DolphinbeatA server that pulls and parses MySQL binlog, pushs change data into different sinks like Kafka.
Stars: ✭ 164 (-93.85%)
Big WhaleSpark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (-93.89%)
Whylogs JavaProfile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-93.85%)
Azure Event Hubs☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs
Stars: ✭ 233 (-91.27%)
Node HbaseAsynchronous HBase client for NodeJs using REST
Stars: ✭ 226 (-91.53%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-92.5%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (-12.93%)
Awesome Learning实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (-92.62%)
Vue Info CardSimple and beautiful card component with an elegant spark line, for VueJS.
Stars: ✭ 159 (-94.04%)
Java Notes☕️ Java 基础 👫 面向对象思想✏️ 算法 📝 操作系统 ☁️ 网络 💾 数据库 🙊 Spring 💡 系统架构🐘大数据
Stars: ✭ 160 (-94%)
BallistaDistributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (-14.77%)
GlowAn open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-94.04%)
Flink Sql CookbookThe Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.
Stars: ✭ 189 (-92.92%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-94.08%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-94.3%)
WatermillBuilding event-driven applications the easy way in Go.
Stars: ✭ 3,504 (+31.33%)
Ruby SparkRuby wrapper for Apache Spark
Stars: ✭ 221 (-91.72%)
Media Stream Library JsJavaScript library to handle media streams on the command line (Node.js) and in the browser.
Stars: ✭ 192 (-92.8%)
NmflibraryMATLAB library for non-negative matrix factorization (NMF): Version 1.8.1
Stars: ✭ 153 (-94.27%)
ScannsA scalable nearest neighbor search library in Apache Spark
Stars: ✭ 190 (-92.88%)
SparkmonitorMonitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-94.23%)
Javainterview最全的Java技术知识点,以及Java源码分析。为开源贡献自己的一份力。
Stars: ✭ 154 (-94.23%)
Sagemaker SparkA Spark library for Amazon SageMaker.
Stars: ✭ 219 (-91.79%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (-92.99%)
QuillCompile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (-25.11%)
AzuredatabricksbestpracticesVersion 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
Stars: ✭ 186 (-93.03%)
Spark.jlJulia binding for Apache Spark
Stars: ✭ 153 (-94.27%)
Simple It EnglishSimple-IT-English: smart wordbook from community for community
Stars: ✭ 233 (-91.27%)
6.824 2017⚡️ 6.824: Distributed Systems (Spring 2017). A course which present abstractions and implementation techniques for engineering distributed systems.
Stars: ✭ 219 (-91.79%)
PowderkegLive-coding the cluster!
Stars: ✭ 152 (-94.3%)
Bats面向 OLTP、OLAP、批处理、流处理场景的大一统 SQL 引擎
Stars: ✭ 152 (-94.3%)
Spark TsneDistributed t-SNE via Apache Spark
Stars: ✭ 151 (-94.34%)
RoaringbitmapA better compressed bitset in Java
Stars: ✭ 2,460 (-7.8%)
Spark ExcelA Spark plugin for reading Excel files via Apache POI
Stars: ✭ 216 (-91.9%)
LograngeHigh performance data aggregating storage
Stars: ✭ 181 (-93.22%)
Benchm MlA minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (-31.22%)
AztkAZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-94.3%)
HstreamThe streaming database built for IoT data storage and real-time processing in the 5G Era
Stars: ✭ 166 (-93.78%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-94.38%)
AthenacliAthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.
Stars: ✭ 151 (-94.34%)
RecommendationsystemBook recommender system using collaborative filtering based on Spark
Stars: ✭ 244 (-90.85%)
Fluent BitFast and Lightweight Logs and Metrics processor for Linux, BSD, OSX and Windows
Stars: ✭ 3,223 (+20.8%)
CoreBuild platforms that flexibly mix SQL, batch, and stream processing paradigms
Stars: ✭ 231 (-91.34%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-91.9%)
Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (-94.49%)
AvroApache Avro is a data serialization system.
Stars: ✭ 2,005 (-24.85%)
Sparkstreaming💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (-93.29%)
DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-94.49%)
XsqlUnified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-93.4%)