Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,323 (-12.93%)

Mutual labels: spark

Awesome Learning

实践源码库：https://github.com/jast90/bigdata 。微信搜索Jast关注公众号，获取最新技术分享😯。

Stars: ✭ 197 (-92.62%)

Mutual labels: bigdata

Vue Info Card

Simple and beautiful card component with an elegant spark line, for VueJS.

Stars: ✭ 159 (-94.04%)

Mutual labels: spark

Java Notes

☕️ Java 基础 👫 面向对象思想✏️ 算法 📝 操作系统 ☁️ 网络 💾 数据库 🙊 Spring 💡 系统架构🐘大数据

Stars: ✭ 160 (-94%)

Mutual labels: bigdata

Spark Workshop

Apache Spark™ and Scala Workshops

Stars: ✭ 224 (-91.6%)

Mutual labels: spark

Ballista

Distributed compute platform implemented in Rust, and powered by Apache Arrow.

Stars: ✭ 2,274 (-14.77%)

Mutual labels: spark

Glow

An open-source toolkit for large-scale genomic analysis

Stars: ✭ 159 (-94.04%)

Mutual labels: spark

Scalable Data Science Platform

Content for architecting a data science platform for products using Luigi, Spark & Flask.

Stars: ✭ 158 (-94.08%)

Mutual labels: spark

Flink Sql Cookbook

The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.

Stars: ✭ 189 (-92.92%)

Mutual labels: stream-processing

Handyspark

HandySpark - bringing pandas-like capabilities to Spark dataframes

Stars: ✭ 158 (-94.08%)

Mutual labels: spark

Geni

A Clojure dataframe library that runs on Spark

Stars: ✭ 152 (-94.3%)

Mutual labels: spark

Watermill

Building event-driven applications the easy way in Go.

Stars: ✭ 3,504 (+31.33%)

Mutual labels: stream-processing

Mastering Spark Sql Book

The Internals of Spark SQL

Stars: ✭ 234 (-91.23%)

Mutual labels: spark

Ruby Spark

Ruby wrapper for Apache Spark

Stars: ✭ 221 (-91.72%)

Mutual labels: spark

Media Stream Library Js

JavaScript library to handle media streams on the command line (Node.js) and in the browser.

Stars: ✭ 192 (-92.8%)

Mutual labels: stream-processing

Learningapachespark

LearningApacheSpark

Stars: ✭ 155 (-94.19%)

Mutual labels: spark

Nmflibrary

MATLAB library for non-negative matrix factorization (NMF): Version 1.8.1

Stars: ✭ 153 (-94.27%)

Mutual labels: bigdata

Scanns

A scalable nearest neighbor search library in Apache Spark

Stars: ✭ 190 (-92.88%)

Mutual labels: spark

Sparkmonitor

Monitor Apache Spark from Jupyter Notebook

Stars: ✭ 154 (-94.23%)

Mutual labels: spark

Javainterview

最全的Java技术知识点，以及Java源码分析。为开源贡献自己的一份力。

Stars: ✭ 154 (-94.23%)

Mutual labels: bigdata

Sagemaker Spark

A Spark library for Amazon SageMaker.

Stars: ✭ 219 (-91.79%)

Mutual labels: spark

Js Spark

Realtime calculation distributed system. AKA distributed lodash

Stars: ✭ 187 (-92.99%)

Mutual labels: spark

Quill

Compile-time Language Integrated Queries for Scala

Stars: ✭ 1,998 (-25.11%)

Mutual labels: spark

Azuredatabricksbestpractices

Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs

Stars: ✭ 186 (-93.03%)

Mutual labels: spark

Spark.jl

Julia binding for Apache Spark

Stars: ✭ 153 (-94.27%)

Mutual labels: spark

Simple It English

Simple-IT-English: smart wordbook from community for community

Stars: ✭ 233 (-91.27%)

Mutual labels: bigdata

6.824 2017

⚡️ 6.824: Distributed Systems (Spring 2017). A course which present abstractions and implementation techniques for engineering distributed systems.

Stars: ✭ 219 (-91.79%)

Mutual labels: mapreduce

Powderkeg

Live-coding the cluster!

Stars: ✭ 152 (-94.3%)

Mutual labels: spark

Bats

面向 OLTP、OLAP、批处理、流处理场景的大一统 SQL 引擎

Stars: ✭ 152 (-94.3%)

Mutual labels: stream-processing

Spark Tsne

Distributed t-SNE via Apache Spark

Stars: ✭ 151 (-94.34%)

Mutual labels: spark

Roaringbitmap

A better compressed bitset in Java

Stars: ✭ 2,460 (-7.8%)

Mutual labels: spark

Spark Ml Source Analysis

spark ml 算法原理剖析以及具体的源码实现分析

Stars: ✭ 1,873 (-29.8%)

Mutual labels: spark

Spark Excel

A Spark plugin for reading Excel files via Apache POI

Stars: ✭ 216 (-91.9%)

Mutual labels: spark

Logrange

High performance data aggregating storage

Stars: ✭ 181 (-93.22%)

Mutual labels: stream-processing

Benchm Ml

A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).

Stars: ✭ 1,835 (-31.22%)

Mutual labels: spark

Aztk

AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure

Stars: ✭ 152 (-94.3%)

Mutual labels: spark

Hstream

The streaming database built for IoT data storage and real-time processing in the 5G Era

Stars: ✭ 166 (-93.78%)

Mutual labels: stream-processing

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples