Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

Stars: ✭ 54 (+170%)

Mutual labels: spark

docker-lamp

(Linux) + Apache + MariaDB (MySQL) + PHP 7 on Docker.

Stars: ✭ 46 (+130%)

Mutual labels: mariadb

tpch-spark

TPC-H queries in Apache Spark SQL using native DataFrames API

Stars: ✭ 63 (+215%)

Mutual labels: spark

vue3.0-elemenplus-admin-template

一个基于Vue3.0和Element-plus的后台管理模板，一个使用Koa2作为后台程序使用MongoDB作为缓存数据库和MariaDB作为数据的后台管理模板系统

Stars: ✭ 20 (+0%)

Mutual labels: mariadb

trembita

Model complex data transformation pipelines easily

Stars: ✭ 44 (+120%)

Mutual labels: spark

ODSC India 2018

My presentation at ODSC India 2018 about Deep Learning with Apache Spark

Stars: ✭ 26 (+30%)

Mutual labels: spark

BigData-News

基于Spark2.2新闻网大数据实时系统项目

Stars: ✭ 36 (+80%)

Mutual labels: spark

cockpit-sql-driver

SQL Driver for Cockpit CMS

Stars: ✭ 28 (+40%)

Mutual labels: mariadb

Spark-Ar

Resources for Spark AR

Stars: ✭ 43 (+115%)

Mutual labels: spark

docker-compose-lemp-stack

Docker Compose Linux Nginx MariaDB PHP7.2 Stack

Stars: ✭ 55 (+175%)

Mutual labels: mariadb

sentry-spark

Apache Spark Sentry Integration

Stars: ✭ 14 (-30%)

Mutual labels: spark

Covid19Tracker

A Robinhood style COVID-19 🦠 Android tracking app for the US. Open source and built with Kotlin.

Stars: ✭ 65 (+225%)

Mutual labels: spark

spark-acid

ACID Data Source for Apache Spark based on Hive ACID

Stars: ✭ 91 (+355%)

Mutual labels: spark

Casper

A compiler for automatically re-targeting sequential Java code to Apache Spark.

Stars: ✭ 45 (+125%)

Mutual labels: spark

spark-word2vec

A parallel implementation of word2vec based on Spark

Stars: ✭ 24 (+20%)

Mutual labels: spark

spark-data-sources

Developing Spark External Data Sources using the V2 API

Stars: ✭ 36 (+80%)

Mutual labels: spark

MySQL Module

MySQL connector to Godot Engine.

Stars: ✭ 30 (+50%)

Mutual labels: mariadb

smolder

HL7 Apache Spark Datasource

Stars: ✭ 33 (+65%)

Mutual labels: spark

nifi-fds

Mirror of Apache NiFi Flow Design System

Stars: ✭ 25 (+25%)

Mutual labels: nifi

SparkV

🤖⚡ | The most POWERFUL multipurpose chat/meme bot that will boost the activity in your server.

Stars: ✭ 24 (+20%)

Mutual labels: spark

docker-redmine-orchestration

🐳 An easy docker-compose for Redmine (Nginx + Unicorn + MariaDB)

Stars: ✭ 18 (-10%)

Mutual labels: mariadb

spark-demos

Collection of different demo applications using Apache Spark

Stars: ✭ 15 (-25%)

Mutual labels: spark

prometheus-mysql-exporter

Prometheus MySQL Exporter

Stars: ✭ 33 (+65%)

Mutual labels: mariadb

Spotify-Song-Recommendation-ML

UC Berkeley team's submission for RecSys Challenge 2018

Stars: ✭ 70 (+250%)

Mutual labels: spark

spark-gradle-template

Apache Spark in your IDE with gradle

Stars: ✭ 39 (+95%)

Mutual labels: spark

NiFi-Rule-engine-processor

Drools processor for Apache NiFi

Stars: ✭ 34 (+70%)

Mutual labels: nifi

spark-util

low-level helpers for Apache Spark libraries and tests

Stars: ✭ 16 (-20%)

Mutual labels: spark

mmb

Set of Dockerfiles and assets related to them for building Docker images with different services

Stars: ✭ 34 (+70%)

Mutual labels: mariadb

data processing course

Some class materials for a data processing course using PySpark

Stars: ✭ 50 (+150%)

Mutual labels: spark

incubator-linkis

Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,459 (+12195%)

Mutual labels: spark

awesome-AI-kubernetes

❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc

Stars: ✭ 95 (+375%)

Mutual labels: spark

bigkube

Minikube for big data with Scala and Spark

Stars: ✭ 16 (-20%)

Mutual labels: spark

spark-druid-olap

Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.

Stars: ✭ 286 (+1330%)

Mutual labels: spark

Spark-PMoF

Spark Shuffle Optimization with RDMA+AEP

Stars: ✭ 28 (+40%)

Mutual labels: spark

swordfish

Open-source distribute workflow schedule tools, also support streaming task.

Stars: ✭ 35 (+75%)

Mutual labels: spark

spark-extension

A library that provides useful extensions to Apache Spark and PySpark.

Stars: ✭ 25 (+25%)

Mutual labels: spark

leaflet heatmap

简单的可视化湖州通话数据假设数据量很大，没法用浏览器直接绘制热力图，把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后，再使用Apache Spark绘制热力图，然后用leafletjs加载OpenStreetMap图层和热力图图层，以达到良好的交互效果。现在使用Apache Spark实现绘制，可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法，并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .

Stars: ✭ 13 (-35%)

Mutual labels: spark

dllib

dllib is a distributed deep learning library running on Apache Spark

Stars: ✭ 32 (+60%)

Mutual labels: spark

spark learning

尚硅谷大数据Spark-2019版最新 Spark 学习

Stars: ✭ 42 (+110%)

Mutual labels: spark

confluent-spark-avro

Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.