❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++, etc.

Stars: ✭ 95 (+163.89%)

Mutual labels: spark

Video Stream Analytics

Stars: ✭ 240 (+566.67%)

Mutual labels: spark

Covid19Tracker

A Robinhood style COVID-19 🦠 Android tracking app for the US. Open source and built with Kotlin.

Stars: ✭ 65 (+80.56%)

Mutual labels: spark

Azure Event Hubs

☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs

Stars: ✭ 233 (+547.22%)

Mutual labels: spark

ODSC India 2018

My presentation at ODSC India 2018 about Deep Learning with Apache Spark

Stars: ✭ 26 (-27.78%)

Mutual labels: spark

Installations mac ubuntu windows

Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).

Stars: ✭ 231 (+541.67%)

Mutual labels: spark

BigData-News

A real-time big data system project for a news website, based on Spark 2.2

Stars: ✭ 36 (+0%)

Mutual labels: spark

Spark.fish

▁▂▄▆▇█▇▆▄▂▁

Stars: ✭ 229 (+536.11%)

Mutual labels: spark

sparkar-volts

An extensive non-reactive TypeScript framework that eases the development experience in Spark AR

Stars: ✭ 15 (-58.33%)

Mutual labels: spark

Ruby Spark

Ruby wrapper for Apache Spark

Stars: ✭ 221 (+513.89%)

Mutual labels: spark

bigdata-fun

A complete (distributed) BigData stack, running in containers

Stars: ✭ 14 (-61.11%)

Mutual labels: spark

Spark Excel

A Spark plugin for reading Excel files via Apache POI

Stars: ✭ 216 (+500%)

Mutual labels: spark

experiments

Code examples for my blog posts

Stars: ✭ 21 (-41.67%)

Mutual labels: spark

Sparkrdma

RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark

Stars: ✭ 215 (+497.22%)

Mutual labels: spark

kafka-compose

🎼 Docker compose files for various kafka stacks

Stars: ✭ 32 (-11.11%)

Mutual labels: spark

Example Spark

Spark, Spark Streaming and Spark SQL unit testing strategies

Stars: ✭ 205 (+469.44%)

Mutual labels: spark

splink

Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters

Stars: ✭ 181 (+402.78%)

Mutual labels: spark

Javaorbigdata Interview

A compilation of interview knowledge points for Java developers and big data developers

Stars: ✭ 203 (+463.89%)

Mutual labels: spark

bigkube

Minikube for big data with Scala and Spark

Stars: ✭ 16 (-55.56%)

Mutual labels: spark

Spark Practice

Apache Spark (PySpark) Practice on Real Data

Stars: ✭ 200 (+455.56%)

Mutual labels: spark

visualize-data-with-python

A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.

Stars: ✭ 60 (+66.67%)

Mutual labels: spark

Scanns

A scalable nearest neighbor search library in Apache Spark

Stars: ✭ 190 (+427.78%)

Mutual labels: spark

docker-spark

Apache Spark docker container image (Standalone mode)

Stars: ✭ 34 (-5.56%)

Mutual labels: spark

Azuredatabricksbestpractices

Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs

Stars: ✭ 186 (+416.67%)

Mutual labels: spark

big data

A collection of tutorials on Hadoop, MapReduce, Spark, Docker

Stars: ✭ 34 (-5.56%)

Mutual labels: spark-sql

Roaringbitmap

A better compressed bitset in Java

Stars: ✭ 2,460 (+6733.33%)

Mutual labels: spark

smolder

HL7 Apache Spark Datasource

Stars: ✭ 33 (-8.33%)

Mutual labels: spark

Sparkstreaming

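💥 🚀 Wraps Spark Streaming to adjust batch time dynamically (computation runs whenever data is available); 🚀 supports adding and removing topics at runtime; 🚀 wraps Spark Streaming 1.6 with Kafka 0.10 to support SSL.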

Stars: ✭ 179 (+397.22%)

Mutual labels: spark

spark-sql-internals

The Internals of Spark SQL

Stars: ✭ 331 (+819.44%)

Mutual labels: spark-sql

Spark Kafka Writer

Write your Spark data to Kafka seamlessly

Stars: ✭ 175 (+386.11%)

Mutual labels: spark

Python Master Courses

Life is short, I use Python

Stars: ✭ 61 (+69.44%)

Mutual labels: spark

Spark

Firely's open source FHIR server

Stars: ✭ 174 (+383.33%)

Mutual labels: spark

Real-time-Data-Warehouse

Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi

Stars: ✭ 52 (+44.44%)

Mutual labels: spark-sql

Deeplearning4j

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…

Stars: ✭ 12,277 (+34002.78%)

Mutual labels: spark

SparkV

🤖⚡ | The most POWERFUL multipurpose chat/meme bot that will boost the activity in your server.

Stars: ✭ 24 (-33.33%)

Mutual labels: spark

Spark Structured Streaming Examples

Spark Structured Streaming / Kafka / Cassandra / Elastic

Stars: ✭ 168 (+366.67%)

Mutual labels: spark

wow-spark

🔆 A Spark self-study handbook covering Spark Core, Spark SQL, Spark Streaming, spark-kafka, and delta-lake, plus basic Scala exercises and source-code analyses (e.g. master, shuffle), summaries, and translations.

Stars: ✭ 20 (-44.44%)

Mutual labels: spark-sql

Geopyspark

GeoTrellis for PySpark

Stars: ✭ 167 (+363.89%)

Mutual labels: spark

spark-sql-flow-plugin

Visualize column-level data lineage in Spark SQL

Stars: ✭ 20 (-44.44%)

Mutual labels: spark

Big Whale

Scheduling of offline jobs (Spark, Flink, etc.) and monitoring of real-time jobs

Stars: ✭ 163 (+352.78%)

Mutual labels: spark

databricks-notebooks

Collection of Databricks and Jupyter Notebooks

Stars: ✭ 19 (-47.22%)

Mutual labels: spark-sql

Bigdata docker

Big Data Ecosystem Docker

Stars: ✭ 161 (+347.22%)

Mutual labels: spark

spark-demos

Collection of different demo applications using Apache Spark

Stars: ✭ 15 (-58.33%)

Mutual labels: spark

Vue Info Card

Simple and beautiful card component with an elegant spark line, for VueJS.

Stars: ✭ 159 (+341.67%)

Mutual labels: spark

spark-kubernetes

Spark on Kubernetes

Stars: ✭ 80 (+122.22%)

Mutual labels: spark

datalake-etl-pipeline

Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations

Stars: ✭ 39 (+8.33%)

Mutual labels: spark-sql

opaque-sql

An encrypted data analytics platform

Stars: ✭ 169 (+369.44%)

Mutual labels: spark-sql

arcgis-experience-builder-sdk-resources

ArcGIS Experience Builder samples

Stars: ✭ 47 (+30.56%)

Mutual labels: data-sources

prosto

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

Stars: ✭ 54 (+50%)

Mutual labels: spark

confluent-spark-avro

Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.

Stars: ✭ 18 (-50%)

Mutual labels: spark

61-120 of 419 similar projects
