Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

Stars: ✭ 247 (+105.83%)

Mutual labels: spark, spark-streaming

Real Time Stream Processing Engine

This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.

Stars: ✭ 37 (-69.17%)

Mutual labels: spark, spark-streaming

Pyspark Examples

Code examples on Apache Spark using python

Stars: ✭ 58 (-51.67%)

Mutual labels: spark, spark-streaming

Spark Py Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 1,338 (+1015%)

Mutual labels: spark

Pyspark Cheatsheet

🐍 Quick reference guide to common patterns & functions in PySpark.

Stars: ✭ 108 (-10%)

Mutual labels: spark

Spark Summit 2017 Sanfrancisco

spark summit 2017 SanFrancisco

Stars: ✭ 93 (-22.5%)

Mutual labels: spark

Spark On Kubernetes Helm

Spark on Kubernetes infrastructure Helm charts repo

Stars: ✭ 92 (-23.33%)

Mutual labels: spark

Archivespark

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

Stars: ✭ 111 (-7.5%)

Mutual labels: spark

Flink Learning

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例，还有 Flink 落地应用的大型项目案例（PVUV、日志存储、百亿数据实时去重、监控告警）分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》

Stars: ✭ 11,378 (+9381.67%)

Mutual labels: spark

Ammonite Spark

Run spark calculations from Ammonite

Stars: ✭ 88 (-26.67%)

Mutual labels: spark

Relation extraction

Relation Extraction using Deep learning(CNN)

Stars: ✭ 96 (-20%)

Mutual labels: spark

Distributed Dataset

A distributed data processing framework in Haskell.

Stars: ✭ 108 (-10%)

Mutual labels: spark

Repository

个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。

Stars: ✭ 92 (-23.33%)

Mutual labels: spark

Python Bigdata

Data science and Big Data with Python

Stars: ✭ 112 (-6.67%)

Mutual labels: spark

Big Data

🔧 Use dplyr to analyze Big Data 🐘

Stars: ✭ 93 (-22.5%)

Mutual labels: spark

Hnswlib

Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs

Stars: ✭ 108 (-10%)

Mutual labels: spark

Udacity Data Engineering

Udacity Data Engineering Nano Degree (DEND)

Stars: ✭ 89 (-25.83%)

Mutual labels: spark

Spark Lucenerdd

Spark RDD with Lucene's query and entity linkage capabilities

Stars: ✭ 114 (-5%)

Mutual labels: spark

Spark Nlp Models

Models and Pipelines for the Spark NLP library

Stars: ✭ 88 (-26.67%)

Mutual labels: spark

Seldon Server

Machine Learning Platform and Recommendation Engine built on Kubernetes

Stars: ✭ 1,435 (+1095.83%)

Mutual labels: spark

Spark python ml examples

Spark 2.0 Python Machine Learning examples

Stars: ✭ 87 (-27.5%)

Mutual labels: spark

Laravel Spark Google2fa

Google Authenticator support for Laravel Spark

Stars: ✭ 86 (-28.33%)

Mutual labels: spark

Elephas

Distributed Deep learning with Keras & Spark

Stars: ✭ 1,521 (+1167.5%)

Mutual labels: spark

Logigsk

A Linux based software package to control led's on Logitech G910, G810, G610 and G410.

Stars: ✭ 107 (-10.83%)

Mutual labels: spark

Cuesheet

A framework for writing Spark 2.x applications in a pretty way

Stars: ✭ 86 (-28.33%)

Mutual labels: spark

Flint

Webex Bot SDK for Node.js (deprecated in favor of https://github.com/webex/webex-bot-node-framework)

Stars: ✭ 85 (-29.17%)

Mutual labels: spark

Spark On K8s Operator

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Stars: ✭ 1,780 (+1383.33%)

Mutual labels: spark

Hops Examples

Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops

Stars: ✭ 84 (-30%)

Mutual labels: spark

Ibis

A pandas-like deferred expression system, with first-class SQL support

Stars: ✭ 1,630 (+1258.33%)

Mutual labels: spark

Spring Shiro Spark

Spring-Shiro-Spark是Spring-Boot Hibernate Spark Spark-SQL Shiro iView VueJs... ...的集成尝试

Stars: ✭ 114 (-5%)

Mutual labels: spark

Lambda Arch

Applying Lambda Architecture with Spark, Kafka, and Cassandra.

Stars: ✭ 111 (-7.5%)

Mutual labels: spark

Sparktutorial

Source code for James Lee's Aparch Spark with Java course

Stars: ✭ 105 (-12.5%)

Mutual labels: spark

Hadoop cookbook

Cookbook to install Hadoop 2.0+ using Chef

Stars: ✭ 82 (-31.67%)

Mutual labels: spark

Spark Dependencies

Spark job for dependency links

Stars: ✭ 82 (-31.67%)

Mutual labels: spark

Splash

Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange

Stars: ✭ 105 (-12.5%)

Mutual labels: spark

Mleap

MLeap: Deploy ML Pipelines to Production

Stars: ✭ 1,232 (+926.67%)

Mutual labels: spark

Lehar

Visualize data using relative ordering

Stars: ✭ 81 (-32.5%)

Mutual labels: spark

Spark Terasort

Stars: ✭ 101 (-15.83%)

Mutual labels: spark

Spark Gbtlr

Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark

Stars: ✭ 81 (-32.5%)

Mutual labels: spark

Setl

A simple Spark-powered ETL framework that just works 🍺

Stars: ✭ 79 (-34.17%)

Mutual labels: spark

Spark Ffm

FFM (Field-Awared Factorization Machine) on Spark

Stars: ✭ 101 (-15.83%)

Mutual labels: spark

Docker Spark

🚢 Docker image for Apache Spark

Stars: ✭ 78 (-35%)

Mutual labels: spark

1-60 of 435 similar projects

›

next*5