Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

Stars: ✭ 247 (+888%)

Mutual labels: spark

frovedis

Framework of vectorized and distributed data analytics

Stars: ✭ 59 (+136%)

Mutual labels: spark

Neo4j Spark Connector

Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs

Stars: ✭ 245 (+880%)

Mutual labels: spark

big data

A collection of tutorials on Hadoop, MapReduce, Spark, Docker

Stars: ✭ 34 (+36%)

Mutual labels: pyspark

Recommendationsystem

Book recommender system using collaborative filtering based on Spark

Stars: ✭ 244 (+876%)

Mutual labels: spark

shamash

Autoscaling for Google Cloud Dataproc

Stars: ✭ 31 (+24%)

Mutual labels: spark

Hadoop Docker

基于Docker构建的Hadoop开发测试环境，包含Hadoop，Hive，HBase，Spark

Stars: ✭ 238 (+852%)

Mutual labels: spark

machine-learning-course

Machine Learning Course @ Santa Clara University

Stars: ✭ 17 (-32%)

Mutual labels: pyspark

Azure Event Hubs

☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs

Stars: ✭ 233 (+832%)

Mutual labels: spark

Mastering Spark Sql Book

The Internals of Spark SQL

Stars: ✭ 234 (+836%)

Mutual labels: spark

Casper

A compiler for automatically re-targeting sequential Java code to Apache Spark.

Stars: ✭ 45 (+80%)

Mutual labels: spark

visions

Type System for Data Analysis in Python

Stars: ✭ 136 (+444%)

Mutual labels: spark

Spark-PMoF

Spark Shuffle Optimization with RDMA+AEP

Stars: ✭ 28 (+12%)

Mutual labels: spark

Spark-and-Kafka IoT-Data-Processing-and-Analytics

Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time

Stars: ✭ 42 (+68%)

Mutual labels: pyspark

lineage

Generate beautiful documentation for your data pipelines in markdown format

Stars: ✭ 16 (-36%)

Mutual labels: pyspark

Installations mac ubuntu windows

Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).

Stars: ✭ 231 (+824%)

Mutual labels: spark

Mydatascienceportfolio

Applying Data Science and Machine Learning to Solve Real World Business Problems

Stars: ✭ 227 (+808%)

Mutual labels: spark

DataEngineering

This repo contains commands that data engineers use in day to day work.

Stars: ✭ 47 (+88%)

Mutual labels: pyspark

Spark.fish

▁▂▄▆▇█▇▆▄▂▁

Stars: ✭ 229 (+816%)

Mutual labels: spark

Spark Workshop

Apache Spark™ and Scala Workshops

Stars: ✭ 224 (+796%)

Mutual labels: spark

Search Ads Web Service

Online search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]

Stars: ✭ 30 (+20%)

Mutual labels: spark

kuwala

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…

Stars: ✭ 474 (+1796%)

Mutual labels: pyspark

Ruby Spark

Ruby wrapper for Apache Spark

Stars: ✭ 221 (+784%)

Mutual labels: spark

Sagemaker Spark

A Spark library for Amazon SageMaker.

Stars: ✭ 219 (+776%)

Mutual labels: spark

Springboard-Data-Science-Immersive

No description or website provided.