456 Open source projects that are alternatives of or similar to spark-extension

sentry-spark
Apache Spark Sentry Integration
Stars: ✭ 14 (-44%)
Mutual labels:  spark
flask-spark-docker
Just a boilerplate for PySpark and Flask
Stars: ✭ 32 (+28%)
Mutual labels:  pyspark
Dpark
Python clone of Spark, a MapReduce-like framework in Python
Stars: ✭ 2,668 (+10572%)
Mutual labels:  spark
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+188%)
Mutual labels:  pyspark
tpch-spark
TPC-H queries in Apache Spark SQL using native DataFrames API
Stars: ✭ 63 (+152%)
Mutual labels:  spark
spark-twitter-sentiment-analysis
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Stars: ✭ 55 (+120%)
Mutual labels:  pyspark
sparkar-volts
An extensive non-reactive TypeScript framework that eases the development experience in Spark AR
Stars: ✭ 15 (-40%)
Mutual labels:  spark
soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (+132%)
Mutual labels:  pyspark
spark-acid
ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (+264%)
Mutual labels:  spark
isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (+12%)
Mutual labels:  pyspark
experiments
Code examples for my blog posts
Stars: ✭ 21 (-16%)
Mutual labels:  spark
smolder
HL7 Apache Spark Datasource
Stars: ✭ 33 (+32%)
Mutual labels:  spark
dlsa
Distributed least squares approximation (dlsa) implemented with Apache Spark
Stars: ✭ 25 (+0%)
Mutual labels:  pyspark
Video Stream Analytics
Stars: ✭ 240 (+860%)
Mutual labels:  spark
workshop-spark
Code for Spark workshops with a Docker development environment
Stars: ✭ 27 (+8%)
Mutual labels:  pyspark
splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (+624%)
Mutual labels:  spark
Koalas
Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+12076%)
Mutual labels:  spark
spark-word2vec
A parallel implementation of word2vec based on Spark
Stars: ✭ 24 (-4%)
Mutual labels:  spark
Every Single Day I Tldr
A daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (+896%)
Mutual labels:  spark
visualize-data-with-python
A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
Stars: ✭ 60 (+140%)
Mutual labels:  spark
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+888%)
Mutual labels:  spark
frovedis
Framework of vectorized and distributed data analytics
Stars: ✭ 59 (+136%)
Mutual labels:  spark
Neo4j Spark Connector
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Stars: ✭ 245 (+880%)
Mutual labels:  spark
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (+36%)
Mutual labels:  pyspark
Recommendationsystem
Book recommender system using collaborative filtering based on Spark
Stars: ✭ 244 (+876%)
Mutual labels:  spark
shamash
Autoscaling for Google Cloud Dataproc
Stars: ✭ 31 (+24%)
Mutual labels:  spark
Hadoop Docker
Hadoop development and test environment built on Docker, including Hadoop, Hive, HBase, and Spark
Stars: ✭ 238 (+852%)
Mutual labels:  spark
machine-learning-course
Machine Learning Course @ Santa Clara University
Stars: ✭ 17 (-32%)
Mutual labels:  pyspark
Azure Event Hubs
☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs
Stars: ✭ 233 (+832%)
Mutual labels:  spark
Mastering Spark Sql Book
The Internals of Spark SQL
Stars: ✭ 234 (+836%)
Mutual labels:  spark
Casper
A compiler for automatically re-targeting sequential Java code to Apache Spark.
Stars: ✭ 45 (+80%)
Mutual labels:  spark
visions
Type System for Data Analysis in Python
Stars: ✭ 136 (+444%)
Mutual labels:  spark
Spark-PMoF
Spark Shuffle Optimization with RDMA+AEP
Stars: ✭ 28 (+12%)
Mutual labels:  spark
Spark-and-Kafka IoT-Data-Processing-and-Analytics
Final project for the IoT: Big Data Processing and Analytics class. Analyzing U.S. nationwide temperature from IoT sensors in real time
Stars: ✭ 42 (+68%)
Mutual labels:  pyspark
lineage
Generate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-36%)
Mutual labels:  pyspark
Installations mac ubuntu windows
Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).
Stars: ✭ 231 (+824%)
Mutual labels:  spark
Mydatascienceportfolio
Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (+808%)
Mutual labels:  spark
DataEngineering
This repo contains commands that data engineers use in day to day work.
Stars: ✭ 47 (+88%)
Mutual labels:  pyspark
Spark.fish
▁▂▄▆▇█▇▆▄▂▁
Stars: ✭ 229 (+816%)
Mutual labels:  spark
Spark Workshop
Apache Spark™ and Scala Workshops
Stars: ✭ 224 (+796%)
Mutual labels:  spark
Search Ads Web Service
Online search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]
Stars: ✭ 30 (+20%)
Mutual labels:  spark
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+1796%)
Mutual labels:  pyspark
Ruby Spark
Ruby wrapper for Apache Spark
Stars: ✭ 221 (+784%)
Mutual labels:  spark
Sagemaker Spark
A Spark library for Amazon SageMaker.
Stars: ✭ 219 (+776%)
Mutual labels:  spark
Springboard-Data-Science-Immersive
No description or website provided.
Stars: ✭ 52 (+108%)
Mutual labels:  pyspark
Spark Excel
A Spark plugin for reading Excel files via Apache POI
Stars: ✭ 216 (+764%)
Mutual labels:  spark
BigData-News
Real-time big data system project for a news site, based on Spark 2.2
Stars: ✭ 36 (+44%)
Mutual labels:  spark
yuzhouwan
Code Library for My Blog
Stars: ✭ 39 (+56%)
Mutual labels:  spark
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-32%)
Mutual labels:  pyspark
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+760%)
Mutual labels:  spark
Hydro Serving
MLOps Platform
Stars: ✭ 213 (+752%)
Mutual labels:  spark
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+0%)
Mutual labels:  pyspark
Example Spark
Spark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (+720%)
Mutual labels:  spark
Spark Knn
k-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (+720%)
Mutual labels:  spark
spark-gradle-template
Apache Spark in your IDE with gradle
Stars: ✭ 39 (+56%)
Mutual labels:  spark
Spark-for-data-engineers
Apache Spark for data engineers
Stars: ✭ 22 (-12%)
Mutual labels:  pyspark
Javaorbigdata Interview
A compilation of interview topics for Java developers and big data developers
Stars: ✭ 203 (+712%)
Mutual labels:  spark
check-engine
Data validation library for PySpark 3.0.0
Stars: ✭ 29 (+16%)
Mutual labels:  pyspark
Ballista
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+8996%)
Mutual labels:  spark
spark-demos
Collection of different demo applications using Apache Spark
Stars: ✭ 15 (-40%)
Mutual labels:  spark