All Projects → spark-data-sources → Similar Projects or Alternatives

419 Open source projects that are alternatives of or similar to spark-data-sources

tpch-spark
TPC-H queries in Apache Spark SQL using native DataFrames API
Stars: ✭ 63 (+75%)
Mutual labels:  spark
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (-5.56%)
Mutual labels:  spark-sql
spark-gradle-template
Apache Spark in your IDE with gradle
Stars: ✭ 39 (+8.33%)
Mutual labels:  spark
Spark Jobserver
REST job server for Apache Spark
Stars: ✭ 2,748 (+7533.33%)
Mutual labels:  spark
trembita
Model complex data transformation pipelines easily
Stars: ✭ 44 (+22.22%)
Mutual labels:  spark
Spark Fast Tests
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Stars: ✭ 249 (+591.67%)
Mutual labels:  spark
openverse-catalog
Identifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-25%)
Mutual labels:  spark
Hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (+583.33%)
Mutual labels:  spark
frovedis
Framework of vectorized and distributed data analytics
Stars: ✭ 59 (+63.89%)
Mutual labels:  spark
Dpark
Python clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+7311.11%)
Mutual labels:  spark
awesome-AI-kubernetes
❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (+163.89%)
Mutual labels:  spark
Video Stream Analytics
Stars: ✭ 240 (+566.67%)
Mutual labels:  spark
Covid19Tracker
A Robinhood style COVID-19 🦠 Android tracking app for the US. Open source and built with Kotlin.
Stars: ✭ 65 (+80.56%)
Mutual labels:  spark
Azure Event Hubs
☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs
Stars: ✭ 233 (+547.22%)
Mutual labels:  spark
ODSC India 2018
My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-27.78%)
Mutual labels:  spark
Installations mac ubuntu windows
Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).
Stars: ✭ 231 (+541.67%)
Mutual labels:  spark
BigData-News
基于Spark2.2新闻网大数据实时系统项目
Stars: ✭ 36 (+0%)
Mutual labels:  spark
Spark.fish
▁▂▄▆▇█▇▆▄▂▁
Stars: ✭ 229 (+536.11%)
Mutual labels:  spark
sparkar-volts
An extensive non-reactive Typescript framework that eases the development experience in Spark AR
Stars: ✭ 15 (-58.33%)
Mutual labels:  spark
Ruby Spark
Ruby wrapper for Apache Spark
Stars: ✭ 221 (+513.89%)
Mutual labels:  spark
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-61.11%)
Mutual labels:  spark
Spark Excel
A Spark plugin for reading Excel files via Apache POI
Stars: ✭ 216 (+500%)
Mutual labels:  spark
experiments
Code examples for my blog posts
Stars: ✭ 21 (-41.67%)
Mutual labels:  spark
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+497.22%)
Mutual labels:  spark
kafka-compose
🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (-11.11%)
Mutual labels:  spark
Example Spark
Spark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (+469.44%)
Mutual labels:  spark
splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (+402.78%)
Mutual labels:  spark
Javaorbigdata Interview
Java开发者或者大数据开发者面试知识点整理
Stars: ✭ 203 (+463.89%)
Mutual labels:  spark
bigkube
Minikube for big data with Scala and Spark
Stars: ✭ 16 (-55.56%)
Mutual labels:  spark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+455.56%)
Mutual labels:  spark
visualize-data-with-python
A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
Stars: ✭ 60 (+66.67%)
Mutual labels:  spark
Scanns
A scalable nearest neighbor search library in Apache Spark
Stars: ✭ 190 (+427.78%)
Mutual labels:  spark
docker-spark
Apache Spark docker container image (Standalone mode)
Stars: ✭ 34 (-5.56%)
Mutual labels:  spark
Azuredatabricksbestpractices
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
Stars: ✭ 186 (+416.67%)
Mutual labels:  spark
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-5.56%)
Mutual labels:  spark-sql
Roaringbitmap
A better compressed bitset in Java
Stars: ✭ 2,460 (+6733.33%)
Mutual labels:  spark
smolder
HL7 Apache Spark Datasource
Stars: ✭ 33 (-8.33%)
Mutual labels:  spark
Sparkstreaming
💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (+397.22%)
Mutual labels:  spark
spark-sql-internals
The Internals of Spark SQL
Stars: ✭ 331 (+819.44%)
Mutual labels:  spark-sql
Spark Kafka Writer
Write your Spark data to Kafka seamlessly
Stars: ✭ 175 (+386.11%)
Mutual labels:  spark
Python Master Courses
人生苦短 我用Python
Stars: ✭ 61 (+69.44%)
Mutual labels:  spark
Spark
Firely's open source FHIR server
Stars: ✭ 174 (+383.33%)
Mutual labels:  spark
Real-time-Data-Warehouse
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
Stars: ✭ 52 (+44.44%)
Mutual labels:  spark-sql
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+34002.78%)
Mutual labels:  spark
SparkV
🤖⚡ | The most POWERFUL multipurpose chat/meme bot that will boost the activity in your server.
Stars: ✭ 24 (-33.33%)
Mutual labels:  spark
Spark Structured Streaming Examples
Spark Structured Streaming / Kafka / Cassandra / Elastic
Stars: ✭ 168 (+366.67%)
Mutual labels:  spark
wow-spark
🔆 spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。
Stars: ✭ 20 (-44.44%)
Mutual labels:  spark-sql
Geopyspark
GeoTrellis for PySpark
Stars: ✭ 167 (+363.89%)
Mutual labels:  spark
spark-sql-flow-plugin
Visualize column-level data lineage in Spark SQL
Stars: ✭ 20 (-44.44%)
Mutual labels:  spark
Big Whale
Spark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (+352.78%)
Mutual labels:  spark
databricks-notebooks
Collection of Databricks and Jupyter Notebooks
Stars: ✭ 19 (-47.22%)
Mutual labels:  spark-sql
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (+347.22%)
Mutual labels:  spark
spark-demos
Collection of different demo applications using Apache Spark
Stars: ✭ 15 (-58.33%)
Mutual labels:  spark
Vue Info Card
Simple and beautiful card component with an elegant spark line, for VueJS.
Stars: ✭ 159 (+341.67%)
Mutual labels:  spark
spark-kubernetes
spark on kubernetes
Stars: ✭ 80 (+122.22%)
Mutual labels:  spark
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+8.33%)
Mutual labels:  spark-sql
opaque-sql
An encrypted data analytics platform
Stars: ✭ 169 (+369.44%)
Mutual labels:  spark-sql
arcgis-experience-builder-sdk-resources
ArcGIS Experience Builder samples
Stars: ✭ 47 (+30.56%)
Mutual labels:  data-sources
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (+50%)
Mutual labels:  spark
confluent-spark-avro
Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
Stars: ✭ 18 (-50%)
Mutual labels:  spark
61-120 of 419 similar projects