All Projects → Quinn → Similar Projects or Alternatives

200 Open source projects that are alternatives of or similar to Quinn

Pyspark Boilerplate
A boilerplate for writing PySpark Jobs
Stars: ✭ 318 (+46.54%)
Mutual labels:  apache-spark, pyspark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (-23.96%)
Mutual labels:  apache-spark, pyspark
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-82.03%)
Mutual labels:  apache-spark, pyspark
isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (-87.1%)
Mutual labels:  apache-spark, pyspark
Awesome Spark
A curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+388.94%)
Mutual labels:  apache-spark, pyspark
pyspark-cheatsheet
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (-47%)
Mutual labels:  apache-spark, pyspark
Pyspark Stubs
Apache (Py)Spark type annotations (stub files).
Stars: ✭ 98 (-54.84%)
Mutual labels:  apache-spark, pyspark
spark3D
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Stars: ✭ 23 (-89.4%)
Mutual labels:  apache-spark, pyspark
pyspark-asyncactions
Asynchronous actions for PySpark
Stars: ✭ 30 (-86.18%)
Mutual labels:  apache-spark, pyspark
spark-twitter-sentiment-analysis
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Stars: ✭ 55 (-74.65%)
Mutual labels:  apache-spark, pyspark
learn-by-examples
Real-world Spark pipelines examples
Stars: ✭ 84 (-61.29%)
Mutual labels:  apache-spark, pyspark
mmtf-workshop-2018
Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (-76.96%)
Mutual labels:  apache-spark, pyspark
Spark-for-data-engineers
Apache Spark for data engineers
Stars: ✭ 22 (-89.86%)
Mutual labels:  apache-spark, pyspark
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-48.85%)
Mutual labels:  apache-spark, pyspark
jupyterlab-sparkmonitor
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (-64.06%)
Mutual labels:  apache-spark, pyspark
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+1235.94%)
Mutual labels:  pyspark, apache-spark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-30.88%)
Mutual labels:  apache-spark, pyspark
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+1446.08%)
Mutual labels:  apache-spark, pyspark
Sparkora
Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-76.5%)
Mutual labels:  apache-spark, pyspark
Spark Gotchas
Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
Stars: ✭ 308 (+41.94%)
Mutual labels:  apache-spark, pyspark
Live log analyzer spark
Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-93.55%)
Mutual labels:  apache-spark, pyspark
Scala Spark Tutorial
Project for James' Apache Spark with Scala course
Stars: ✭ 121 (-44.24%)
Mutual labels:  apache-spark
Learningapachespark
LearningApacheSpark
Stars: ✭ 155 (-28.57%)
Mutual labels:  pyspark
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-50.23%)
Mutual labels:  pyspark
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+720.28%)
Mutual labels:  apache-spark
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+1060.37%)
Mutual labels:  pyspark
Albedo
A recommender system for discovering GitHub repos, built with Apache Spark
Stars: ✭ 149 (-31.34%)
Mutual labels:  apache-spark
Docker Spark
Apache Spark docker image
Stars: ✭ 1,396 (+543.32%)
Mutual labels:  apache-spark
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (-41.94%)
Mutual labels:  pyspark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-27.19%)
Mutual labels:  pyspark
Eat pyspark in 10 days
pyspark🍒🥭 is delicious,just eat it!😋😋
Stars: ✭ 116 (-46.54%)
Mutual labels:  pyspark
Bigdata Playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-18.43%)
Mutual labels:  apache-spark
Hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-50.23%)
Mutual labels:  pyspark
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-51.61%)
Mutual labels:  apache-spark
Relation extraction
Relation Extraction using Deep learning(CNN)
Stars: ✭ 96 (-55.76%)
Mutual labels:  pyspark
Cc Pyspark
Process Common Crawl data with Python and Spark
Stars: ✭ 147 (-32.26%)
Mutual labels:  pyspark
Spark Iforest
Isolation Forest on Spark
Stars: ✭ 166 (-23.5%)
Mutual labels:  pyspark
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+516.59%)
Mutual labels:  pyspark
Pyspark Learning
Updated repository
Stars: ✭ 147 (-32.26%)
Mutual labels:  pyspark
Pyspark Tutorial
PySpark Code for Hands-on Learners
Stars: ✭ 91 (-58.06%)
Mutual labels:  pyspark
Bitcoin Value Predictor
[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (-58.06%)
Mutual labels:  pyspark
Parquetviewer
Simple windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (-33.18%)
Mutual labels:  apache-spark
Spark python ml examples
Spark 2.0 Python Machine Learning examples
Stars: ✭ 87 (-59.91%)
Mutual labels:  pyspark
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-0.92%)
Mutual labels:  apache-spark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-7.83%)
Mutual labels:  pyspark
Cuesheet
A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-60.37%)
Mutual labels:  apache-spark
Oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+722.58%)
Mutual labels:  apache-spark
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (-61.75%)
Mutual labels:  apache-spark
Mlflow
Open source platform for the machine learning lifecycle
Stars: ✭ 10,898 (+4922.12%)
Mutual labels:  apache-spark
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (-33.64%)
Mutual labels:  apache-spark
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-70.51%)
Mutual labels:  pyspark
Pysparkgeoanalysis
🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-70.97%)
Mutual labels:  pyspark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-24.42%)
Mutual labels:  apache-spark
Scalable Data Science
Scalable Data Science, course sets in big data Using Apache Spark over databricks and their mathematical, statistical and computational foundations using SageMath.
Stars: ✭ 142 (-34.56%)
Mutual labels:  apache-spark
Petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Stars: ✭ 1,108 (+410.6%)
Mutual labels:  pyspark
Awesome Pulsar
A curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-73.73%)
Mutual labels:  apache-spark
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-35.48%)
Mutual labels:  apache-spark
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-74.65%)
Mutual labels:  apache-spark
Analytics Zoo
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
Stars: ✭ 2,448 (+1028.11%)
Mutual labels:  apache-spark
Spark Atlas Connector
A Spark Atlas connector to track data lineage in Apache Atlas
Stars: ✭ 160 (-26.27%)
Mutual labels:  apache-spark
1-60 of 200 similar projects