All Projects → Azure Cosmosdb Spark → Similar Projects or Alternatives

6452 Open source projects that are alternatives of or similar to Azure Cosmosdb Spark

Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-9.09%)
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (-86.06%)
Mutual labels:  jupyter-notebook, spark, pyspark
Pysparkgeoanalysis
🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-61.82%)
Mutual labels:  jupyter-notebook, spark, pyspark
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+478.18%)
Mutual labels:  jupyter-notebook, spark, pyspark
Live log analyzer spark
Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-91.52%)
Mutual labels:  spark, apache-spark, pyspark
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-61.21%)
Mutual labels:  jupyter-notebook, spark, pyspark
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-32.73%)
Mutual labels:  spark, apache-spark, pyspark
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+497.58%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+21.21%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+710.91%)
Mutual labels:  jupyter-notebook, spark, pyspark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-4.24%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Jupyter Aws
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (+56.97%)
Mutual labels:  jupyter-notebook, spark, apache-spark
Pyspark Learning
Updated repository
Stars: ✭ 147 (-10.91%)
Mutual labels:  jupyter-notebook, spark, pyspark
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+1656.97%)
Mutual labels:  spark, pyspark, apache-spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+150.3%)
Mutual labels:  jupyter-notebook, spark, apache-spark
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-15.15%)
Mutual labels:  spark, apache-spark, connector
Datahacksummit 2017
Apache Zeppelin notebooks for Recommendation Engines using Keras and Machine Learning on Apache Spark
Stars: ✭ 30 (-81.82%)
Mutual labels:  jupyter-notebook, apache-spark
Spark Flamegraph
Easy CPU Profiling for Apache Spark applications
Stars: ✭ 30 (-81.82%)
Mutual labels:  spark, apache-spark
Real Time Stream Processing Engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
Stars: ✭ 37 (-77.58%)
Mutual labels:  spark, apache-spark
Spark As Service Using Embedded Server
This application comes as Spark2.1-as-Service-Provider using an embedded, Reactive-Streams-based, fully asynchronous HTTP server
Stars: ✭ 46 (-72.12%)
Mutual labels:  spark, apache-spark
Pixiedust
Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (+504.85%)
Mutual labels:  jupyter-notebook, spark
Apache Spark Internals
The Internals of Apache Spark
Stars: ✭ 1,045 (+533.33%)
Mutual labels:  spark, apache-spark
Awesome Spark
A curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+543.03%)
Mutual labels:  apache-spark, pyspark
Data Science Cookbook
🎓 Jupyter notebooks from UFC data science course
Stars: ✭ 60 (-63.64%)
Mutual labels:  jupyter-notebook, spark
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-56.97%)
Mutual labels:  jupyter-notebook, spark
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (-49.7%)
Mutual labels:  spark, apache-spark
Spark Nlp Models
Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-46.67%)
Mutual labels:  jupyter-notebook, spark
Tedsds
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-91.52%)
Mutual labels:  jupyter-notebook, spark
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-66.67%)
Mutual labels:  spark, apache-spark
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-49.09%)
Mutual labels:  jupyter-notebook, spark
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (-46.06%)
Mutual labels:  jupyter-notebook, spark
Bitcoin Value Predictor
[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (-44.85%)
Mutual labels:  jupyter-notebook, pyspark
Spark Tda
SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (-72.73%)
Mutual labels:  spark, apache-spark
Spark Examples
Spark examples
Stars: ✭ 41 (-75.15%)
Mutual labels:  spark, apache-spark
Spark Nkp
Natural Korean Processor for Apache Spark
Stars: ✭ 50 (-69.7%)
Mutual labels:  spark, apache-spark
Sparkling Titanic
Training models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-92.73%)
Mutual labels:  spark, pyspark
Pyspark Examples
Code examples on Apache Spark using python
Stars: ✭ 58 (-64.85%)
Mutual labels:  jupyter-notebook, spark
Awesome Pulsar
A curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-65.45%)
Mutual labels:  spark, apache-spark
Spark python ml examples
Spark 2.0 Python Machine Learning examples
Stars: ✭ 87 (-47.27%)
Mutual labels:  spark, pyspark
Cuesheet
A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-47.88%)
Mutual labels:  spark, apache-spark
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+463.03%)
Mutual labels:  spark, apache-spark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-0.61%)
Mutual labels:  spark, apache-spark
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-36.36%)
Mutual labels:  spark, apache-spark
Almond
A Scala kernel for Jupyter
Stars: ✭ 1,354 (+720.61%)
Mutual labels:  jupyter-notebook, spark
Pyspark Stubs
Apache (Py)Spark type annotations (stub files).
Stars: ✭ 98 (-40.61%)
Mutual labels:  apache-spark, pyspark
Hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-34.55%)
Mutual labels:  spark, pyspark
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+978.79%)
Mutual labels:  spark, apache-spark
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-34.55%)
Mutual labels:  spark, pyspark
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-22.42%)
Mutual labels:  jupyter-notebook, apache-spark
Eat pyspark in 10 days
pyspark🍒🥭 is delicious,just eat it!😋😋
Stars: ✭ 116 (-29.7%)
Mutual labels:  spark, pyspark
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+943.03%)
Mutual labels:  spark, apache-spark
Spark Tpc Ds Performance Test
Use the TPC-DS benchmark to test Spark SQL performance
Stars: ✭ 133 (-19.39%)
Mutual labels:  jupyter-notebook, apache-spark
Relation extraction
Relation Extraction using Deep learning(CNN)
Stars: ✭ 96 (-41.82%)
Mutual labels:  spark, pyspark
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-32.12%)
Mutual labels:  jupyter-notebook, spark
Repo 2019
BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
Stars: ✭ 133 (-19.39%)
Mutual labels:  jupyter-notebook, pyspark
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (-16.97%)
Mutual labels:  spark, apache-spark
Cc Pyspark
Process Common Crawl data with Python and Spark
Stars: ✭ 147 (-10.91%)
Mutual labels:  spark, pyspark
Data science blogs
A repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-15.76%)
Mutual labels:  jupyter-notebook, spark
Learningapachespark
LearningApacheSpark
Stars: ✭ 155 (-6.06%)
Mutual labels:  spark, pyspark
Scalable Data Science Platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Stars: ✭ 158 (-4.24%)
Mutual labels:  jupyter-notebook, spark
1-60 of 6452 similar projects