All Projects → Azure-Databricks-NYC-Taxi-Workshop → Similar Projects or Alternatives

108 Open source projects that are alternatives of or similar to Azure-Databricks-NYC-Taxi-Workshop

databricks-notebooks
Collection of Databricks and Jupyter Notebooks
Stars: ✭ 19 (-73.24%)
Mutual labels:  pyspark, azure-databricks
python mozetl
ETL jobs for Firefox Telemetry
Stars: ✭ 25 (-64.79%)
Mutual labels:  pyspark
kafka-twitter-spark-streaming
Counting Tweets Per User in Real-Time
Stars: ✭ 38 (-46.48%)
Mutual labels:  pyspark
isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (-60.56%)
Mutual labels:  pyspark
flask-spark-docker
Just a boilerplate for PySpark and Flask
Stars: ✭ 32 (-54.93%)
Mutual labels:  pyspark
phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
Stars: ✭ 115 (+61.97%)
Mutual labels:  pyspark
aml-registermodel
GitHub Action that allows you to register models to your Azure Machine Learning Workspace.
Stars: ✭ 14 (-80.28%)
Mutual labels:  azure-machine-learning
Springboard-Data-Science-Immersive
No description or website provided.
Stars: ✭ 52 (-26.76%)
Mutual labels:  pyspark
pyspark-ML-in-Colab
Pyspark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (-54.93%)
Mutual labels:  pyspark
workshop-spark
Código para workshops Spark com ambiente de desenvolvimento em docker
Stars: ✭ 27 (-61.97%)
Mutual labels:  pyspark
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+204.23%)
Mutual labels:  pyspark
anovos
Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (+8.45%)
Mutual labels:  pyspark
aml-deploy
GitHub Action that allows you to deploy machine learning models in Azure Machine Learning.
Stars: ✭ 37 (-47.89%)
Mutual labels:  azure-machine-learning
OSCI
Open Source Contributor Index
Stars: ✭ 107 (+50.7%)
Mutual labels:  pyspark
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+567.61%)
Mutual labels:  pyspark
learn-by-examples
Real-world Spark pipelines examples
Stars: ✭ 84 (+18.31%)
Mutual labels:  pyspark
aml-keras-image-recognition
A sample Azure Machine Learning project for Transfer Learning-based custom image recognition by utilizing Keras.
Stars: ✭ 14 (-80.28%)
Mutual labels:  azure-machine-learning
jgit-spark-connector
jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.
Stars: ✭ 71 (+0%)
Mutual labels:  pyspark
az-ml-batch-score
Deploying a Batch Scoring Pipeline for Python Models
Stars: ✭ 17 (-76.06%)
Mutual labels:  azure-machine-learning
spark3D
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Stars: ✭ 23 (-67.61%)
Mutual labels:  pyspark
ceja
PySpark phonetic and string matching algorithms
Stars: ✭ 24 (-66.2%)
Mutual labels:  pyspark
Morphl Community Edition
MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization
Stars: ✭ 253 (+256.34%)
Mutual labels:  pyspark
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-64.79%)
Mutual labels:  pyspark
pyspark-for-data-processing
Code for my presentation: Using PySpark to Process Boat Loads of Data
Stars: ✭ 20 (-71.83%)
Mutual labels:  pyspark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+181.69%)
Mutual labels:  pyspark
Spark Iforest
Isolation Forest on Spark
Stars: ✭ 166 (+133.8%)
Mutual labels:  pyspark
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-45.07%)
Mutual labels:  pyspark
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+4625.35%)
Mutual labels:  pyspark
SeeingAI-Currency-Detection
This repository contains the code for the blogpost: How to Develop a Currency Detection Model using Azure Machine Learning
Stars: ✭ 39 (-45.07%)
Mutual labels:  azure-machine-learning
DataEngineering
This repo contains commands that data engineers use in day to day work.
Stars: ✭ 47 (-33.8%)
Mutual labels:  pyspark
Azure-Certification-DP-200
Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution
Stars: ✭ 54 (-23.94%)
Mutual labels:  azure-databricks
jupyterlab-sparkmonitor
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (+9.86%)
Mutual labels:  pyspark
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+1.41%)
Mutual labels:  pyspark
dlsa
Distributed least squares approximation (dlsa) implemented with Apache Spark
Stars: ✭ 25 (-64.79%)
Mutual labels:  pyspark
spark-twitter-sentiment-analysis
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Stars: ✭ 55 (-22.54%)
Mutual labels:  pyspark
pyspark-k8s-boilerplate
Boilerplate for PySpark on Cloud Kubernetes
Stars: ✭ 24 (-66.2%)
Mutual labels:  pyspark
soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (-18.31%)
Mutual labels:  pyspark
aml-workspace
GitHub Action that allows you to create or connect to your Azure Machine Learning Workspace.
Stars: ✭ 22 (-69.01%)
Mutual labels:  azure-machine-learning
AI-on-Microsoft-Azure
Microsoft buduje i tworzy Polską Dolinę Cyfrową. W ramach tej inicjatywy podjęliśmy się wyzwania zbudowania chmurowych kompetencji wśród 150tys osób w Polsce. Jednym z elementów tej inicjatywy jest dedykowany kurs na studiach inzynierskich i magisterskich na Politechnice Warszawskiej poświęcony chmurze obliczeniowej oraz sztucznej inteligencji.
Stars: ✭ 11 (-84.51%)
Mutual labels:  azure-machine-learning
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (+132.39%)
Mutual labels:  pyspark
Advanced-Databricks-for-ML-Build-2019
Using Azure Databricks (Spark) for ML, this is the //build 2019 repository with homework examples, code and notebooks
Stars: ✭ 13 (-81.69%)
Mutual labels:  azure-databricks
pyspark-cheatsheet
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (+61.97%)
Mutual labels:  pyspark
pyspark-cassandra
pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4
Stars: ✭ 70 (-1.41%)
Mutual labels:  pyspark
gallery
BentoML Example Projects 🎨
Stars: ✭ 120 (+69.01%)
Mutual labels:  azure-machine-learning
optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+1802.82%)
Mutual labels:  pyspark
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-76.06%)
Mutual labels:  pyspark
spark-dgraph-connector
A connector for Apache Spark and PySpark to Dgraph databases.
Stars: ✭ 36 (-49.3%)
Mutual labels:  pyspark
aml-compute
GitHub Action that allows you to attach, create and scale Azure Machine Learning compute resources.
Stars: ✭ 19 (-73.24%)
Mutual labels:  azure-machine-learning
Quinn
pyspark methods to enhance developer productivity 📣 👯 🎉
Stars: ✭ 217 (+205.63%)
Mutual labels:  pyspark
machine-learning-course
Machine Learning Course @ Santa Clara University
Stars: ✭ 17 (-76.06%)
Mutual labels:  pyspark
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+3983.1%)
Mutual labels:  pyspark
azureml-cheatsheets
Azure Machine Learning Cheat Sheets
Stars: ✭ 23 (-67.61%)
Mutual labels:  azure-machine-learning
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+3446.48%)
Mutual labels:  pyspark
Spark-for-data-engineers
Apache Spark for data engineers
Stars: ✭ 22 (-69.01%)
Mutual labels:  pyspark
Sparkora
Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-28.17%)
Mutual labels:  pyspark
ODSC India 2018
My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-63.38%)
Mutual labels:  pyspark
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-52.11%)
Mutual labels:  pyspark
lineage
Generate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-77.46%)
Mutual labels:  pyspark
check-engine
Data validation library for PySpark 3.0.0
Stars: ✭ 29 (-59.15%)
Mutual labels:  pyspark
oshinko-s2i
This is a place to put s2i images and utilities for spark application builders for openshift
Stars: ✭ 16 (-77.46%)
Mutual labels:  pyspark
1-60 of 108 similar projects