yokawasa / databricks-notebooks

Licence: MIT license
Collection of Sample Databricks Spark Notebooks (mostly for Azure Databricks)

Programming Languages

Jupyter Notebook
Shell

Projects that are alternatives of or similar to databricks-notebooks

architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (-29.82%)
Mutual labels:  databricks
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+4985.96%)
Mutual labels:  databricks
arthur-redshift-etl
ELT Code for your Data Warehouse
Stars: ✭ 22 (-61.4%)
Mutual labels:  elt
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+5785.96%)
Mutual labels:  databricks
Redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+35245.61%)
Mutual labels:  databricks
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+152.63%)
Mutual labels:  elt
databricks-dbapi
DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters
Stars: ✭ 21 (-63.16%)
Mutual labels:  databricks
Dbt
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Stars: ✭ 3,919 (+6775.44%)
Mutual labels:  elt
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+2919.3%)
Mutual labels:  databricks
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+38.6%)
Mutual labels:  elt
stowage
Bloat-free, no BS cloud storage SDK.
Stars: ✭ 85 (+49.12%)
Mutual labels:  databricks
databricksConnectDocker
Docker Images with Databricks Connect Ready to go
Stars: ✭ 19 (-66.67%)
Mutual labels:  databricks
rivery cli
Rivery CLI
Stars: ✭ 16 (-71.93%)
Mutual labels:  elt
azure.databricks.cicd.tools
Tools for Deploying Databricks Solutions in Azure
Stars: ✭ 87 (+52.63%)
Mutual labels:  databricks
dbd
dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Stars: ✭ 30 (-47.37%)
Mutual labels:  elt
mlops-platforms
Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...
Stars: ✭ 293 (+414.04%)
Mutual labels:  databricks
wikirepo
Python based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (-42.11%)
Mutual labels:  elt
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+8529.82%)
Mutual labels:  elt
alphasql
AlphaSQL provides Integrated Type and Schema Check and Parallelization for SQL file set mainly for BigQuery
Stars: ✭ 35 (-38.6%)
Mutual labels:  elt
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+731.58%)
Mutual labels:  elt

databricks-notebook

Collection of Sample Databricks Spark Notebooks (mostly for Azure Databricks)

Sample Notebooks

| Notebook | Description | Lang |
| --- | --- | --- |
| File Operations Sample | Various file operations, such as mounting and unmounting Azure Blob Storage, ls/rm/cp/mv, reading a CSV file, etc. | Python |
| ELT Sample: Azure Blob Storage - Databricks - CosmosDB | In this notebook, you extract data from Azure Blob Storage into a Databricks cluster, run transformations on the data in the cluster, and then load the transformed data into Azure Cosmos DB | Python |
| ELT Sample: Azure Blob Storage - Databricks - SQLDW | In this notebook, you extract data from Azure Blob Storage into a Databricks cluster, run transformations on the data in the cluster, and then load the transformed data into Azure SQL Data Warehouse | Python |
| Streaming Sample: Cosmos DB ChangeFeed - Databricks | In this notebook, you read a live stream of tweets stored in Cosmos DB by using Apache Spark to read the Cosmos DB Change Feed, and run transformations on the data in the Databricks cluster | Python |
| Streaming Sample: Azure Event Hub - Databricks | In this notebook, you connect Azure Event Hub (a hyper-scale data ingestion service) to Azure Databricks to stream data into an Apache Spark cluster in near real time | Python |
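
As a rough illustration of the Blob Storage mount step that the File Operations Sample covers, the sketch below builds the arguments one would typically pass to `dbutils.fs.mount`. It is not taken from the notebooks themselves: the helper functions, the container and account names, the mount point, and the CSV path are all hypothetical placeholders. The Databricks calls are left as comments because `dbutils` and `spark` exist only inside a Databricks notebook.

```python
from typing import Dict


def wasbs_source(container: str, account: str, path: str = "") -> str:
    """Build the wasbs:// URI for a container in an Azure Blob Storage account."""
    base = f"wasbs://{container}@{account}.blob.core.windows.net"
    return f"{base}/{path.lstrip('/')}" if path else base


def account_key_config(account: str, access_key: str) -> Dict[str, str]:
    """Build the configuration entry that carries the storage account access key."""
    return {f"fs.azure.account.key.{account}.blob.core.windows.net": access_key}


def mount_args(container: str, account: str, access_key: str, mount_point: str) -> dict:
    """Collect the keyword arguments for dbutils.fs.mount in one place."""
    return {
        "source": wasbs_source(container, account),
        "mount_point": mount_point,
        "extra_configs": account_key_config(account, access_key),
    }


# Hypothetical values; replace them with your own container, account, and key.
args = mount_args("sampledata", "mystorageaccount", "<access-key>", "/mnt/sampledata")

# Inside a Databricks notebook, where dbutils and spark are predefined:
#   dbutils.fs.mount(**args)                      # mount the container
#   display(dbutils.fs.ls("/mnt/sampledata"))     # list the mounted files
#   df = spark.read.csv("/mnt/sampledata/input.csv", header=True, inferSchema=True)
#   dbutils.fs.unmount("/mnt/sampledata")         # unmount when finished
```

Keeping the URI and configuration key in small helpers makes it easy to check the strings outside a cluster before running the mount. In practice the access key would usually come from a secret scope rather than a literal.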

Contributing

Bug reports and pull requests are welcome on GitHub at https://github.com/yokawasa/databricks-notebook