GoogleCloudDataproc / Cloud Dataproc
Licence: apache-2.0
Cloud Dataproc: Samples and Utils
Stars: ✭ 128
Labels
Projects that are alternatives of or similar to Cloud Dataproc
Accelerated dl pytorch
Accelerated Deep Learning with PyTorch at Jupyter Day Atlanta II
Stars: ✭ 129 (+0.78%)
Mutual labels: jupyter-notebook
Object detection demo
How to train an object detection model easy for free
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Stldecompose
A Python implementation of Seasonal and Trend decomposition using Loess (STL) for time series data.
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Numba tutorial scipy2016
Numba tutorial materials for Scipy 2016
Stars: ✭ 129 (+0.78%)
Mutual labels: jupyter-notebook
Mml Companion
This is a companion to the ‘Mathematical Foundations’ section of the book, Mathematics for Machine Learning by Marc Deisenroth, Aldo Faisal and Cheng Ong, written in python for Jupyter Notebook.
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Slides Scipyconf 2018
A repository for public storage of slides given at the 17th Python in Science Conferences (2018)
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Pytorch Book
Source codes for the book "Application of Neural Network and PyTorch"
Stars: ✭ 129 (+0.78%)
Mutual labels: jupyter-notebook
Deep Learning
This repository contains Deep Learning examples using Tensorflow. This repository will be useful for Deep Learning starters who find difficulty in understanding the example codes.
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Citylearn
Official reinforcement learning environment for demand response and load shaping
Stars: ✭ 129 (+0.78%)
Mutual labels: jupyter-notebook
Inferpy
InferPy: Deep Probabilistic Modeling with Tensorflow Made Easy
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Siamese net
This package shows how to train a siamese network using Lasagne and Theano and includes network definitions for state-of-the-art networks including: DeepID, DeepID2, Chopra et. al, and Hani et. al. We also include one pre-trained model using a custom convolutional network.
Stars: ✭ 129 (+0.78%)
Mutual labels: jupyter-notebook
Mltutorial
Machine Learning Tutorial in IPython Notebooks
Stars: ✭ 129 (+0.78%)
Mutual labels: jupyter-notebook
Download Celeba Hq
Python script to download the celebA-HQ dataset from google drive
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Stars: ✭ 1,848 (+1343.75%)
Mutual labels: jupyter-notebook
Reptile Pytorch
A PyTorch implementation of OpenAI's REPTILE algorithm
Stars: ✭ 129 (+0.78%)
Mutual labels: jupyter-notebook
Regularized Linear Autoencoders
Loss Landscapes of Regularized Linear Autoencoders
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Cn Machine Learning
https://cn.udacity.com/mlnd/
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Sometimes deep sometimes learning
A collection of DL experiments and notes
Stars: ✭ 129 (+0.78%)
Mutual labels: jupyter-notebook
Waterfall
An easy to use waterfall chart function for Python
Stars: ✭ 130 (+1.56%)
Mutual labels: jupyter-notebook
Google Cloud Dataproc
This repository contains code and documentation for use with Google Cloud Dataproc.
Samples in this Repository
-
codelabs/opencv-haarcascade
provides the source code for the OpenCV Dataproc Codelab, which demonstrates a Spark job that adds facial detection to a set of images. -
codelabs/spark-bigquery
provides the source code for the PySpark for Preprocessing BigQuery Data Codelab, which demonstrates using PySpark on Cloud Dataproc to process data from BigQuery. -
codelabs/spark-nlp
provides the source code for the PySpark for Natural Language Processing Codelab, which demonstrates using spark-nlp library for Natural Language Processing. -
notebooks/python
provides example Jupyter notebooks to demonstrate using PySpark with the BigQuery Storage Connector and the Spark GCS Connector -
spark-tensorflow
provides an example of using Spark as a preprocessing toolchain for Tensorflow jobs. Optionally, it demonstrates the spark-tensorflow-connector to convert CSV files to TFRecords. -
spark-translate
provides a simple demo Spark application that translates words using Google's Translation API and running on Cloud Dataproc.
See each directories README for more information.
Additional Dataproc Repositories
You can find more Dataproc resources in these github repositories:
Dataproc projects
Connectors
- Hadoop/Spark GCS Connector
- Spark BigQuery Connector
- Hadoop BigQuery Connector
- Spark Pubsub Connector
- Spark Spanner Connector
- Hive Bigquery Storage Handler
Kubernetes Operators
Examples
- Dataproc Python examples
- Dataproc Pubsub Spark Streaming example
- Dataproc Java Bigtable sample
- Dataproc Spark-Bigtable samples
For more information
For more information, review the Dataproc
documentation. You can also
pose questions to the Stack
Overflow community
with the tag google-cloud-dataproc
.
See our other Google Cloud Platform github
repos for sample applications and
scaffolding for other frameworks and use cases.
Contributing changes
- See CONTRIBUTING.md
Licensing
- See LICENSE
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].