All Projects → GoogleCloudDataproc → Cloud Dataproc

GoogleCloudDataproc / Cloud Dataproc

Licence: apache-2.0
Cloud Dataproc: Samples and Utils

Projects that are alternatives of or similar to Cloud Dataproc

Accelerated dl pytorch
Accelerated Deep Learning with PyTorch at Jupyter Day Atlanta II
Stars: ✭ 129 (+0.78%)
Mutual labels:  jupyter-notebook
Object detection demo
How to train an object detection model easy for free
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Stldecompose
A Python implementation of Seasonal and Trend decomposition using Loess (STL) for time series data.
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Numba tutorial scipy2016
Numba tutorial materials for Scipy 2016
Stars: ✭ 129 (+0.78%)
Mutual labels:  jupyter-notebook
Mml Companion
This is a companion to the ‘Mathematical Foundations’ section of the book, Mathematics for Machine Learning by Marc Deisenroth, Aldo Faisal and Cheng Ong, written in python for Jupyter Notebook.
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Slides Scipyconf 2018
A repository for public storage of slides given at the 17th Python in Science Conferences (2018)
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Pytorch Book
Source codes for the book "Application of Neural Network and PyTorch"
Stars: ✭ 129 (+0.78%)
Mutual labels:  jupyter-notebook
Deep Learning
This repository contains Deep Learning examples using Tensorflow. This repository will be useful for Deep Learning starters who find difficulty in understanding the example codes.
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Citylearn
Official reinforcement learning environment for demand response and load shaping
Stars: ✭ 129 (+0.78%)
Mutual labels:  jupyter-notebook
Inferpy
InferPy: Deep Probabilistic Modeling with Tensorflow Made Easy
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Siamese net
This package shows how to train a siamese network using Lasagne and Theano and includes network definitions for state-of-the-art networks including: DeepID, DeepID2, Chopra et. al, and Hani et. al. We also include one pre-trained model using a custom convolutional network.
Stars: ✭ 129 (+0.78%)
Mutual labels:  jupyter-notebook
Mltutorial
Machine Learning Tutorial in IPython Notebooks
Stars: ✭ 129 (+0.78%)
Mutual labels:  jupyter-notebook
Download Celeba Hq
Python script to download the celebA-HQ dataset from google drive
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Stars: ✭ 1,848 (+1343.75%)
Mutual labels:  jupyter-notebook
Tutorials
DEPRECATED - DO NOT USE
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Reptile Pytorch
A PyTorch implementation of OpenAI's REPTILE algorithm
Stars: ✭ 129 (+0.78%)
Mutual labels:  jupyter-notebook
Regularized Linear Autoencoders
Loss Landscapes of Regularized Linear Autoencoders
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Cn Machine Learning
https://cn.udacity.com/mlnd/
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook
Sometimes deep sometimes learning
A collection of DL experiments and notes
Stars: ✭ 129 (+0.78%)
Mutual labels:  jupyter-notebook
Waterfall
An easy to use waterfall chart function for Python
Stars: ✭ 130 (+1.56%)
Mutual labels:  jupyter-notebook

Google Cloud Dataproc

This repository contains code and documentation for use with Google Cloud Dataproc.

Samples in this Repository

  • codelabs/opencv-haarcascade provides the source code for the OpenCV Dataproc Codelab, which demonstrates a Spark job that adds facial detection to a set of images.
  • codelabs/spark-bigquery provides the source code for the PySpark for Preprocessing BigQuery Data Codelab, which demonstrates using PySpark on Cloud Dataproc to process data from BigQuery.
  • codelabs/spark-nlp provides the source code for the PySpark for Natural Language Processing Codelab, which demonstrates using spark-nlp library for Natural Language Processing.
  • notebooks/python provides example Jupyter notebooks to demonstrate using PySpark with the BigQuery Storage Connector and the Spark GCS Connector
  • spark-tensorflow provides an example of using Spark as a preprocessing toolchain for Tensorflow jobs. Optionally, it demonstrates the spark-tensorflow-connector to convert CSV files to TFRecords.
  • spark-translate provides a simple demo Spark application that translates words using Google's Translation API and running on Cloud Dataproc.

See each directories README for more information.

Additional Dataproc Repositories

You can find more Dataproc resources in these github repositories:

Dataproc projects

Connectors

Kubernetes Operators

Examples

For more information

For more information, review the Dataproc documentation. You can also pose questions to the Stack Overflow community with the tag google-cloud-dataproc. See our other Google Cloud Platform github repos for sample applications and scaffolding for other frameworks and use cases.

Contributing changes

Licensing

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].