All Projects → GoogleCloudPlatform → Data Science On Gcp

GoogleCloudPlatform / Data Science On Gcp

Licence: other
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017

Projects that are alternatives of or similar to Data Science On Gcp

Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+170.49%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-37.38%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-80.21%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Datasist
A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-85.76%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-18.52%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Data Science Portfolio
A Portfolio of my Data Science Projects
Stars: ✭ 149 (-82.75%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315 (-63.54%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+36%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-68.4%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Deep Learning Machine Learning Stock
Stock for Deep Learning and Machine Learning
Stars: ✭ 240 (-72.22%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+75.46%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Pydataroad
open source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (-65.05%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-86.81%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (-47.45%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-86.92%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Dtale
Visualizer for pandas data structures
Stars: ✭ 2,864 (+231.48%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+4834.49%)
Mutual labels:  data-science, data-analysis, data-engineering, data-visualization
Data Science Lunch And Learn
Resources for weekly Data Science Lunch & Learns
Stars: ✭ 49 (-94.33%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-74.77%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization
Cryptocurrency Analysis Python
Open-Source Tutorial For Analyzing and Visualizing Cryptocurrency Data
Stars: ✭ 278 (-67.82%)
Mutual labels:  jupyter-notebook, data-science, data-analysis, data-visualization

data-science-on-gcp

Source code accompanying book:

Data Science on the Google Cloud Platform
Valliappa Lakshmanan
O'Reilly, Jan 2017

Try out the code on Google Cloud Platform

Open in Cloud Shell

The code on Qwiklabs (see below) is continually tested, and this repo is kept up-to-date. The code should work as-is for you, however, there are three very common problems that readers report:

  • Ch 2: Download data fails. The Bureau of Transportation website to download the airline dataset periodically goes down or changes availability due to government furloughs and the like. Please use the instructions in 02_ingest/README.md to copy the data from my bucket. The rest of the chapters work off the data in the bucket, and will be fine.
  • Ch 3: Permission errors. These typically occur because we expect that you will copy the airline data to your bucket. You don't have write access to gs://cloud-training-demos-ml/. The instructions will tell you to change the bucket name to one that you own. Please do that.
  • Ch 4, 10: Dataflow doesn't do anything.. The real-time simulation requires that you simultaneously run simulate.py and the Dataflow pipeline. If the Dataflow pipeline is not progressing, make sure that the simulate program is still running.

If the code doesn't work for you, I recommend that you try the corresponding Qwiklab lab to see if there is some step that you missed. If you still have problems, please leave feedback in Qwiklabs, or file an issue in this repo.

Try out the code on Qwiklabs

Purchase book

Read on-line or download PDF of book

Buy on Amazon.com

Updates to book

I updated the book in Nov 2019 with TensorFlow 2.0, Cloud Functions, and BigQuery ML.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].