All Projects → pablo14 → Data Science Live Book

pablo14 / Data Science Live Book

Licence: cc-by-sa-4.0
An open source book to learn data science, data analysis and machine learning, suitable for all ages!

Projects that are alternatives of or similar to Data Science Live Book

Pachyderm
Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+2648.7%)
Mutual labels:  data-science, analytics, data-analysis, big-data
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-44.56%)
Mutual labels:  data-science, statistics, data-analysis, big-data
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-43.52%)
Mutual labels:  data-science, data-analysis, big-data
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+859.07%)
Mutual labels:  data-science, statistics, data-analysis
Covid19 Severity Prediction
Extensive and accessible COVID-19 data + forecasting for counties and hospitals. 📈
Stars: ✭ 170 (-11.92%)
Mutual labels:  data-science, statistics, data-analysis
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+593.26%)
Mutual labels:  data-science, data-analysis, big-data
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+21990.16%)
Mutual labels:  data-science, analytics, data-analysis
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-11.4%)
Mutual labels:  data-science, data-analysis, learning
Hyperlearn
50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+523.83%)
Mutual labels:  data-science, statistics, data-analysis
Interactive machine learning
IPython widgets, interactive plots, interactive machine learning
Stars: ✭ 140 (-27.46%)
Mutual labels:  data-science, statistics, analytics
Deeplearning Notes
Notes for Deep Learning Specialization Courses led by Andrew Ng.
Stars: ✭ 126 (-34.72%)
Mutual labels:  data-science, statistics, data-analysis
Uci Ml Api
Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)
Stars: ✭ 190 (-1.55%)
Mutual labels:  data-science, statistics, learning
Bayesian Cognitive Modeling In Pymc3
PyMC3 codes of Lee and Wagenmakers' Bayesian Cognitive Modeling - A Pratical Course
Stars: ✭ 93 (-51.81%)
Mutual labels:  data-science, statistics, data-analysis
Scikit Learn
scikit-learn: machine learning in Python
Stars: ✭ 48,322 (+24937.31%)
Mutual labels:  data-science, statistics, data-analysis
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-59.07%)
Mutual labels:  data-science, data-analysis, big-data
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-41.45%)
Mutual labels:  data-science, data-analysis, big-data
Virgilio
Virgilio is developed and maintained by these awesome people. You can email us virgilio.datascience (at) gmail.com or join the Discord chat.
Stars: ✭ 13,200 (+6739.38%)
Mutual labels:  data-science, statistics, learning
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-64.25%)
Mutual labels:  analytics, data-analysis, big-data
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+508.81%)
Mutual labels:  data-science, data-analysis, big-data
Mlr
Machine Learning in R
Stars: ✭ 1,542 (+698.96%)
Mutual labels:  data-science, statistics, predictive-modeling

Data Science Live Book

Data Science Live Book

tl;dr: Hi there! I invite you to read the book online and/or download here. Thanks and have a nice day :)

Paperback & Kindle at Amazon

This book is now available at Amazon in [Kindle]( Link: http://a.co/d/dIj1XwD) Black & White and color 📗 🚀.

LiIt can be shipped to over 100 countries. 🌎

Also available in PPDF :)

!(tl;dr): An overview...

It's a book to learn data science, machine learning, data analysis with tons of examples and explanations around several topics like:

  • Exploratory data analysis
  • Data preparation
  • Selecting best variables
  • Model performance

Most of the written R code can be used in real scenarios! I worked on the funModeling R package at the same time, so it is used many times in the book.


How about some examples?

It's a playbook with full of data preparation receipts.

I.e. in the missing values chapter you'll find how to input and convert these values into something useful for both, analysis and predictive modeling.

Other example, in the outliers chapter you'll get to know to some methods that spot outliers based on different criteria; funModeling contains a function that can help to process all data at once...

Or more conceptually, we have a numeric variable and we need to convert it into categorical, or vice-versa, do we have to convert or just leave it as it comes?

And so on and so on...


Book's philosophy

The book has all of its chapters interrelated, so you can start by any of them. My apologies if the number of links distracts from the reading. I wanted it that way just to show how all the machine learning concepts are somehow related.

There is a lot of effort in justifying what the book states. Yet, this is not enough, the reader can replicate and improve the examples, and thus generate their own knowledge.

To develop a critical thinking, without taking any statement as the "truly truth", it?s really important in this sea of books, courses, videos and any kind of technical material to learn. This book is just another view in the data science perspective.


I put some random errors...

... both technical and grammatical, the problem is I don't know where! So if you want to raise your hand and shout: "That's not correct! I think the correct form is... {replace-with-your-detailed-answer-here}", I invite you to report on the github repository, or email me at pcasas.biz -at- gmail.com


Download the PDF, epub and Kindle version!

If you learn anything new with this book, or it helped you somehow to saving time at your work, you can support the project by acquiring the portable version. (name your price starting at US$ 5)

There is no difference between the portable and web versions :)

After the purchase you'll will receive an email to download it in the three formats.

Download here!



Keep in touch: @pabloc_ds.

~ Thanks for reading !.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].