All Projects → EricSchles → Datascience_book

EricSchles / Datascience_book

Projects that are alternatives of or similar to Datascience book

Droneblocks Tello Python
A DroneBlocks course on drone programming with Tello using Python scripts
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Scipy 2014 julia
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Physlight
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Keras Gradcam
Keras implementation of GradCAM.
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Keras Tutorial
Tutorial teaching the basics of Keras and some deep learning concepts
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Objectron
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
Stars: ✭ 1,352 (+1265.66%)
Mutual labels:  jupyter-notebook
Hmm
An implementation of the Viterbi Algorithm for training Hidden Markov models. This repo accompanies the video found here: https://www.youtube.com/watch?v=kqSzLo9fenk
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Hands On Exploratory Data Analysis With Python
Hands-on Exploratory Data Analysis with Python, published by Packt
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Ds For Telco
Source material for Data Science for Telecom Tutorial at Strata Singapore 2015
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Linear algebra with python
Lecture Notes for Linear Algebra Featuring Python
Stars: ✭ 1,355 (+1268.69%)
Mutual labels:  jupyter-notebook
Interaction network pytorch
Pytorch Implementation of Interaction Networks for Learning about Objects, Relations and Physics
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Estimation Of Remaining Useful Life Using Cnn
Convolutional Neural Network based regression approach for estimating machinery's remaining useful life
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Almond
A Scala kernel for Jupyter
Stars: ✭ 1,354 (+1267.68%)
Mutual labels:  jupyter-notebook
Bigdata
NJU Master Course **Big Data Mining and Analysis**
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Kmeans pytorch
kmeans using PyTorch
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Pytorch learning
书籍:深度学习框架pytorch入门与实践
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Cbe20255
Introduction to Chemical Engineering Analysis
Stars: ✭ 98 (-1.01%)
Mutual labels:  jupyter-notebook
Pytorch Bert Document Classification
Enriching BERT with Knowledge Graph Embedding for Document Classification (PyTorch)
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Gp regression
A Primer on Gaussian Processes for Regression Analysis (PyData NYC 2019)
Stars: ✭ 99 (+0%)
Mutual labels:  jupyter-notebook
Nab
The Numenta Anomaly Benchmark
Stars: ✭ 1,352 (+1265.66%)
Mutual labels:  jupyter-notebook

Programming Probabilistically: An introduction to the world of data science

By

Eric Schles

Hello and welcome to my book! You'll find the following sections:

  1. Descriptive Statistics and Hypothesis testing
  2. Applied Statistical Tests - A/B testing
  3. Regression Introduction
  4. Classification Introduction
  5. Information Theory, Entropy and Tree Models
  6. Neural Network Models

Each section covers about 4 to 5 chapters worth of materail broken out into:

  • Basics
  • Mathematical Intuition
  • Implementation
  • Typical API
  • Advanced Use Cases

Sections to come:

  • Reinforcement Learning
  • Engineering for Data Science
  • Text Processing
  • Image Processing
  • Support Vector Machines
  • Genetic Algorithms
  • neural network optimizers
  • Recommender Systems
  • A/B testing and other related workflows
  • SQL best practice
  • Timeseries Forecasting and Analysis
  • Geospatial Analysis
  • Geospatial and Timeseries forecasting
  • Video Processing
  • Building Data Dashboards
  • Working With Search
  • Building An OCR System
  • Advanced Python Usage
  • Active Learning
  • Recurrent Neural Networks
  • Convolutional Neural Networks
  • Capsule Networks
  • Adversarial Machine Learning
  • Open World - in distribution out of distribution
  • Bayesian Machine Learning
  • Graph Based Neural Networks
  • Monitoring
  • Working with Spark
  • Working with Streaming Data
  • Ensembling - scikit learn ensembling strategies
  • Random Forests
  • Additive models:
    • Gradient boosted trees
    • splines
    • General Additive Models
    • adaboost
  • explainability metrics
    • litany of examples
    • showing when and how they can fail
  • Metrics
  • Hyper parameter tunning
  • Randomness in your models
  • Counterfactual examples
  • testing in machine learning applications

To Dos

  • fix Decision Tree Implementation
  • add SVM chapter
  • add dimensionality reduction chapter
  • add clustering chapter
  • add RNN chapter
  • add conv net chapter
  • discuss attention
  • create engineering productionization chapter
  • hypothesis test as a ticket within engineering scrum context
  • reproducibility of results
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].