All Projects → amitkaps → Hackermath

amitkaps / Hackermath

Licence: mit
Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Hackermath

Basic Mathematics For Machine Learning
The motive behind Creating this repo is to feel the fear of mathematics and do what ever you want to do in Machine Learning , Deep Learning and other fields of AI
Stars: ✭ 300 (-78.26%)
Mutual labels:  jupyter-notebook, statistics, linear-algebra, calculus
Quant Notes
Quantitative Interview Preparation Guide, updated version here ==>
Stars: ✭ 180 (-86.96%)
Mutual labels:  jupyter-notebook, statistics, linear-algebra, calculus
data-science-notes
Open-source project hosted at https://makeuseofdata.com to crowdsource a robust collection of notes related to data science (math, visualization, modeling, etc)
Stars: ✭ 52 (-96.23%)
Mutual labels:  statistics, calculus, linear-algebra
Machine Learning Curriculum
Complete path for a beginner to become a Machine Learning Scientist!
Stars: ✭ 279 (-79.78%)
Mutual labels:  statistics, linear-algebra, calculus
Statistical Rethinking With Python And Pymc3
Python/PyMC3 port of the examples in " Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath
Stars: ✭ 713 (-48.33%)
Mutual labels:  jupyter-notebook, statistics
Notes
Notes On Using Data Science & Artificial Intelligence To Fight For Something That Matters.
Stars: ✭ 710 (-48.55%)
Mutual labels:  jupyter-notebook, statistics
Owl
Owl - OCaml Scientific and Engineering Computing @ http://ocaml.xyz
Stars: ✭ 919 (-33.41%)
Mutual labels:  statistics, linear-algebra
Helioml
A book about machine learning, statistics, and data mining for heliophysics
Stars: ✭ 36 (-97.39%)
Mutual labels:  jupyter-notebook, statistics
Smile
Statistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (+292.17%)
Mutual labels:  statistics, linear-algebra
Dominhhai.github.io
My Blog
Stars: ✭ 8 (-99.42%)
Mutual labels:  jupyter-notebook, statistics
Mlj.jl
A Julia machine learning framework
Stars: ✭ 982 (-28.84%)
Mutual labels:  jupyter-notebook, statistics
Fecon235
Notebooks for financial economics. Keywords: Jupyter notebook pandas Federal Reserve FRED Ferbus GDP CPI PCE inflation unemployment wage income debt Case-Shiller housing asset portfolio equities SPX bonds TIPS rates currency FX euro EUR USD JPY yen XAU gold Brent WTI oil Holt-Winters time-series forecasting statistics econometrics
Stars: ✭ 708 (-48.7%)
Mutual labels:  jupyter-notebook, statistics
Cracking The Data Science Interview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Stars: ✭ 672 (-51.3%)
Mutual labels:  jupyter-notebook, statistics
Causal Inference For Data Scientists
Notebooks of Python and R code which illustrates basic causal inference using simulated data
Stars: ✭ 17 (-98.77%)
Mutual labels:  jupyter-notebook, statistics
Machine learning tutorials
Code, exercises and tutorials of my personal blog ! 📝
Stars: ✭ 601 (-56.45%)
Mutual labels:  jupyter-notebook, statistics
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+503.55%)
Mutual labels:  jupyter-notebook, statistics
25daysinmachinelearning
I will update this repository to learn Machine learning with python with statistics content and materials
Stars: ✭ 53 (-96.16%)
Mutual labels:  jupyter-notebook, statistics
Ppd599
USC urban data science course series with Python and Jupyter
Stars: ✭ 1,062 (-23.04%)
Mutual labels:  jupyter-notebook, statistics
Datacamp
🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-95%)
Mutual labels:  jupyter-notebook, statistics
Openml R
R package to interface with OpenML
Stars: ✭ 81 (-94.13%)
Mutual labels:  jupyter-notebook, statistics

HackerMath for Machine Learning

“Study hard what interests you the most in the most undisciplined, irreverent and original manner possible.” ― Richard Feynman

Math literacy, including proficiency in Linear Algebra and Statistics,is a must for anyone pursuing a career in data science. The goal of this workshop is to introduce some key concepts from these domains that get used repeatedly in data science applications. Our approach is what we call the “Hacker’s way”. Instead of going back to formulae and proofs, we teach the concepts by writing code. And in practical applications. Concepts don’t remain sticky if the usage is never taught.

The focus will be on depth rather than breadth. Three areas are chosen - Hypothesis Testing, Supervised Learning and Unsupervised Learning. They will be covered to sufficient depth - 50% of the time will be on the concepts and 50% of the time will be spent coding them.

More details at http://amitkaps.com/hackermath

See it in action: Binder

Module #1: Hypothesis Testing

Math Concepts

  • Basic Metrics: Mean, Variance, Covariance, Correlation
  • Discrete Probability Distributions: Bernoulli, Binomial
  • Cumulative Mass Function, Probability Mass Function
  • Continuous Probability Distributions: Poisson, Uniform, Normal, Beta, Gamma
  • Cumulative Distribution Function, Probability Density Function

ML Applications

  • Direct Simulation
  • Shuffling
  • Bootstrapping
  • Application to A/B Testing

Module #2: Supervised Learning

Math Concepts

  • Basics of Matrix Operation
  • Matrix Determinant, Inverse
  • Basics of Linear Algebra
  • Solve for Ax=b for nxn
  • Solve for Ax=b for nxp+1

ML Applications

  • Linear Regression
  • L2 Regularization
  • Gradient Descent
  • Linear Classifier
  • Logistic Regression

Module #3: Unsupervised Learning

Math Concepts

  • Matrix Projections
  • Solve for Ax=λx for nxn
  • Eigenvectors & Eigenvalues
  • Distance in Vector Space

ML Applications

  • Dimensionality Reduction
  • Principle Component Analysis
  • Cluster Analysis

Target Audience

  • Someone with a background in programming who wants to pick the math needed for data science and get a flavor for different data science problems
  • Someone who is a beginner in data science or has been doing data analysis (at least using Excel at a minimum) and wants to pick skills to take the next step in their data science career

Pre-requisites

  • Having a basic understanding of linear algebra would help. And we know you may have forgotten all about it from your school or college days. So here is an amazing video playlist by @3blue1brown to learn The Essence of Linear Algebra in a very visual way.
  • Also, a touch of calculus knowledge would make it also easier. So if you want to brush up your basic calculus skills, then @3blue1brown has another amazing video playlist to learn The Essence of Calculus in a very visual way.
  • Programming knowledge is mandatory. You should, at the bare minimum, be able to write conditional statements, use loops, be comfortable writing functions and be able to understand code snippets and come up with programming logic. Since we will be using Python - brush up your basics there. Specifically, we expect you to know the first three sections from this: http://anandology.com/python-practice-book/

Software Requirements

You will require the Python data stack for the workshop. Please install Ananconda for Python 3.5 for the workshop. That has everything we need for the workshop. For attendees more curious, we will be using Jupyter Notebook as our IDE. We will be introducing numpy, scipy, seaborn, matplotlib, plotnine, statsmodel and scikit-learn.

The working repo for this workshop is at https://github.com/amitkaps/hackermath/


Authors:

Amit Kapoor

Bargava Subramanian

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].