All Projects → khanhnamle1994 → Cracking The Data Science Interview

khanhnamle1994 / Cracking The Data Science Interview

A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Cracking The Data Science Interview

Web Database Analytics
Web scrapping and related analytics using Python tools
Stars: ✭ 175 (-73.96%)
Mutual labels:  jupyter-notebook, data-science, data-wrangling
Facet
Human-explainable AI.
Stars: ✭ 269 (-59.97%)
Mutual labels:  jupyter-notebook, data-science, statistics
Virgilio
Virgilio is developed and maintained by these awesome people. You can email us virgilio.datascience (at) gmail.com or join the Discord chat.
Stars: ✭ 13,200 (+1864.29%)
Mutual labels:  jupyter-notebook, data-science, statistics
Probabilistic Programming And Bayesian Methods For Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Stars: ✭ 23,912 (+3458.33%)
Mutual labels:  jupyter-notebook, data-science, statistics
Probability
Probabilistic reasoning and statistical analysis in TensorFlow
Stars: ✭ 3,550 (+428.27%)
Mutual labels:  jupyter-notebook, data-science, statistics
Aulas
Aulas da Escola de Inteligência Artificial de São Paulo
Stars: ✭ 166 (-75.3%)
Mutual labels:  jupyter-notebook, data-science, statistics
Datascienceprojects
The code repository for projects and tutorials in R and Python that covers a variety of topics in data visualization, statistics sports analytics and general application of probability theory.
Stars: ✭ 223 (-66.82%)
Mutual labels:  jupyter-notebook, data-science, statistics
Book
This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.
Stars: ✭ 141 (-79.02%)
Mutual labels:  jupyter-notebook, data-science, statistics
120 Ds Interview Questions
My Answer to 120 Data Science Interview Questions
Stars: ✭ 304 (-54.76%)
Mutual labels:  jupyter-notebook, data-science, statistics
Data Science Learning
Repository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.
Stars: ✭ 273 (-59.37%)
Mutual labels:  jupyter-notebook, data-science, statistics
Zigzag
Python library for identifying the peaks and valleys of a time series.
Stars: ✭ 156 (-76.79%)
Mutual labels:  jupyter-notebook, data-science, statistics
Edward2
A simple probabilistic programming language.
Stars: ✭ 419 (-37.65%)
Mutual labels:  jupyter-notebook, data-science, statistics
Machine Learning With Python
Practice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+226.93%)
Mutual labels:  jupyter-notebook, data-science, statistics
Covid19 Severity Prediction
Extensive and accessible COVID-19 data + forecasting for counties and hospitals. 📈
Stars: ✭ 170 (-74.7%)
Mutual labels:  jupyter-notebook, data-science, statistics
Data Science Question Answer
A repo for data science related questions and answers
Stars: ✭ 2,000 (+197.62%)
Mutual labels:  jupyter-notebook, data-science, statistics
Imodels
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
Stars: ✭ 194 (-71.13%)
Mutual labels:  jupyter-notebook, data-science, statistics
Bayesian Cognitive Modeling In Pymc3
PyMC3 codes of Lee and Wagenmakers' Bayesian Cognitive Modeling - A Pratical Course
Stars: ✭ 93 (-86.16%)
Mutual labels:  jupyter-notebook, data-science, statistics
Interactive machine learning
IPython widgets, interactive plots, interactive machine learning
Stars: ✭ 140 (-79.17%)
Mutual labels:  jupyter-notebook, data-science, statistics
Notebooks Statistics And Machinelearning
Jupyter Notebooks from the old UnsupervisedLearning.com (RIP) machine learning and statistics blog
Stars: ✭ 270 (-59.82%)
Mutual labels:  jupyter-notebook, data-science, statistics
Stats Maths With Python
General statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Stars: ✭ 381 (-43.3%)
Mutual labels:  jupyter-notebook, data-science, statistics

Here are the sections:

Data Science Cheatsheets

This section contains cheatsheets of basic concepts in data science that will be asked in interviews:

Data Science EBooks

This section contains books that I have read about data science and machine learning:

Data Science Question Bank

This section contains sample questions that were asked in actual data science interviews:

Data Science Case Studies

This section contains case study questions that concern designing machine learning systems to solve practical problems.

Data Science Portfolio

This section contains portfolio of data science projects completed by me for academic, self learning, and hobby purposes.

For a more visually pleasant experience for browsing the portfolio, check out jameskle.com/data-portfolio

  • Recommendation Systems

    • Transfer Rec: My ongoing research work that intersects deep learning and recommendation systems.

    • Movie Recommendation: Designed 4 different models that recommend items on the MovieLens dataset.

    Tools: PyTorch, TensorBoard, Keras, Pandas, NumPy, SciPy, Matplotlib, Seaborn, Scikit-Learn, Surprise, Wordcloud

  • Machine Learning

    • Trip Optimizer: Used XGBoost and evolutionary algorithms to optimize the travel time for taxi vehicles in New York City.

    • Instacart Market Basket Analysis: Tackled the Instacart Market Basket Analysis challenge to predict which products will be in a user's next order.

    Tools: Pandas, NumPy, Matplotlib, XGBoost, Geopy, Scikit-Learn

  • Computer Vision

    • Fashion Recommendation: Built a ResNet-based model that classifies and recommends fashion images in the DeepFashion database based on semantic similarity.

    • Fashion Classification: Developed 4 different Convolutional Neural Networks that classify images in the Fashion MNIST dataset.

    • Dog Breed Classification: Designed a Convolutional Neural Network that identifies dog breed.

    • Road Segmentation: Implemented a Fully-Convolutional Network for semantic segmentation task in the Kitty Road Dataset.

    Tools: TensorFlow, Keras, Pandas, NumPy, Matplotlib, Scikit-Learn, TensorBoard

  • Natural Language Processing

  • Data Analysis and Visualization

    • World Cup 2018 Team Analysis: Analysis and visualization of the FIFA 18 dataset to predict the best possible international squad lineups for 10 teams at the 2018 World Cup in Russia.

    • Spotify Artists Analysis: Analysis and visualization of musical styles from 50 different artists with a wide range of genres on Spotify.

    Tools: Pandas, NumPy, Matplotlib, Rspotify, httr, dplyr, tidyr, radarchart, ggplot2

Data Journalism Portfolio

This section contains portfolio of data journalism articles completed by me for freelance clients and self-learning purposes.

For a more visually pleasant experience for browsing the portfolio, check out jameskle.com/data-journalism

Downloadable Cheatsheets

These PDF cheatsheets come from BecomingHuman.AI.

1 - Neural Network Basics

Neural Network Basics

2 - Neural Network Graphs

Neural Network Graphs

3 - Machine Learning with Emojis

Machine Learning with Emojis

4 - Scikit-Learn With Python

Scikit-Learn With Python

5 - Python Basics

Python Basics

6 - NumPy Basics

NumPy Basics

7 - Pandas Basics

Pandas Basics

8 - Data Wrangling With Pandas

Data Wrangling With Pandas Part 1

Data Wrangling With Pandas Part 2

9 - SciPy Linear Algebra

SciPy Linear Algebra

10 - Matplotlib Basics

Matplotlib Basics

11 - Keras

Keras

12 - Big-O

Big-O

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].