All Projects → pdeguzman96 → Data_cleaning_workshop

pdeguzman96 / Data_cleaning_workshop

Created the contents of this repo originally for a workshop I gave at UCLA

Projects that are alternatives of or similar to Data cleaning workshop

Cheat Sheets
A cheat sheet can be really helpful when you're trying a set of exercises related to a specific topic, or working on a project. Because you can only fit so much information on a single sheet of paper, most cheat sheets are a simple listing of syntax rules. This set of cheat sheets aims to remind you of syntax rules, but also remind you of important concepts as well.
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Ox Ipynb
org-mode exporter to Jupyter notebooks
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Data Exploration
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Machinelearning
A repo with tutorials for algorithms from scratch
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Ngsim env
Learning human driver models from NGSIM data with imitation learning.
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Pythonplot.com
📈 Interactive comparison of Python plotting libraries for exploratory data analysis. Examples of using Pandas plotting, plotnine, Seaborn, and Matplotlib.
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Mimic Cxr
Code, documentation, and discussion around the MIMIC-CXR database
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Publicservant ai
Stars: ✭ 97 (+0%)
Mutual labels:  jupyter-notebook
Ml At Work
電子書籍『仕事ではじめる機械学習』のサポートリポジトリです
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
How to do style transfer in tensorflow
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Python Cheatsheet
Basic Cheat Sheet for Python (PDF, Markdown and Jupyter Notebook)
Stars: ✭ 1,334 (+1275.26%)
Mutual labels:  jupyter-notebook
Objdet train tensorflow colab
Google Colab (Jupyter) notebook to retrain Object Detection Tensorflow model with custom dataset.
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Holiday Similarity
Finding similar images in the Holidays dataset
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Rexgen direct
Template-free prediction of organic reaction outcomes
Stars: ✭ 95 (-2.06%)
Mutual labels:  jupyter-notebook
Nobslanotebooks
Jupyter notebooks with exercises for the No bullshit guide to linear algebra.
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Bert Token Embeddings
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Ubernet
Demo for CVPR 2017 paper
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook
Graphnn Multi Object Tracking
Unofficial PyTorch implementation of "Learning a Neural Solver for Multiple Object Tracking"
Stars: ✭ 97 (+0%)
Mutual labels:  jupyter-notebook
Stock cnn blog pub
This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"
Stars: ✭ 97 (+0%)
Mutual labels:  jupyter-notebook
Ijcai cup 2017
IJCAI-17 口碑商家客流量预测
Stars: ✭ 96 (-1.03%)
Mutual labels:  jupyter-notebook

Machine Learning Project Checklist

Summary: This checklist was created to help ML students/practitioners structure their projects and problems in a way that makes sense to me.


When I just got started learning Python for Machine Learning and worked on my first few projects, I found it very overwhelming because...

  • it was difficult to remember all of the steps I needed to take in order to make my data ML-friendly,
  • I couldn't easily remember the functions, methods, and estimators from pandas, numpy, and sklearn, and
  • it was tedious and time-consuming to try to understand large (>50 feature) datasets

So, I created the ML checklist (Pictured Below) to be a handy tool for whenever I start to feel lost creating an ML project.

Machine Learning Checklist

In this repo, I also created...

  1. ml_project_checklist_template.ipynb: (Pictured below) a Jupyter .ipynb that you can use as a template for your project or Kaggle competition
  2. data_cleaning_for_ml_lab_EXERCISES.ipynb: An exercises/lab that you can finish for data cleaning practice, originally made for a workshop that I gave
  3. data_cleaning_for_ml_lab_SOLUTIONS.ipynb: A solutions file for the exercises I give above
  4. boston.csv and cambridge.csv: Airbnb datasets from here used for the exercises
  5. I also included a PDF version of the checklist.

iPynb Template


I hope you find these resources as useful as I do!

Happy learning :).

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].