huseinzol05 / Machine Learning Data Science Reuse

Licence: MIT
Gathers machine learning and data science techniques for problem solving.

Machine-Learning-Data-Science-Reuse

Warning

THIS REPOSITORY LACKS COMMENTS, DOCUMENTATION AND STORYTELLING. IT IS INTENDED FOR SELF-REUSE.

Most visualizations are self-explanatory, but they require at least a basic understanding of statistics and Python.

Some visualizations will not display because GitHub cannot render the output of certain SVG-based libraries, so please run the code on your own machine to see the results.

Why Genie? Because he can solve anything!

Table of contents

R vs Python

  1. CSV, Data Manipulation, Visualization

Preprocessing

  1. Handle missing values
  2. Rescaling (log, vector normalization, standardization, min-max scaling, Box-Cox)
  3. Feature understanding
  4. Detecting outliers
  5. Encoding type comparison
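
As a rough illustration of the rescaling item above, here is a minimal standard-library sketch of four of the listed transforms. The function names and the sample values are assumptions made for this example and are not taken from the repository; Box-Cox is left out because it needs a fitted parameter.

```python
import math
import statistics


def min_max_scale(values):
    """Map values linearly onto [0, 1]; a constant input maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def standardize(values):
    """Shift to zero mean and scale by the population standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


def log_scale(values):
    """Apply log(1 + x), which is defined only for x > -1."""
    return [math.log1p(v) for v in values]


def l2_normalize(values):
    """Scale the vector to unit Euclidean length (vector normalization)."""
    norm = math.sqrt(sum(v * v for v in values))
    if norm == 0:
        return [0.0 for _ in values]
    return [v / norm for v in values]


# Hypothetical sample data, used only to exercise the helpers.
prices = [10.0, 20.0, 30.0, 40.0, 50.0]
scaled = min_max_scale(prices)
z_scores = standardize(prices)
```

Min-max scaling and standardization are both sensitive to outliers, which is one reason a log transform is often tried first on heavily skewed features.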

Natural Language Processing

  1. Bag of Words
  2. TF-IDF
  3. Hashing algorithm
  4. Model gathering (Bayes, SVM, XGB, LightGBM)
  5. sklearn pipeline
  6. N-gram
  7. Topic Modelling
  8. Naive-Bayes-SVM on hate speech
  9. Black Panther visualization using word clouds, semantic and k-means similarity network
  10. Semantic similarity on Malaysia hot topics
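
The first two items in this list, bag of words and TF-IDF, can be sketched in plain Python as follows. This is an illustrative sketch only: the whitespace tokenizer, the function names and the toy documents are assumptions, and it uses the unsmoothed weight tf * log(N / df), whereas libraries such as scikit-learn default to a smoothed variant.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def bag_of_words(docs):
    """Return one term-count mapping per document."""
    return [Counter(tokenize(doc)) for doc in docs]


def tf_idf(docs):
    """Return one {term: weight} dict per document using tf * log(N / df)."""
    counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for count in counts:
        doc_freq.update(count.keys())
    weights = []
    for count in counts:
        total = sum(count.values())
        weights.append({
            term: (n / total) * math.log(n_docs / doc_freq[term])
            for term, n in count.items()
        })
    return weights


# Toy corpus, made up for this example.
corpus = ["the cat sat", "the dog barked"]
weights = tf_idf(corpus)
```

A term that occurs in every document, such as "the" in the toy corpus, gets weight zero under this definition, which is the behaviour that separates TF-IDF from raw counts.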

Suggestion Engine using Nearest-Euclidean and Gaussian Distribution

  1. Anime
  2. Game
  3. Movie
  4. Kickstarter projects
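
The nearest-Euclidean half of the approach named in the heading above can be sketched as a ranking by distance between feature vectors. The catalog layout, the `suggest` helper and the sample vectors are hypothetical, and the Gaussian-distribution part is not shown because the list does not say how it is applied.

```python
import math


def suggest(target, catalog, k=3):
    """Return the k item names whose vectors lie closest to the target's.

    catalog maps an item name to a numeric feature vector; every vector
    must have the same length.
    """
    target_vec = catalog[target]
    others = [(name, vec) for name, vec in catalog.items() if name != target]
    # Sort by Euclidean distance to the target; ties keep insertion order.
    ranked = sorted(others, key=lambda item: math.dist(target_vec, item[1]))
    return [name for name, _ in ranked[:k]]


# Hypothetical two-feature catalog, for illustration only.
catalog = {
    "A": (1.0, 0.0),
    "B": (0.9, 0.1),
    "C": (0.0, 1.0),
}
nearest = suggest("A", catalog, k=2)
```

Features on very different scales will dominate the distance, so rescaling the vectors first is usually worthwhile.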

Image processing

  1. Augmentation (flip, rotate, shifting, zoom, shear, channel shift, grayscale, contrast, saturation)
  2. RGB subdivide
  3. HOG features
  4. Image segmentation, nucleus
  5. K Nearest Neighbors on PCA / NMF
  6. SVD study on nearest neighbors
  7. Image warping to full A4

Signal processing

  1. Blurring on 1D Signal (loop, and FFT)
  2. Blurring on 2D Signal (loop)
  3. Convolving two signals
  4. Pass filter for frequencies
  5. Signal smoothing
  6. Signal cross-correlation
  7. Augmentation (pitching, speed, distribution noise, shifting, silent shifting)
  8. Featuring (mfcc, log-energy, feature cube, power spectrum)
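
The loop-based items in this list (blurring a 1D signal and convolving two signals) can be sketched with explicit loops and no dependencies. The function names and the box kernel are assumptions for this example; the FFT variant is not shown.

```python
def convolve_1d(signal, kernel):
    """Full discrete convolution of two sequences using explicit loops."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out


def blur_1d(signal, width=3):
    """Blur with a box (moving-average) kernel, keeping the input length.

    The signal is implicitly zero-padded, so values near the edges are
    pulled towards zero.
    """
    kernel = [1.0 / width] * width
    full = convolve_1d(signal, kernel)
    start = (width - 1) // 2  # centre the kernel on each sample
    return full[start:start + len(signal)]


# A single spike spreads over its neighbours after blurring.
spike = [0.0, 0.0, 3.0, 0.0, 0.0]
blurred = blur_1d(spike, width=3)
```

The nested loop costs O(n * m) operations for a signal of length n and a kernel of length m, which is the usual motivation for switching to an FFT-based convolution on long inputs.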

Stacking

  1. binary
  2. regression
  3. multi-classes
  4. Stack multiple sklearn regressors with XGB

Stochastic study

  1. Cryptocurrencies correlation
  2. Predict cryptocurrencies with multiple stacked models
  3. Simple stock analysis
  4. ARIMA for flight prediction
  5. Tesla market study

BigQuery

  1. Integrate BigQuery with pandas in Python
  2. Medicare queries with plotly visualization

Network study

  1. Graph nodes showing whom each person spoke to most
  2. Spooky social network analysis
  3. Taxi nodes analysis
  4. Stackoverflow tags analysis
  5. Donald Trump news social network
  6. Najib Razak Twitter social network

Visualization

  1. Geographic using basemap
  2. Folium map and time analysis
  3. Israel graph visualization
  4. Israel political landscape
  5. Distribution age vs type for library
  6. Growth study for library
  7. Botnet attack analysis
  8. Plotly geo-mapping 101
  9. Plotly bombing mapping visualization
  10. Easy Plotly using cufflinks
  11. Plotly pokemon data
  12. Rare visualization
  13. Dynamic map visualization using plotly and folium
  14. Kaggle 2018 Report

Markov

  1. Independent variables on weather forecast
  2. Dependent variables on text dataset
  3. Shakespeare character-wise generator
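
A character-wise generator like the one named in the last item can be sketched as a fixed-order Markov chain over characters. The function names, the `order` parameter and the toy training string are assumptions for this example and do not describe the repository's implementation.

```python
import random
from collections import defaultdict


def build_chain(text, order=2):
    """Map each `order`-character state to the characters that followed it."""
    chain = defaultdict(list)
    for i in range(len(text) - order):
        state = text[i:i + order]
        chain[state].append(text[i + order])
    return chain


def generate(chain, seed, length=50, rng=None):
    """Extend `seed` by up to `length` characters sampled from the chain.

    The seed must be non-empty and as long as the order used to build
    the chain. Generation stops early at a state that was never seen
    with a follower.
    """
    rng = rng or random.Random()
    order = len(seed)
    out = seed
    for _ in range(length):
        followers = chain.get(out[-order:])
        if not followers:
            break
        # Repeated followers in the list make frequent ones more likely.
        out += rng.choice(followers)
    return out


# Toy training text; a real run would load a much larger corpus.
chain = build_chain("to be or not to be that is the question", order=2)
sample = generate(chain, "to", length=20, rng=random.Random(0))
```

Raising `order` makes the output look more like the training text but also makes it more likely to copy long stretches of it verbatim.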

English-text normalization

  1. Normalized texts (Dates, Measure, Decimals, Cardinals, Electronic - URL, Currency - Dollars, Telephone Numbers)
  2. Normalized texts (Cardinal, Digit, Ordinal, Letters, Address, Telephone, Electronic, Fractions, Money)