tifabi / 100DaysOfMLCode

Licence: other
No description or website provided.

Programming Languages

Jupyter Notebook

Projects that are alternatives or similar to 100DaysOfMLCode

25daysinmachinelearning
I will update this repository to learn Machine learning with python with statistics content and materials
Stars: ✭ 53 (+178.95%)
Mutual labels:  machine-learning-algorithms, machinelearning
Mariana
The Cutest Deep Learning Framework which is also a wonderful Declarative Language
Stars: ✭ 151 (+694.74%)
Mutual labels:  machine-learning-algorithms, machinelearning
Makine Ogrenmesi
Turkish-language machine learning resource
Stars: ✭ 82 (+331.58%)
Mutual labels:  machine-learning-algorithms, machinelearning
Articles
A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (+1742.11%)
Mutual labels:  machine-learning-algorithms, machinelearning
Free Ai Resources
🚀 FREE AI Resources - 🎓 Courses, 👷 Jobs, 📝 Blogs, 🔬 AI Research, and many more - for everyone!
Stars: ✭ 192 (+910.53%)
Mutual labels:  machine-learning-algorithms, machinelearning
Model Describer
model-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (+15.79%)
Mutual labels:  machine-learning-algorithms, machinelearning
Real Time Ml Project
A curated list of applied machine learning and data science notebooks and libraries across different industries.
Stars: ✭ 143 (+652.63%)
Mutual labels:  machine-learning-algorithms, machinelearning
Machine-learning-implement
Teach you how to implement machine learning algorithms
Stars: ✭ 37 (+94.74%)
Mutual labels:  machine-learning-algorithms, machinelearning
Bet On Sibyl
Machine Learning Model for Sport Predictions (Football, Basketball, Baseball, Hockey, Soccer & Tennis)
Stars: ✭ 190 (+900%)
Mutual labels:  machine-learning-algorithms, machinelearning
Boostaroota
A fast xgboost feature selection algorithm
Stars: ✭ 165 (+768.42%)
Mutual labels:  machine-learning-algorithms, machinelearning
mlzero to hero
No description or website provided.
Stars: ✭ 40 (+110.53%)
Mutual labels:  machine-learning-algorithms, machinelearning
Echotorch
A Python toolkit for Reservoir Computing and Echo State Network experimentation based on pyTorch. EchoTorch is the only Python module available to easily create Deep Reservoir Computing models.
Stars: ✭ 231 (+1115.79%)
Mutual labels:  machine-learning-algorithms, machinelearning
Homemade Machine Learning
🤖 Python examples of popular machine learning algorithms with interactive Jupyter demos and math being explained
Stars: ✭ 18,594 (+97763.16%)
Mutual labels:  machine-learning-algorithms, machinelearning
Letslearnai.github.io
Lets Learn AI
Stars: ✭ 33 (+73.68%)
Mutual labels:  machine-learning-algorithms, machinelearning
Notebooks Statistics And Machinelearning
Jupyter Notebooks from the old UnsupervisedLearning.com (RIP) machine learning and statistics blog
Stars: ✭ 270 (+1321.05%)
Mutual labels:  machine-learning-algorithms, machinelearning
Tslearn
A machine learning toolkit dedicated to time-series data
Stars: ✭ 1,910 (+9952.63%)
Mutual labels:  machine-learning-algorithms, machinelearning
zoofs
zoofs is a Python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to evolutionary. It is an easy-to-use, flexible, and powerful tool to reduce your feature size.
Stars: ✭ 142 (+647.37%)
Mutual labels:  machine-learning-algorithms, machinelearning
Data-Scientist-In-Python
This repository contains notes and projects of Data scientist track from dataquest course work.
Stars: ✭ 23 (+21.05%)
Mutual labels:  machine-learning-algorithms, machinelearning
Nano Neuron
🤖 NanoNeuron is 7 simple JavaScript functions that will give you a feeling of how machines can actually "learn"
Stars: ✭ 2,050 (+10689.47%)
Mutual labels:  machine-learning-algorithms, machinelearning
Awesome Deep Learning And Machine Learning Questions
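[Updated irregularly] A curated collection of valuable questions about deep learning, machine learning, reinforcement learning, and data science, gathered from sites such as Zhihu, Quora, Reddit, and Stack Exchange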
Stars: ✭ 203 (+968.42%)
Mutual labels:  machine-learning-algorithms, machinelearning

100DaysOfMLCode

This repository was inspired by Siraj Raval's challenge to code machine learning for at least an hour every day for 100 days.

I nervously accepted this challenge while working full time and taking 6 hours of graduate coursework in the 2018 summer semester. I will use this repository to store code, Jupyter notebook examples, and thought processes.

Topics Explored:

Day 1 - July 7 | Principal Component Analysis (PCA) and explained variance ratio

Day 2 - July 8 | SparsePCA -> CODE

Day 3 - July 9 | Bag of Words

Day 4 - July 10 | Tokenization & Vectorization time trials -> CODE

Day 5 - July 11 | Stemming and Lemmatizing with CountVectorizer, TfidfVectorizer, and HashingVectorizer -> CODE

Day 6 - July 12 | Development of visualization pipeline for ML -> CODE

Day 7 - July 13 | Big Data Visualization with Datashader

Day 8 - July 14 | t-SNE and Datashader Failure -> CODE

Day 9 - July 15 | Gene Expression - Getting Started -> FOLDER

Day 10 - July 16 | Gene Expression - Reading in Data

Day 11 - July 17 | Gene Expression - Preprocessing & Boxplot

Day 12 - July 18 | Intro to Data Splitting -> CODE

Day 13 - July 19 | Text Relationships with spaCy -> CODE

Day 14 - July 20 | Gene Expression - Cytoscape and Orange3

Day 15 - July 21 | Trial-and-error Data Splitting Research

Day 16 - July 22 | Trial-and-error Data Splitting Implementation -> CODE

Day 17 - July 23 | NMF -> CODE

Day 18 - July 24 | RFE -> CODE

Day 19 - July 25 | Exploring Variable Replacement

Day 20 - July 26 | Pipelines - Introduction

Day 21 - July 27 | A list of 10,000 dictionaries -> CODE

Day 22 - July 28 | Linear Regression - Simple in R -> FOLDER

Day 23 - July 29 | Data Visualization, Dimensionality Reduction, Feature Selection, and a handful of models -> CODE

Day 24 - July 30 | Linear Regression - Continue to draft description -> FOLDER

Day 25 - July 31 | Linear Regression - Simple in Python -> CODE

Day 26 - Aug 1 | Pipeline - Start of Pipeline Example -> CODE

Day 27 - Aug 2 | Pipeline - Ridge Regression for Pipeline Example -> CODE

Day 28 - Aug 3 | Pipeline - Flexibility for selecting columns with missing values -> CODE

Day 29 - Aug 4 | Pipeline - Pipeline to compare methods of handling missing values -> CODE

Day 30 - Aug 5 | Pipeline - Identify categorical columns and convert to dummy -> CODE

Day 31 - Aug 6 | Pipeline - Custom Imputer using sklearn linear_model -> CODE

Day 32 - Aug 7 | kNN - add to Pipeline & normalizing -> CODE

Day 33 - Aug 8 | Pipeline - Researching topics to come

Day 34 - Aug 9 | What's great about bias?

Day 35 - Aug 10 | Bias-Variance decomposition - rounding error & elimination

Day 36 - Aug 11 | Bias-Variance decomposition from scratch in Python

Day 37 - Aug 12 | Continued work on Bias-Variance decomposition

Day 38 - Aug 13 | Bias-Variance decomposition working example

Day 39 - Aug 14 | Scatterplots for Collinearity

Day 40 - Aug 15 | ML Work for Client - not shared publicly

Day 41 - Aug 16 | Correlation Matrix for Collinearity

Day 42 - Aug 17 | Ontology from web scraping

Day 43 - Aug 18 | Eigenvalues for Multicollinearity

Day 44 - Aug 19 | Eigenvalues & Eigenvectors for Multicollinearity

Day 45 - Aug 20 | Word frequencies from PDFs

Day 46 - Aug 21 | NLP with Regression - Exploring the literature

Day 47 - Aug 22 | Text mining for Google Chips

Day 48 - Aug 23 | Methods of Web scraping

Day 49 - Aug 24 | Selenium for web scraping

Day 50 - Aug 25 | Reformatting results of web scraping

Day 51 - Aug 26 | NLP methods from web scraped results

Day 52 - Aug 27 | Applied Algorithms - different methods of sorting

Day 53 - Aug 28 | Methods of NLP for Social Media Data
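
Days 35 to 38 above cover a bias-variance decomposition written from scratch in Python. The notebook code is not reproduced here, so what follows is only a minimal sketch of the general idea under assumptions of my own: the true function (`math.sin`), the model (a straight line fitted by closed-form least squares), the noise level, and the helper names `fit_line` and `bias_variance` are all hypothetical. It refits the model on many simulated training sets and checks that the mean squared error against the noise-free target splits into squared bias plus variance, up to floating-point rounding.

```python
import math
import random


def fit_line(xs, ys):
    """Closed-form least squares fit of y = a + b * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


def bias_variance(true_fn, x_test, n_trials=200, n_train=30, noise_sd=0.3, seed=0):
    """Estimate average bias^2, variance, and MSE over the test points.

    Each trial draws a fresh noisy training set, fits a line, and records
    its predictions at every test point.
    """
    rng = random.Random(seed)
    preds = [[] for _ in x_test]
    for _ in range(n_trials):
        xs = [rng.uniform(0.0, math.pi) for _ in range(n_train)]
        ys = [true_fn(x) + rng.gauss(0.0, noise_sd) for x in xs]
        a, b = fit_line(xs, ys)
        for i, x in enumerate(x_test):
            preds[i].append(a + b * x)

    bias_sq = variance = mse = 0.0
    for x, p in zip(x_test, preds):
        target = true_fn(x)
        mean_p = sum(p) / len(p)
        bias_sq += (mean_p - target) ** 2
        # Population variance (divide by N) so the identity holds exactly.
        variance += sum((v - mean_p) ** 2 for v in p) / len(p)
        mse += sum((v - target) ** 2 for v in p) / len(p)
    m = len(x_test)
    return bias_sq / m, variance / m, mse / m


x_test = [i * math.pi / 20 for i in range(21)]
bias_sq, variance, mse = bias_variance(math.sin, x_test)
print(f"bias^2={bias_sq:.4f}  variance={variance:.4f}  mse={mse:.4f}")
print(f"decomposition gap: {abs(mse - (bias_sq + variance)):.2e}")
```

The error here is measured against the noise-free target, so the irreducible noise term does not appear. Measuring against noisy test labels would add roughly `noise_sd ** 2` to the error.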

List of Topics to Explore:

  • PCA on Genetic Data - Gene Expression
    • Create Jupyter Notebook foundation
    • Find Good Data
    • Explain how to differentiate good data from bad data
  • GPU
  • Efficient Use of Data Structures
  • Write computationally expensive parts in C++
  • Make good use of memory & caching
  • Multithreading / multiprocessing in Python; Celery for parallel processing
  • Kernel PCA
  • Differences (pro/cons) between Stemming and Lemmatizing methods
  • PCA to display failure risk
    • Lots / batches that take too long
    • Determine coorinary value
    • Adjust threshold & critical thresholds
  • Producing Production Quality code
  • How tokenized data is used for ML algorithms
  • Use of predeveloped vocabularies
  • Hypertools
  • MongoDB with Neo4j and Orient
  • AutoML
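
For the "How tokenized data is used for ML algorithms" item above, here is a small illustrative sketch of the bag-of-words idea from Day 3. It turns tokens into fixed-length count vectors that a model can consume. It uses only the standard library and is not the scikit-learn `CountVectorizer` implementation. The function names (`tokenize`, `build_vocabulary`, `vectorize`), the regular expression, and the sample sentences are assumptions made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocabulary(docs):
    """Map every distinct token to a column index (alphabetical order)."""
    tokens = sorted({tok for doc in docs for tok in tokenize(doc)})
    return {tok: i for i, tok in enumerate(tokens)}


def vectorize(doc, vocab):
    """Count how often each vocabulary token occurs in one document."""
    counts = Counter(tokenize(doc))
    vec = [0] * len(vocab)
    for tok, count in counts.items():
        if tok in vocab:  # tokens outside the vocabulary are ignored
            vec[vocab[tok]] = count
    return vec


docs = ["The cat sat on the mat", "The dog sat"]
vocab = build_vocabulary(docs)
matrix = [vectorize(d, vocab) for d in docs]

print(sorted(vocab, key=vocab.get))
for row in matrix:
    print(row)
```

Each row of `matrix` is one document and each column is one vocabulary token, which is the numeric shape most ML algorithms expect. Word order is discarded, which is the main limitation of this representation.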