All Projects → serengil → decision-trees-for-ml

serengil / decision-trees-for-ml

Licence: MIT license
Building Decision Trees From Scratch In Python

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to decision-trees-for-ml

Awesome Decision Tree Papers
A collection of research papers on decision, classification and regression trees with implementations.
Stars: ✭ 1,908 (+3027.87%)
Mutual labels:  random-forest, xgboost, lightgbm, gradient-boosting-machine, cart, decision-tree, gradient-boosting
Awesome Gradient Boosting Papers
A curated list of gradient boosting research papers with implementations.
Stars: ✭ 704 (+1054.1%)
Mutual labels:  random-forest, xgboost, lightgbm, gradient-boosting
stackgbm
🌳 Stacked Gradient Boosting Machines
Stars: ✭ 24 (-60.66%)
Mutual labels:  xgboost, gbm, lightgbm, gradient-boosting
aws-machine-learning-university-dte
Machine Learning University: Decision Trees and Ensemble Methods
Stars: ✭ 119 (+95.08%)
Mutual labels:  random-forest, xgboost, lightgbm, boosting
Auto ml
[UNMAINTAINED] Automated machine learning for analytics & production
Stars: ✭ 1,559 (+2455.74%)
Mutual labels:  xgboost, lightgbm, gradient-boosting
HyperGBM
A full pipeline AutoML tool for tabular data
Stars: ✭ 172 (+181.97%)
Mutual labels:  xgboost, gbm, lightgbm
Chefboost
A Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost w/categorical features support for Python
Stars: ✭ 176 (+188.52%)
Mutual labels:  random-forest, cart, gradient-boosting
Machine-Learning-Models
In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.
Stars: ✭ 30 (-50.82%)
Mutual labels:  random-forest, xgboost, decision-tree
fast retraining
Show how to perform fast retraining with LightGBM in different business cases
Stars: ✭ 56 (-8.2%)
Mutual labels:  xgboost, gbm, lightgbm
STOCK-RETURN-PREDICTION-USING-KNN-SVM-GUASSIAN-PROCESS-ADABOOST-TREE-REGRESSION-AND-QDA
Forecast stock prices using machine learning approach. A time series analysis. Employ the Use of Predictive Modeling in Machine Learning to Forecast Stock Return. Approach Used by Hedge Funds to Select Tradeable Stocks
Stars: ✭ 94 (+54.1%)
Mutual labels:  random-forest, adaboost, decision-tree
Text Classification Benchmark
文本分类基准测试
Stars: ✭ 18 (-70.49%)
Mutual labels:  random-forest, xgboost, lightgbm
Apartment-Interest-Prediction
Predict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text and images.
Stars: ✭ 17 (-72.13%)
Mutual labels:  xgboost, lightgbm, gradient-boosting
Tpot
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Stars: ✭ 8,378 (+13634.43%)
Mutual labels:  random-forest, xgboost, gradient-boosting
yggdrasil-decision-forests
A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models.
Stars: ✭ 156 (+155.74%)
Mutual labels:  random-forest, cart, gradient-boosting
scoruby
Ruby Scoring API for PMML
Stars: ✭ 69 (+13.11%)
Mutual labels:  random-forest, gbm, decision-tree
Mljar Supervised
Automated Machine Learning Pipeline with Feature Engineering and Hyper-Parameters Tuning 🚀
Stars: ✭ 961 (+1475.41%)
Mutual labels:  random-forest, xgboost, lightgbm
Lightgbm
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Stars: ✭ 13,293 (+21691.8%)
Mutual labels:  gbm, lightgbm, gradient-boosting
cheapml
Machine Learning algorithms coded from scratch
Stars: ✭ 17 (-72.13%)
Mutual labels:  random-forest, gradient-boosting, regression-trees
handson-ml
도서 "핸즈온 머신러닝"의 예제와 연습문제를 담은 주피터 노트북입니다.
Stars: ✭ 285 (+367.21%)
Mutual labels:  random-forest, xgboost, gradient-boosting
Benchm Ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+2908.2%)
Mutual labels:  random-forest, xgboost, gradient-boosting-machine

Decision Trees for Machine Learning

Disclaimer: This repository is a sketchbook learning the background of decision tree algorithms. It is neither clean nor readable. Please direct yourself to Chefboost repository to have clean one.

This is the repository of Decision Trees for Machine Learning online course published on Udemy. In this course, the following algorithms will be covered. All project is going to be developed on Python (3.6.4), and neither out-of-the-box library nor framework will be used to build decision trees.

1- ID3

2- C4.5

3- CART (Classification And Regression Trees)

4- Regression Trees (CART for regression)

5- Random Forest

6- Gradient Boosting Decision Trees for Regression

7- Gradient Boosting Decision Trees for Classification

8- Adaboost

Just call the decision.py file to run the program. You might want to change the running algorithm. You just need to set algorithm variable.

algorithm = "ID3" #Please set this variable to ID3, C4.5, CART or Regression

Moreover, you might want to apply random forest. Please set this to True in this case.

enableRandomForest = False

Furthermore, you can apply gradient boosting regression trees.

enableGradientBoosting = True

Besides, adaptive boosting is allowed to run

enableAdaboost = True

Finally, you can change the data set to build different decision trees. Just pass the file name, and its column names if it does not exist.

df = pd.read_csv("car.data"
  #column names can either be defined in the source file or names parameter in read_csv command
  ,names=["buying","maint","doors","persons","lug_boot","safety","Decision"] 
)

Prerequisites

Pandas and numpy python libraries are used to load data sets in this repository. You might run the following commands to install these packages if you are going to use them first time.

pip install pandas
pip install numpy

Updates

To keep yourself up-to-date you might check posts in my blog about decision trees

License

This repository is licensed under the MIT License - see LICENSE for more details.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].