All Projects → barisesmer → C4.5

barisesmer / C4.5

Licence: mit
A python implementation of C4.5 algorithm by R. Quinlan

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to C4.5

Fuku Ml
Simple machine learning library / 簡單易用的機器學習套件
Stars: ✭ 280 (+449.02%)
Mutual labels:  classification, decision-trees
Machine Learning With Python
Practice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+4207.84%)
Mutual labels:  classification, decision-trees
Sporf
This is the implementation of Sparse Projection Oblique Randomer Forest
Stars: ✭ 70 (+37.25%)
Mutual labels:  classification, decision-trees
supervised-machine-learning
This repo contains regression and classification projects. Examples: development of predictive models for comments on social media websites; building classifiers to predict outcomes in sports competitions; churn analysis; prediction of clicks on online ads; analysis of the opioids crisis and an analysis of retail store expansion strategies using…
Stars: ✭ 34 (-33.33%)
Mutual labels:  classification, decision-trees
AdaptiveRandomForest
Repository for the AdaptiveRandomForest algorithm implemented in MOA 2016-04
Stars: ✭ 28 (-45.1%)
Mutual labels:  classification, decision-trees
Machine Learning From Scratch
Machine Learning models from scratch with a better visualisation
Stars: ✭ 15 (-70.59%)
Mutual labels:  classification, decision-trees
Orange3
🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+6080.39%)
Mutual labels:  classification, decision-trees
df-dn-paper
Conceptual & empirical comparisons between decision forests & deep networks
Stars: ✭ 14 (-72.55%)
Mutual labels:  classification, decision-trees
The Deep Learning With Keras Workshop
An Interactive Approach to Understanding Deep Learning with Keras
Stars: ✭ 34 (-33.33%)
Mutual labels:  classification, decision-trees
Urban Sound Classification
Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)
Stars: ✭ 39 (-23.53%)
Mutual labels:  classification
Street View House Numbers Svhn Detection And Classification Using Cnn
A 2-CNN pipeline to do both detection (using bounding box regression) and classification of numbers on SVHN dataset.
Stars: ✭ 46 (-9.8%)
Mutual labels:  classification
Ml Classify Text Js
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Stars: ✭ 38 (-25.49%)
Mutual labels:  classification
Chemometricstools.jl
A collection of tools for chemometrics and machine learning written in Julia.
Stars: ✭ 39 (-23.53%)
Mutual labels:  classification
Detext
DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks
Stars: ✭ 1,039 (+1937.25%)
Mutual labels:  classification
Timbl
TiMBL implements several memory-based learning algorithms.
Stars: ✭ 38 (-25.49%)
Mutual labels:  classification
Gd Uap
Generalized Data-free Universal Adversarial Perturbations
Stars: ✭ 50 (-1.96%)
Mutual labels:  classification
Yannl
Yet another neural network library
Stars: ✭ 37 (-27.45%)
Mutual labels:  classification
Face Mask Detection
Face masks are crucial in minimizing the propagation of Covid-19, and are highly recommended or even obligatory in many situations. In this project, we develop a pipeline to detect unmasked faces in images. This can, for example, be used to alert people that do not wear a mask when entering a building.
Stars: ✭ 37 (-27.45%)
Mutual labels:  classification
Php Ml
PHP-ML - Machine Learning library for PHP
Stars: ✭ 7,900 (+15390.2%)
Mutual labels:  classification
Lightning Covid19
Classification for covid-19 chest X-ray images using Lightning
Stars: ✭ 49 (-3.92%)
Mutual labels:  classification

C4.5

An implementation of C4.5 machine learning algorithm in python

C4.5 Algorithm

C4.5 is an algorithm developed by John Ross Quinlan that creates decision tress. A decision tree is a tool that is used for classification in machine learning, which uses a tree structure where internal nodes represent tests and leaves represent decisions. C4.5 makes use of information theoretic concepts such as entropy to classify the data.

alt text

Data Format

For each dataset there should be two files, one that describes the classes and attributes and one that consists of the actual data. The file for attributes and classes should contain all the classes in first line and after that, line by line the attributes and their possible values if the attribute is discrete. For continuos(numerical) attributes, possible values would be "continuos". Check the iris dataset folder for actual data and more specific syntax.

Usage

Create a C4.5 object like this

c1 = C45("path_to_data_file", "path_to_description_file")

After this, you can fetch and preprocess the data, generate the tree and print it to screen.

Running Tests

Navigate to the directory "C4.5" and type python -m unittest discover to run all the test modules under "C4.5/tests" folder. (the names of the modules should start with "test" and end with ".py")

Relevant Links

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].