mlr-org / Mlr

Licence: other

Machine Learning in R

Programming Languages

7636 projects

Projects that are alternatives of or similar to Mlr

Mlj.jl

A Julia machine learning framework

Stars: ✭ 982 (-36.32%)

Mutual labels: data-science, statistics, classification, predictive-modeling, stacking, clustering, regression

Machine Learning With Python

Practice and tutorial-style notebooks covering wide variety of machine learning techniques

Stars: ✭ 2,197 (+42.48%)

Mutual labels: data-science, statistics, classification, clustering, regression

Smile

Statistical Machine Intelligence & Learning Engine

Stars: ✭ 5,412 (+250.97%)

Mutual labels: data-science, statistics, classification, clustering, regression

Uci Ml Api

Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)

Stars: ✭ 190 (-87.68%)

Mutual labels: data-science, statistics, classification, clustering, regression

Openml R

R package to interface with OpenML

Stars: ✭ 81 (-94.75%)

Mutual labels: data-science, statistics, classification, cran, regression

Interactive machine learning

IPython widgets, interactive plots, interactive machine learning

Stars: ✭ 140 (-90.92%)

Mutual labels: data-science, statistics, classification, regression

Data Science Toolkit

Collection of stats, modeling, and data science tools in Python and R.

Stars: ✭ 169 (-89.04%)

Mutual labels: data-science, statistics, classification, regression

Lightautoml

LAMA - automatic model creation framework

Stars: ✭ 196 (-87.29%)

Mutual labels: data-science, classification, stacking, regression

Orange3

🍊 📊 💡 Orange: Interactive data analysis

Stars: ✭ 3,152 (+104.41%)

Mutual labels: data-science, classification, clustering, regression

Neuroflow

Artificial Neural Networks for Scala

Stars: ✭ 105 (-93.19%)

Mutual labels: data-science, classification, clustering, regression

Pycaret

An open-source, low-code machine learning library in Python

Stars: ✭ 4,594 (+197.92%)

Mutual labels: data-science, clustering, regression, classification

Metriculous

Measure and visualize machine learning model performance without the usual boilerplate.

Stars: ✭ 71 (-95.4%)

Mutual labels: data-science, statistics, classification, regression

Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

Stars: ✭ 2,936 (+90.4%)

Mutual labels: statistics, clustering, regression, classification

BAS

BAS R package https://merliseclyde.github.io/BAS/

Stars: ✭ 36 (-97.67%)

Mutual labels: cran, regression, r-package, predictive-modeling

Mlbox

MLBox is a powerful Automated Machine Learning python library.

Stars: ✭ 1,199 (-22.24%)

Mutual labels: data-science, classification, stacking, regression

A high-level machine learning and deep learning library for the PHP language.

Stars: ✭ 1,270 (-17.64%)

Mutual labels: data-science, classification, clustering, regression

Tensorflow Book

Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.

Stars: ✭ 4,448 (+188.46%)

Mutual labels: classification, clustering, regression

Mlr3

mlr3: Machine Learning in R - next generation

Stars: ✭ 463 (-69.97%)

Mutual labels: data-science, classification, regression

Alphapy

Automated Machine Learning [AutoML] with Python, scikit-learn, Keras, XGBoost, LightGBM, and CatBoost

Stars: ✭ 564 (-63.42%)

Mutual labels: data-science, classification, regression

Fivethirtyeight

R package of data and code behind the stories and interactives at FiveThirtyEight

Stars: ✭ 422 (-72.63%)

Mutual labels: data-science, statistics, cran

View All Similar Projects ➔

mlr

Package website: release | dev

Machine learning in R.

Deprecated

{mlr} is considered retired from the mlr-org team. We won't add new features anymore and will only fix severe bugs. We suggest to use the new mlr3 framework from now on and for future projects.

Not all features of {mlr} are already implemented in {mlr3}. If you are missing a crucial feature, please open an issue in the respective mlr3 extension package and do not hesitate to follow-up on it.

Installation

Release

install.packages("mlr")

Development

remotes::install_github("mlr-org/mlr")

Citing {mlr} in publications

Please cite our JMLR paper [bibtex].

Some parts of the package were created as part of other publications. If you use these parts, please cite the relevant work appropriately. An overview of all {mlr} related publications can be found here.

Introduction

R does not define a standardized interface for its machine-learning algorithms. Therefore, for any non-trivial experiments, you need to write lengthy, tedious and error-prone wrappers to call the different algorithms and unify their respective output.

Additionally you need to implement infrastructure to

resample your models
optimize hyperparameters
select features
cope with pre- and post-processing of data and compare models in a statistically meaningful way.

As this becomes computationally expensive, you might want to parallelize your experiments as well. This often forces users to make crummy trade-offs in their experiments due to time constraints or lacking expert programming skills.

{mlr} provides this infrastructure so that you can focus on your experiments! The framework provides supervised methods like classification, regression and survival analysis along with their corresponding evaluation and optimization methods, as well as unsupervised methods like clustering. It is written in a way that you can extend it yourself or deviate from the implemented convenience methods and construct your own complex experiments or algorithms.

Furthermore, the package is nicely connected to the OpenML R package and its online platform, which aims at supporting collaborative machine learning online and allows to easily share datasets as well as machine learning tasks, algorithms and experiments in order to support reproducible research.

Features

Clear S3 interface to R classification, regression, clustering and survival analysis methods
Abstract description of learners and tasks by properties
Convenience methods and generic building blocks for your machine learning experiments
Resampling methods like bootstrapping, cross-validation and subsampling
Extensive visualizations (e.g. ROC curves, predictions and partial predictions)
Simplified benchmarking across data sets and learners
Easy hyperparameter tuning using different optimization strategies, including potent configurators like
- iterated F-racing (irace)
- sequential model-based optimization
Variable selection with filters and wrappers
Nested resampling of models with tuning and feature selection
Cost-sensitive learning, threshold tuning and imbalance correction
Wrapper mechanism to extend learner functionality in complex ways
Possibility to combine different processing steps to a complex data mining chain that can be jointly optimized
OpenML connector for the Open Machine Learning server
Built-in parallelization
Detailed tutorial

Miscellaneous

Simple usage questions are better suited at Stackoverflow using the mlr tag.

Please note that all of us work in academia and put a lot of work into this project - simply because we like it, not because we are paid for it.

New development efforts should go into {mlr3}. We have a own style guide which can easily applied by using the mlr_style from the styler package. See our wiki for more information.

Talks, Workshops, etc.

mlr-outreach holds all outreach activities related to {mlr} and {mlr3}.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

mlr-org / Mlr

Programming Languages

Labels

Projects that are alternatives of or similar to Mlr

mlr

Deprecated

Installation

Citing {mlr} in publications

Introduction

Features

Miscellaneous

Talks, Workshops, etc.