
aertslab / arboreto

License: BSD-3-Clause
A scalable python-based framework for gene regulatory network inference using tree-based ensemble regressors.

Programming Languages

Jupyter Notebook

Projects that are alternatives of or similar to arboreto

handson-ml
Jupyter notebooks with the examples and exercises from the book "Hands-On Machine Learning".
Stars: ✭ 285 (+763.64%)
Mutual labels:  random-forest, ensemble-learning, gradient-boosting
Awesome Decision Tree Papers
A collection of research papers on decision, classification and regression trees with implementations.
Stars: ✭ 1,908 (+5681.82%)
Mutual labels:  random-forest, ensemble-learning, gradient-boosting
Tpot
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Stars: ✭ 8,378 (+25287.88%)
Mutual labels:  random-forest, gradient-boosting
Gcforest
This is the official implementation for the paper 'Deep forest: Towards an alternative to deep neural networks'
Stars: ✭ 1,214 (+3578.79%)
Mutual labels:  random-forest, ensemble-learning
Emlearn
Machine Learning inference engine for Microcontrollers and Embedded devices
Stars: ✭ 154 (+366.67%)
Mutual labels:  random-forest, inference
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+17039.39%)
Mutual labels:  random-forest, ensemble-learning
Awesome Gradient Boosting Papers
A curated list of gradient boosting research papers with implementations.
Stars: ✭ 704 (+2033.33%)
Mutual labels:  random-forest, gradient-boosting
Predicting real estate prices using scikit Learn
Predicting Amsterdam house / real estate prices using Ordinary Least Squares, XGBoost, KNN, Lasso, Ridge, Polynomial, Random Forest, and Neural Network MLP regression (via scikit-learn)
Stars: ✭ 78 (+136.36%)
Mutual labels:  random-forest, ensemble-learning
AdaptiveRandomForest
Repository for the AdaptiveRandomForest algorithm implemented in MOA 2016-04
Stars: ✭ 28 (-15.15%)
Mutual labels:  random-forest, ensemble-learning
Variational gradient matching for dynamical systems
Sample code for the NIPS paper "Scalable Variational Inference for Dynamical Systems"
Stars: ✭ 22 (-33.33%)
Mutual labels:  scalable, inference
Infiniteboost
InfiniteBoost: building infinite ensembles with gradient descent
Stars: ✭ 180 (+445.45%)
Mutual labels:  random-forest, gradient-boosting
decision-trees-for-ml
Building Decision Trees From Scratch In Python
Stars: ✭ 61 (+84.85%)
Mutual labels:  random-forest, gradient-boosting
Deep Forest
An Efficient, Scalable and Optimized Python Framework for Deep Forest (2021.2.1)
Stars: ✭ 547 (+1557.58%)
Mutual labels:  random-forest, ensemble-learning
User Machine Learning Tutorial
useR! 2016 Tutorial: Machine Learning Algorithmic Deep Dive http://user2016.org/tutorials/10.html
Stars: ✭ 393 (+1090.91%)
Mutual labels:  random-forest, ensemble-learning
Awesome Fraud Detection Papers
A curated list of data mining papers about fraud detection.
Stars: ✭ 843 (+2454.55%)
Mutual labels:  random-forest, gradient-boosting
Sharplearning
Machine learning for C# .Net
Stars: ✭ 294 (+790.91%)
Mutual labels:  random-forest, ensemble-learning
yggdrasil-decision-forests
A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models.
Stars: ✭ 156 (+372.73%)
Mutual labels:  random-forest, gradient-boosting
forestError
A Unified Framework for Random Forest Prediction Error Estimation
Stars: ✭ 23 (-30.3%)
Mutual labels:  random-forest, inference
Chefboost
A Lightweight Decision Tree Framework for Python supporting regular algorithms (ID3, C4.5, CART, CHAID, Regression Trees) and advanced techniques (Gradient Boosting: GBDT, GBRT, GBM; Random Forest; AdaBoost), with categorical feature support
Stars: ✭ 176 (+433.33%)
Mutual labels:  random-forest, gradient-boosting
stackgbm
🌳 Stacked Gradient Boosting Machines
Stars: ✭ 24 (-27.27%)
Mutual labels:  ensemble-learning, gradient-boosting

arboreto

Build Status · Documentation Status · Bioconda package · PyPI package
The most satisfactory definition of man from the scientific point of view is probably Man the Tool-maker.

Inferring a gene regulatory network (GRN) from gene expression data is a computationally expensive task, exacerbated by increasing data sizes due to advances in high-throughput gene profiling technology.

The arboreto software library addresses this issue by providing a computational strategy for executing the class of GRN inference algorithms exemplified by GENIE3 [1] on hardware ranging from a single computer to a multi-node compute cluster. Algorithms in this class proceed in a series of steps, one for each target gene in the dataset: a regression model is fitted to predict the target gene's expression profile from a set of candidate regulators, and the most important regulators are then read off the fitted model.
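
To make the per-target step concrete, the sketch below illustrates the idea for a single target gene with a scikit-learn random forest: fit a regression model from candidate regulators to the target's expression profile and rank the regulators by feature importance. This is a conceptual illustration, not arboreto's implementation; the expression DataFrame and tf_names list are hypothetical inputs.

```python
import pandas as pd
from sklearn.ensemble import RandomForestRegressor

def rank_regulators(expression: pd.DataFrame, tf_names: list, target: str):
    """Rank candidate regulators of one target gene by feature importance."""
    # Candidate regulators: all transcription factors except the target itself.
    regulators = [tf for tf in tf_names if tf != target]

    X = expression[regulators].values  # observations x candidate regulators
    y = expression[target].values      # expression profile of the target gene

    model = RandomForestRegressor(n_estimators=100, random_state=0)
    model.fit(X, y)

    # Feature importances serve as (unnormalized) regulatory link weights.
    return sorted(zip(regulators, model.feature_importances_),
                  key=lambda pair: pair[1], reverse=True)
```

Repeating this step for every target gene and concatenating the ranked links yields the full candidate network.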

Members of this class of GRN inference algorithms are attractive from a computational point of view because they are inherently parallelizable: the per-gene regressions are independent of one another. In arboreto, we specify the parallelizable computation as a dask graph [2], a data structure that represents the task schedule of a computation. A dask scheduler assigns the tasks in a dask graph to the available computational resources. Arboreto uses the dask distributed scheduler to spread the computational tasks over multiple processes running on one or more machines.
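
For example, pointing arboreto at a Dask distributed cluster might look like the following minimal sketch. It assumes arboreto's documented entry points (arboreto.algo.grnboost2, arboreto.utils.load_tf_names) and the client_or_address parameter; 'expression.tsv' and 'tf_names.txt' are hypothetical file names.

```python
import pandas as pd
from dask.distributed import Client, LocalCluster
from arboreto.algo import grnboost2
from arboreto.utils import load_tf_names

if __name__ == '__main__':
    # Hypothetical inputs: an observations-by-genes expression matrix and a
    # list of candidate transcription factors.
    ex_matrix = pd.read_csv('expression.tsv', sep='\t')
    tf_names = load_tf_names('tf_names.txt')

    # A local cluster with one thread per worker process; the address of a
    # remote dask.distributed scheduler could be passed instead.
    cluster = LocalCluster(n_workers=4, threads_per_worker=1)
    client = Client(cluster)

    network = grnboost2(expression_data=ex_matrix,
                        tf_names=tf_names,
                        client_or_address=client)

    network.to_csv('network.tsv', sep='\t', index=False)
```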

Arboreto currently supports two GRN inference algorithms:

  1. GRNBoost2: a novel and fast GRN inference algorithm using Stochastic Gradient Boosting Machine (SGBM) [3] regression with early-stopping regularization.
  2. GENIE3: the classic GRN inference algorithm using Random Forest (RF) or ExtraTrees (ET) regression.

Get Started

Arboreto was conceived with the working bioinformatician or data scientist in mind. We provide extensive documentation and examples to help you get up to speed with the library.
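
As a starting point, a minimal local run might look like the sketch below, assuming the documented arboreto.algo entry points; 'expression.tsv' is a hypothetical path. Both grnboost2 and genie3 return a pandas DataFrame of (TF, target, importance) links.

```python
import pandas as pd
from arboreto.algo import genie3

if __name__ == '__main__':
    # Hypothetical observations-by-genes expression matrix.
    ex_matrix = pd.read_csv('expression.tsv', sep='\t')

    # With tf_names='all', every gene is considered a candidate regulator.
    network = genie3(expression_data=ex_matrix, tf_names='all')

    print(network.head())  # columns: TF, target, importance
```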

License

BSD 3-Clause License

pySCENIC

Arboreto is a component of pySCENIC: a lightning-fast Python implementation of the SCENIC pipeline [5] (Single-Cell rEgulatory Network Inference and Clustering), which enables biologists to infer transcription factors, gene regulatory networks, and cell types from single-cell RNA-seq data.

References

  1. Huynh-Thu VA, Irrthum A, Wehenkel L, Geurts P (2010) Inferring Regulatory Networks from Expression Data Using Tree-Based Methods. PLoS ONE
  2. Rocklin, M. (2015). Dask: parallel computation with blocked algorithms and task scheduling. In Proceedings of the 14th Python in Science Conference (pp. 130-136).
  3. Friedman, J. H. (2002). Stochastic gradient boosting. Computational Statistics & Data Analysis, 38(4), 367-378.
  4. Marbach, D., Costello, J. C., Kuffner, R., Vega, N. M., Prill, R. J., Camacho, D. M., ... & Dream5 Consortium. (2012). Wisdom of crowds for robust gene network inference. Nature methods, 9(8), 796-804.
  5. Aibar S, Bravo Gonzalez-Blas C, Moerman T, Wouters J, Huynh-Thu VA, Imrichova H, Kalender Atak Z, Hulselmans G, Dewaele M, Rambow F, Geurts P, Aerts J, Marine C, van den Oord J, Aerts S. SCENIC: Single-cell regulatory network inference and clustering. Nature Methods 14, 1083–1086 (2017). doi: 10.1038/nmeth.4463