
h2oai / Pystacknet

License: MIT

Projects that are alternatives of or similar to Pystacknet

Fusenet
Deep fusion project of deeply-fused nets, and the study on the connection to ensembling
Stars: ✭ 230 (-0.86%)
Mutual labels:  jupyter-notebook
Neural Network From Scratch
Ever wondered how to code your Neural Network using NumPy, with no frameworks involved?
Stars: ✭ 230 (-0.86%)
Mutual labels:  jupyter-notebook
Introduction To Python
Python is an interpreted, high-level, general-purpose programming language. Created by Guido van Rossum and first released in 1991, Python's design philosophy emphasizes code readability with its notable use of significant white space. (This repository contains Python 3 Code)
Stars: ✭ 232 (+0%)
Mutual labels:  jupyter-notebook
Quantiacs Python
Python version of Quantiacs toolbox and sample trading strategies
Stars: ✭ 230 (-0.86%)
Mutual labels:  jupyter-notebook
Hamiltonian Nn
Code for our paper "Hamiltonian Neural Networks"
Stars: ✭ 229 (-1.29%)
Mutual labels:  jupyter-notebook
Dagmm
My attempt at reproducing the paper Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection
Stars: ✭ 231 (-0.43%)
Mutual labels:  jupyter-notebook
Comp Genomics Class
Code and examples for JHU Computational Genomics class
Stars: ✭ 228 (-1.72%)
Mutual labels:  jupyter-notebook
Wassdistance
Approximating Wasserstein distances with PyTorch
Stars: ✭ 229 (-1.29%)
Mutual labels:  jupyter-notebook
Nlp made easy
Explains nlp building blocks in a simple manner.
Stars: ✭ 232 (+0%)
Mutual labels:  jupyter-notebook
Stylegan2 Face Modificator
Simple Encoder, Generator and Face Modificator with StyleGAN2
Stars: ✭ 232 (+0%)
Mutual labels:  jupyter-notebook
Structuredinference
Structured Inference Networks for Nonlinear State Space Models
Stars: ✭ 230 (-0.86%)
Mutual labels:  jupyter-notebook
Stat451 Machine Learning Fs20
STAT 451: Intro to Machine Learning @ UW-Madison (Fall 2020)
Stars: ✭ 230 (-0.86%)
Mutual labels:  jupyter-notebook
Imaginary Numbers Are Real
Code To Accompany YouTube Series Imaginary Numbers Are Real
Stars: ✭ 231 (-0.43%)
Mutual labels:  jupyter-notebook
Beakerx
Beaker Extensions for Jupyter Notebook
Stars: ✭ 2,594 (+1018.1%)
Mutual labels:  jupyter-notebook
Aotodata
Data analysis, web scraping, and source data used in Zhu Xiaowu's articles
Stars: ✭ 232 (+0%)
Mutual labels:  jupyter-notebook
Mydatascienceportfolio
Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-2.16%)
Mutual labels:  jupyter-notebook
Machine Learning By Andrew Ng In Python
Documenting my python implementation of Andrew Ng's Machine Learning course
Stars: ✭ 231 (-0.43%)
Mutual labels:  jupyter-notebook
Statannot
add statistical annotations (pvalue significance) on an existing boxplot generated by seaborn boxplot
Stars: ✭ 228 (-1.72%)
Mutual labels:  jupyter-notebook
Learn Statistical Learning Method
Implementation of the algorithms in Statistical Learning Methods (《统计学习方法》), Second Edition.
Stars: ✭ 228 (-1.72%)
Mutual labels:  jupyter-notebook
Installations mac ubuntu windows
Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).
Stars: ✭ 231 (-0.43%)
Mutual labels:  jupyter-notebook

About

pystacknet is a light Python version of StackNet, which was originally written in Java.

It supports many of the original features, with some new elements.

Installation

git clone https://github.com/h2oai/pystacknet
cd pystacknet
python setup.py install

New features

pystacknet's main object is a 2-dimensional list of sklearn-style models. This list defines the StackNet structure and is the equivalent of the parameters section in the Java version. A representative example could be:

from sklearn.ensemble import RandomForestClassifier, ExtraTreesClassifier, GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression

    models = [
        ######## First level ########
        [RandomForestClassifier(n_estimators=100, criterion="entropy", max_depth=5, max_features=0.5, random_state=1),
         ExtraTreesClassifier(n_estimators=100, criterion="entropy", max_depth=5, max_features=0.5, random_state=1),
         GradientBoostingClassifier(n_estimators=100, learning_rate=0.1, max_depth=5, max_features=0.5, random_state=1),
         LogisticRegression(random_state=1)],
        ######## Second level ########
        [RandomForestClassifier(n_estimators=200, criterion="entropy", max_depth=5, max_features=0.5, random_state=1)]
    ]
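Under the hood, stacking means each level's models produce out-of-fold predictions that become the input features for the next level. The sketch below illustrates that idea using only scikit-learn (the synthetic data and model choices are illustrative only, not pystacknet's actual internals):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict

X, y = make_classification(n_samples=200, n_features=10, random_state=1)

# Level 1: collect out-of-fold probability predictions from each base model
level1 = [
    RandomForestClassifier(n_estimators=50, random_state=1),
    LogisticRegression(max_iter=1000, random_state=1),
]
meta_features = np.column_stack([
    cross_val_predict(m, X, y, cv=4, method="predict_proba")[:, 1]
    for m in level1
])

# Level 2: a meta-model trained on the level-1 predictions
meta_model = LogisticRegression(random_state=1)
meta_model.fit(meta_features, y)
print(meta_features.shape)  # → (200, 2): one column per level-1 model
```

StackNetClassifier automates this fold handling, level wiring, and (optionally) restacking for an arbitrary number of levels.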

pystacknet is not as strict as the Java version and allows regressors, classifiers, or even transformers at any level of StackNet. In other words, the following could work just fine:

from sklearn.ensemble import RandomForestClassifier, ExtraTreesRegressor, GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.decomposition import PCA

    models = [
        ######## First level ########
        [RandomForestClassifier(n_estimators=100, criterion="entropy", max_depth=5, max_features=0.5, random_state=1),
         ExtraTreesRegressor(n_estimators=100, max_depth=5, max_features=0.5, random_state=1),
         GradientBoostingClassifier(n_estimators=100, learning_rate=0.1, max_depth=5, max_features=0.5, random_state=1),
         LogisticRegression(random_state=1),
         PCA(n_components=4, random_state=1)],
        ######## Second level ########
        [RandomForestClassifier(n_estimators=200, criterion="entropy", max_depth=5, max_features=0.5, random_state=1)]
    ]

Note that not all transformers are meaningful in this context, so use them at your own risk.

Parameters

A typical usage for classification could be:

from pystacknet.pystacknet import StackNetClassifier

model = StackNetClassifier(models, metric="auc", folds=4,
                           restacking=False, use_retraining=True, use_proba=True,
                           random_state=12345, n_jobs=1, verbose=1)

model.fit(x, y)
preds = model.predict_proba(x_test)


Where:

Parameter	Explanation
models	List of models. This should be a 2-dimensional list: each inner list defines a stacking level, and each entry in it is a model.
metric	Can be "auc", "logloss", "accuracy", "f1", "matthews", or your own custom metric, as long as it implements the signature (ytrue, ypred, sample_weight=).
folds	Either an integer defining the number of folds used in StackNet, or an iterable yielding train/test splits.
restacking	True to enable restacking, else False.
use_proba	If True, the metric is evaluated on probabilities instead of class predictions.
use_retraining	If True, one model per entry is fit on the whole training data in order to score the test data. Otherwise the fold models are averaged (this takes more memory, and there is no guarantee it will work better).
random_state	Integer seed for randomised procedures.
n_jobs	Number of models to run in parallel. This is independent of any extra threads allocated by the selected algorithms; e.g. it is possible to run 4 models in parallel where one is a random forest that itself runs on 10 threads.
verbose	Integer greater than zero to enable printing to the console.
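To make the metric and folds parameters concrete: a custom metric only needs to accept (ytrue, ypred, sample_weight=), and folds accepts any iterable of (train_indices, test_indices) pairs. A small sketch of both (the metric name and toy data are illustrative, not part of pystacknet):

```python
import numpy as np
from sklearn.model_selection import KFold

# Custom metric: must accept (ytrue, ypred, sample_weight=None)
def mean_abs_error_metric(ytrue, ypred, sample_weight=None):
    err = np.abs(np.asarray(ytrue, dtype=float) - np.asarray(ypred, dtype=float))
    if sample_weight is not None:
        return np.average(err, weights=sample_weight)
    return err.mean()

# Custom folds: any iterable yielding (train_indices, test_indices)
X = np.arange(20).reshape(10, 2)
folds = list(KFold(n_splits=4, shuffle=True, random_state=12345).split(X))

print(mean_abs_error_metric([0, 1, 1], [0.1, 0.8, 0.6]))  # average absolute error
print(len(folds))  # → 4
```

These would then be passed as metric=mean_abs_error_metric and folds=folds to StackNetClassifier in place of the string metric name and the integer fold count.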