All Projects → tidymodels → Broom

tidymodels / Broom

Licence: other
Convert statistical analysis objects from R into tidy format

Programming Languages

r
7636 projects

Labels

Projects that are alternatives of or similar to Broom

Cobrapy
COBRApy is a package for constraint-based modeling of metabolic networks.
Stars: ✭ 267 (-75.44%)
Mutual labels:  modeling
Lime
Local Interpretable Model-Agnostic Explanations (R port of original Python package)
Stars: ✭ 438 (-59.71%)
Mutual labels:  modeling
Imagine Old
Modeling and simulations using computational graphs
Stars: ✭ 25 (-97.7%)
Mutual labels:  modeling
Simpeg
Simulation and Parameter Estimation in Geophysics - A python package for simulation and gradient based parameter estimation in the context of geophysical applications.
Stars: ✭ 283 (-73.97%)
Mutual labels:  modeling
Gempy
GemPy is an open-source, Python-based 3-D structural geological modeling software, which allows the implicit (i.e. automatic) creation of complex geological models from interface and orientation data. It also offers support for stochastic modeling to adress parameter and model uncertainties.
Stars: ✭ 396 (-63.57%)
Mutual labels:  modeling
Jsmodeler
A JavaScript framework to create and visualize 3D models.
Stars: ✭ 608 (-44.07%)
Mutual labels:  modeling
p5-HackaMol
Object-Oriented Perl 5, Moose Library for Molecular Hacking
Stars: ✭ 12 (-98.9%)
Mutual labels:  modeling
Opengeode
Open source framework for representing and manipulating geometric models
Stars: ✭ 46 (-95.77%)
Mutual labels:  modeling
Plaitpy
plait.py - a fake data modeler
Stars: ✭ 416 (-61.73%)
Mutual labels:  modeling
Fastscape
A fast, versatile and user-friendly landscape evolution model
Stars: ✭ 22 (-97.98%)
Mutual labels:  modeling
Cascadestudio
A Full Live-Scripted CAD Kernel in the Browser
Stars: ✭ 344 (-68.35%)
Mutual labels:  modeling
Gaphor
Gaphor is the simple modeling tool
Stars: ✭ 386 (-64.49%)
Mutual labels:  modeling
Cadquery
A python parametric CAD scripting framework based on OCCT
Stars: ✭ 764 (-29.71%)
Mutual labels:  modeling
Sealion
The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.
Stars: ✭ 278 (-74.43%)
Mutual labels:  modeling
Jomini
Historical battle simulation package for Python
Stars: ✭ 31 (-97.15%)
Mutual labels:  modeling
Efdesigner
Entity Framework visual design surface and code-first code generation for EF6, Core and beyond
Stars: ✭ 256 (-76.45%)
Mutual labels:  modeling
Pynamical
Pynamical is a Python package for modeling and visualizing discrete nonlinear dynamical systems, chaos, and fractals.
Stars: ✭ 458 (-57.87%)
Mutual labels:  modeling
Modeler
Free 3D Modeling Tool
Stars: ✭ 50 (-95.4%)
Mutual labels:  modeling
Cryptobeauty
Crypto Beauty 密碼女孩 - 你專屬的區塊鏈少女卡片創作交易平台. Live on the TRON Blockchain.
Stars: ✭ 37 (-96.6%)
Mutual labels:  modeling
Ditras
DITRAS (DIary-based TRAjectory Simulator), a mathematical model to simulate human mobility
Stars: ✭ 19 (-98.25%)
Mutual labels:  modeling

broom

R build status Coverage status CRAN status Downloads

Overview

broom summarizes key information about models in tidy tibble()s. broom provides three verbs to make it convenient to interact with model objects:

  • tidy() summarizes information about model components
  • glance() reports information about the entire model
  • augment() adds informations about observations to a dataset

For a detailed introduction, please see vignette("broom").

broom tidies 100+ models from popular modelling packages and almost all of the model objects in the stats package that comes with base R. vignette("available-methods") lists method availability.

If you aren’t familiar with tidy data structures and want to know how they can make your life easier, we highly recommend reading Hadley Wickham’s Tidy Data.

Installation

# we recommend installing the entire tidyverse 
# modeling set, which includes broom:
install.packages("tidymodels")

# alternatively, to install just broom:
install.packages("broom")

# to get the development version from GitHub:
install.packages("devtools")
devtools::install_github("tidymodels/broom")

If you find a bug, please file a minimal reproducible example in the issues.

Usage

tidy() produces a tibble() where each row contains information about an important component of the model. For regression models, this often corresponds to regression coefficients. This is can be useful if you want to inspect a model or create custom visualizations.

library(broom)

fit <- lm(Volume ~ Girth + Height, trees)
tidy(fit)
#> # A tibble: 3 x 5
#>   term        estimate std.error statistic  p.value
#>   <chr>          <dbl>     <dbl>     <dbl>    <dbl>
#> 1 (Intercept)  -58.0       8.64      -6.71 2.75e- 7
#> 2 Girth          4.71      0.264     17.8  8.22e-17
#> 3 Height         0.339     0.130      2.61 1.45e- 2

glance() returns a tibble with exactly one row of goodness of fitness measures and related statistics. This is useful to check for model misspecification and to compare many models.

glance(fit)
#> # A tibble: 1 x 12
#>   r.squared adj.r.squared sigma statistic  p.value    df logLik   AIC   BIC
#>       <dbl>         <dbl> <dbl>     <dbl>    <dbl> <dbl>  <dbl> <dbl> <dbl>
#> 1     0.948         0.944  3.88      255. 1.07e-18     2  -84.5  177.  183.
#> # … with 3 more variables: deviance <dbl>, df.residual <int>, nobs <int>

augment adds columns to a dataset, containing information such as fitted values, residuals or cluster assignments. All columns added to a dataset have . prefix to prevent existing columns from being overwritten.

augment(fit, data = trees)
#> # A tibble: 31 x 9
#>    Girth Height Volume .fitted .resid .std.resid   .hat .sigma   .cooksd
#>    <dbl>  <dbl>  <dbl>   <dbl>  <dbl>      <dbl>  <dbl>  <dbl>     <dbl>
#>  1   8.3     70   10.3    4.84  5.46      1.50   0.116    3.79 0.0978   
#>  2   8.6     65   10.3    4.55  5.75      1.60   0.147    3.77 0.148    
#>  3   8.8     63   10.2    4.82  5.38      1.53   0.177    3.78 0.167    
#>  4  10.5     72   16.4   15.9   0.526     0.140  0.0592   3.95 0.000409 
#>  5  10.7     81   18.8   19.9  -1.07     -0.294  0.121    3.95 0.00394  
#>  6  10.8     83   19.7   21.0  -1.32     -0.370  0.156    3.94 0.00840  
#>  7  11       66   15.6   16.2  -0.593    -0.162  0.115    3.95 0.00114  
#>  8  11       75   18.2   19.2  -1.05     -0.277  0.0515   3.95 0.00138  
#>  9  11.1     80   22.6   21.4   1.19      0.321  0.0920   3.95 0.00348  
#> 10  11.2     75   19.9   20.2  -0.288    -0.0759 0.0480   3.95 0.0000968
#> # … with 21 more rows

Contributing

We welcome contributions of all types!

For questions and discussions about tidymodels packages, modeling, and machine learning, please post on RStudio Community. If you think you have encountered a bug, please submit an issue. Either way, learn how to create and share a reprex (a minimal, reproducible example), to clearly communicate about your code. Check out further details on contributing guidelines for tidymodels packages and how to get help.

If you have never directly contributed to an R package before, broom is an excellent place to start. Find an issue with the Beginner Friendly tag and comment that you’d like to take it on and we’ll help you get started.

Generally, too, we encourage typo corrections, bug reports, bug fixes and feature requests. Feedback on the clarity of the documentation is especially valuable!

If you are interested in adding tidier methods for new model objects, please read this article on the tidymodels website.

We have a Contributor Code of Conduct. By participating in broom you agree to abide by its terms.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].