markvanderloo / simputation

Licence: GPL-3.0 License
Making imputation easy

simputation

An R package to make imputation simple. Currently supported methods include

  • Model based (optionally add [non-]parametric random residual)
    • linear regression
    • robust linear regression (M-estimation)
    • ridge/elasticnet/lasso regression (from version >= 0.2.1)
    • CART models
    • Random forest
  • Model based, multivariate
    • Imputation based on EM-estimated parameters (from version >= 0.2.1)
    • missForest (from version >= 0.2.1)
  • Donor imputation (including various donor pool specifications)
    • k-nearest neighbour (based on Gower's distance)
    • sequential hotdeck (LOCF, NOCB)
    • random hotdeck
    • Predictive mean matching
  • Other
    • (groupwise) median imputation (optional random residual)
    • Proxy imputation (copy from other variable)
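
As a rough illustration of two of the methods listed above, the sketch below applies groupwise median imputation and linear regression with a random residual to the built-in iris data. It assumes the `impute_median()` and `impute_lm()` interfaces as described in the package manual, including the `add_residual` argument; the object names `imp_median` and `imp_lm` are made up for this example.

```r
library(simputation)

dat <- iris
dat[1:3, "Sepal.Length"] <- NA   # empty a few fields

# (groupwise) median imputation: the right-hand side names the grouping variable
imp_median <- impute_median(dat, Sepal.Length ~ Species)

# linear regression with a random residual drawn from a normal distribution
set.seed(1)                      # the residual is random, so fix the seed
imp_lm <- impute_lm(dat, Sepal.Length ~ Petal.Length + Species,
                    add_residual = "normal")

# check whether any missing values remain
anyNA(imp_median$Sepal.Length)
anyNA(imp_lm$Sepal.Length)
```

All imputation functions take a data frame as first argument and a formula as second, which is what makes them easy to chain.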

Installation

To install simputation and all packages needed to support various imputation models do the following.

install.packages("simputation", dependencies=TRUE)

To install the development version, clone the repository and build it from source.

git clone https://github.com/markvanderloo/simputation
make install

Example usage

Create some data with missing values.

library(simputation) # current package
library(magrittr)    # for the %>% not-a-pipe operator
dat <- iris
# empty a few fields
dat[1:3,1] <- dat[3:7,2] <- dat[8:10,5] <- NA
head(dat,10)

Now impute Sepal.Length and Sepal.Width by regression on Petal.Length and Species, and impute Species using a CART model that uses all other variables (including the imputed variables in this case).

dat %>% 
  impute_lm(Sepal.Length + Sepal.Width ~ Petal.Length + Species) %>%
  impute_cart(Species ~ .) %>% # use all variables except 'Species' as predictor
  head(10)
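
Chaining also allows a simple fallback strategy. The sketch below is an illustration rather than a recommended recipe: it assumes that `impute_lm()` leaves a value missing when one of its predictors is missing, so a second step, here groupwise median imputation, fills what remains. The name `imputed` is chosen for this example only.

```r
library(simputation)
library(magrittr)

dat <- iris
# empty a few fields; row 3 lacks both Sepal.Length and Sepal.Width
dat[1:3, 1] <- dat[3:7, 2] <- NA

imputed <- dat %>%
  # first attempt: regression; rows with a missing predictor are left as they are
  impute_lm(Sepal.Length ~ Sepal.Width + Species) %>%
  # fallback: median of Sepal.Length within each Species
  impute_median(Sepal.Length ~ Species)

head(imputed, 10)
```

Sepal.Width is not imputed here, so rows 3 to 7 still show missing values in that column.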

Materials
