All Projects → data-cleaning → editrules

data-cleaning / editrules

Licence: other
R package for handling, checking and enforcing data rules

Programming Languages

r
7636 projects
TeX
3793 projects

Build Status CRAN Downloads

editrules has been succeeded by R packages: validate and errorlocate

editrules

R package for parsing edit rules The editrules package aims to provide an environment to conveniently define, read and check recordwise data constraints including

  • Linear (in)equality constraints for numerical data
  • Constraints on value combinations of categorical data
  • Conditional constraints on numerical and/or mixed data

In literature these constraints, or restrictions are refered to as edits. editrules can perform common rule set manipulations like variable elimination and value substitution, and offers error localization functionality based on the (generalized) paradigm of Fellegi and Holt. Under this paradigm, one determines the smallest (weighted) number of variables to adapt such that no (additional or derived) rules are violated. The paradigm is based on the assumption that errors are distributed randomly over the variables and there is no detectable cause of error. It also decouples the detection of corrupt variables from their correction. For some types of error, such as sign flips, typing errors or rounding errors, this assumption does not hold. These errors can be detected and are closely related to their resolution. The reader is referred to the deducorrect package for treating such errors.

To install the latest version in R:

install.packages("editrules")

To get started, see the editrules vignette.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].