All Projects → zmjones → Edarf

zmjones / Edarf

Licence: mit
exploratory data analysis using random forests

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to Edarf

Inspectdf
🛠️ 📊 Tools for Exploring and Comparing Data Frames
Stars: ✭ 195 (+214.52%)
Mutual labels:  exploratory-data-analysis, rstats
How-to-score-0.8134-in-Titanic-Kaggle-Challenge
Solution of the Titanic Kaggle competition
Stars: ✭ 114 (+83.87%)
Mutual labels:  random-forest, exploratory-data-analysis
Visdat
Preliminary Exploratory Visualisation of Data
Stars: ✭ 377 (+508.06%)
Mutual labels:  exploratory-data-analysis, rstats
Dtupdate
The dtupdate package has functions that try to make it easier to keep up with the non-CRAN universe
Stars: ✭ 51 (-17.74%)
Mutual labels:  rstats
Tl
tldr for R!
Stars: ✭ 52 (-16.13%)
Mutual labels:  rstats
Nodbi
Document DBI connector for R
Stars: ✭ 56 (-9.68%)
Mutual labels:  rstats
Sever
🔪Good-looking problems: customise your Shiny disconnected screen and error messages
Stars: ✭ 60 (-3.23%)
Mutual labels:  rstats
Rdoc
colourised R docs in the terminal
Stars: ✭ 49 (-20.97%)
Mutual labels:  rstats
Sigmajs
Σ sigma.js for R
Stars: ✭ 58 (-6.45%)
Mutual labels:  rstats
Rtimes
R wrapper for NYTimes API for government data - ABANDONED
Stars: ✭ 55 (-11.29%)
Mutual labels:  rstats
Colormap
R package to generate colors from a list of 44 pre-defined palettes
Stars: ✭ 55 (-11.29%)
Mutual labels:  rstats
Ggeconodist
📉 Create Diminutive Distribution Charts
Stars: ✭ 53 (-14.52%)
Mutual labels:  rstats
Drake Examples
Example workflows for the drake R package
Stars: ✭ 57 (-8.06%)
Mutual labels:  rstats
Euclid
Exact Computation Geometry Framework Based on 'CGAL'
Stars: ✭ 52 (-16.13%)
Mutual labels:  rstats
Textdata
Download, parse, store, and load text datasets instead of storing it in packages
Stars: ✭ 59 (-4.84%)
Mutual labels:  rstats
Tpot
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Stars: ✭ 8,378 (+13412.9%)
Mutual labels:  random-forest
Mixomics
Development repository for the Bioconductor package 'mixOmics '
Stars: ✭ 58 (-6.45%)
Mutual labels:  rstats
Vcr
Record HTTP calls and replay them
Stars: ✭ 54 (-12.9%)
Mutual labels:  rstats
Orangetext
🍊📄 : An #rstats project to keep track of The 🍊 One's speeches
Stars: ✭ 53 (-14.52%)
Mutual labels:  rstats
Stock Market Sentiment Analysis
Identification of trends in the stock prices of a company by performing fundamental analysis of the company. News articles were provided as training data-sets to the model which classified the articles as positive or neutral. Sentiment score was computed by calculating the difference between positive and negative words present in the news article. Comparisons were made between the actual stock prices and the sentiment scores. Naive Bayes, OneR and Random Forest algorithms were used to observe the results of the model using Weka
Stars: ✭ 56 (-9.68%)
Mutual labels:  random-forest

DOI status

Functions useful for exploratory data analysis using random forests.

This package extends the functionality of random forests fit by party (multivariate, regression, and classification), randomForestSRC (regression and classification,), randomForest (regression and classification), and ranger (classification and regression).

The subdirectory pkg contains the actual package. The package can be installed with devtools.

devtools::install_github("zmjones/edarf", subdir = "pkg")

Functionality includes:

  • partial_dependence which computes the expected prediction made by the random forest if it were marginalized to only depend on a subset of the features. plot_pd plots the results.
  • variable_importance which computes feature importance for arbitrary loss functions, aggregated across the training data or for individual observations. This may also be used for subsets of the feature space in order to detect interactions.
  • extract_proximity and plot_prox which computes or extracts proximity matrices and plots them using a biplot given a matrix of principal components of said matrix.

If you use the package for research, please cite it.

@article{jones2016,
  doi = {10.21105/joss.00092},
  url = {http://dx.doi.org/10.21105/joss.00092},
  year  = {2016},
  month = {oct},
  publisher = {The Open Journal},
  volume = {1},
  number = {6},
  author = {Zachary M. Jones and Fridolin J. Linder},
  title = {edarf: Exploratory Data Analysis using Random Forests},
  journal = {The Journal of Open Source Software}
}

Pull requests, bug reports, feature requests, etc. are welcome!

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].