All Projects → Mamba413 → Ball

Mamba413 / Ball

Licence: other
Statistical Inference and Sure Independence Screening via Ball Statistics

Programming Languages

c
50402 projects - #5 most used programming language
r
7636 projects
python
139335 projects - #7 most used programming language
C++
36643 projects - #6 most used programming language
Makefile
30231 projects
CMake
9771 projects

Projects that are alternatives of or similar to Ball

CAST
Developer Version of the R package CAST: Caret Applications for Spatio-Temporal models
Stars: ✭ 65 (+195.45%)
Mutual labels:  feature-selection
qbso-fs
Python implementation of QBSO-FS : a Reinforcement Learning based Bee Swarm Optimization metaheuristic for Feature Selection problem.
Stars: ✭ 47 (+113.64%)
Mutual labels:  feature-selection
FIFA-2019-Analysis
This is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (+27.27%)
Mutual labels:  feature-selection
Market-Mix-Modeling
Market Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
Stars: ✭ 31 (+40.91%)
Mutual labels:  feature-selection
PyImpetus
PyImpetus is a Markov Blanket based feature subset selection algorithm that considers features both separately and together as a group in order to provide not just the best set of features but also the best combination of features
Stars: ✭ 83 (+277.27%)
Mutual labels:  feature-selection
adapt
Awesome Domain Adaptation Python Toolbox
Stars: ✭ 46 (+109.09%)
Mutual labels:  feature-selection
laravel-rollout
A package to integrate rollout into your Laravel project.
Stars: ✭ 23 (+4.55%)
Mutual labels:  feature-selection
skrobot
skrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.
Stars: ✭ 22 (+0%)
Mutual labels:  feature-selection
GeneticAlgorithmForFeatureSelection
Search the best feature subset for you classification mode
Stars: ✭ 82 (+272.73%)
Mutual labels:  feature-selection
fsfc
Feature Selection for Clustering
Stars: ✭ 80 (+263.64%)
Mutual labels:  feature-selection
featurewiz
Use advanced feature engineering strategies and select best features from your data set with a single line of code.
Stars: ✭ 229 (+940.91%)
Mutual labels:  feature-selection
arfs
All Relevant Feature Selection
Stars: ✭ 21 (-4.55%)
Mutual labels:  feature-selection
feature engine
Feature engineering package with sklearn like functionality
Stars: ✭ 758 (+3345.45%)
Mutual labels:  feature-selection
zoofs
zoofs is a python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to Evolutionary. It's easy to use , flexible and powerful tool to reduce your feature size.
Stars: ✭ 142 (+545.45%)
Mutual labels:  feature-selection
Reinforcement-Learning-Feature-Selection
Feature selection for maximizing expected cumulative reward
Stars: ✭ 27 (+22.73%)
Mutual labels:  feature-selection
FEAST
A FEAture Selection Toolbox for C/C+, Java, and Matlab/Octave.
Stars: ✭ 67 (+204.55%)
Mutual labels:  feature-selection
GPS
code for "A global pathway selection algorithm for the reduction of detailed chemical kinetic mechanisms" (Gao et al., CNF'16)
Stars: ✭ 18 (-18.18%)
Mutual labels:  feature-selection
GraphOfDocs
GraphOfDocs: Representing multiple documents as a single graph
Stars: ✭ 13 (-40.91%)
Mutual labels:  feature-selection
bess
Best Subset Selection algorithm for Regression, Classification, Count, Survival analysis
Stars: ✭ 14 (-36.36%)
Mutual labels:  feature-selection
Mlr
Machine Learning in R
Stars: ✭ 1,542 (+6909.09%)
Mutual labels:  feature-selection

Ball Statistics

AppVeyor Build Status CRAN Status Badge PyPI version

Introdution

The fundamental problems for data mining, statistical analysis, and machine learning are:

  • whether several distributions are different?
  • whether random variables are dependent?
  • how to pick out useful variables/features from a high-dimensional data?

These issues can be tackled by Ball statistics, which enjoy following admirable advantages:

  • available for most of datasets (e.g., traditional tabular data, brain shape, functional connectome, wind direction and so on)
  • insensitive to outliers, distribution-free and model-free;
  • theoretically guaranteed and computationally efficient.

Softwares

R package

Install the Ball package from CRAN:

install.packages("Ball")

Compared with selective R packages available for datasets in metric spaces:

fastmit energy HHG Ball
Test of equal distributions ✔️ ✔️ ✔️
Test of independence ✔️ ✔️ ✔️ ✔️
Test of joint independence ✔️
Feature screening / Sure Independence Screening (SIS) ✔️
Iterative Feature screening / Iterative SIS ✔️
Datasets in metric spaces ✔️ SNT ✔️ ✔️
Robustness ✔️ ✔️ ✔️
Parallel programming ✔️ ✔️
Computational efficiency 🏃🏃🏃 🏃🏃🏃 🏃🏃 🏃🏃🚶

SNT is the abbreviation of strong negative type.

See the following documents for more details about the Ball package:

Python package

Install the Ball package from PyPI:

pip install Ball

Citation

If you use Ball or reference our vignettes in a presentation or publication, we would appreciate citations of our package.

Zhu J, Pan W, Zheng W, Wang X (2021). “Ball: An R Package for Detecting Distribution Difference and Association in Metric Spaces.” Journal of Statistical Software, 97(6), 1–31. doi: 10.18637/jss.v097.i06.

Here is the corresponding Bibtex entry

@Article{,
  title = {{Ball}: An {R} Package for Detecting Distribution Difference and Association in Metric Spaces},
  author = {Jin Zhu and Wenliang Pan and Wei Zheng and Xueqin Wang},
  journal = {Journal of Statistical Software},
  year = {2021},
  volume = {97},
  number = {6},
  pages = {1--31},
  doi = {10.18637/jss.v097.i06},
}

References

Bug report

Open an issue or send an email to Jin Zhu at [email protected]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].