cokelaer / Fitter

Licence: gpl-3.0
Fit data to many distributions


#############################
FITTER documentation
#############################

.. image:: https://badge.fury.io/py/fitter.svg
    :target: https://pypi.python.org/pypi/fitter

.. image:: https://secure.travis-ci.org/cokelaer/fitter.png
    :target: http://travis-ci.org/cokelaer/fitter

.. image:: https://coveralls.io/repos/cokelaer/fitter/badge.png?branch=master
    :target: https://coveralls.io/r/cokelaer/fitter?branch=master

.. image:: http://readthedocs.org/projects/fitter/badge/?version=latest
    :target: http://fitter.readthedocs.org/en/latest/?badge=latest
    :alt: Documentation Status

.. image:: https://zenodo.org/badge/23078551.svg
    :target: https://zenodo.org/badge/latestdoi/23078551

Compatible with Python 3.6, 3.7, and 3.8 (Travis tests)

What is it ?
################

The fitter package provides a simple class to identify the distribution from which a data sample was generated. It uses 80 distributions from SciPy and lets you plot the results to check which distribution is the most probable and what its best parameters are.

Installation
###################

::

    pip install fitter

fitter is also available on conda (bioconda channel)::

 conda install fitter

And as a singularity file in `damona <https://damona.readthedocs.io>`_::

    pip install damona
    damona install fitter

Usage
##################

standalone
===========

A standalone application (very simple) is also provided and works with input CSV files::

    fitter fitdist data.csv --column-number 1 --distributions gamma,normal

It creates an image file called fitter.png and a log file called fitter.log.

From Python shell
==================

First, let us create a data sample with N = 10,000 points from a gamma distribution::

    from scipy import stats
    data = stats.gamma.rvs(2, loc=1.5, scale=2, size=10000)

.. note:: Fitting is slow, so keep the ``size`` value reasonably small.

Now, without any knowledge of the distribution or its parameters, which distribution fits the data best? SciPy has 80 distributions, and the Fitter class scans all of them, calls the fit function for you, ignores those that fail or run forever, and finally gives you a summary of the best distributions in the sense of the sum of squared errors. This is best shown with an example::

    from fitter import Fitter
    f = Fitter(data)
    f.fit()
    # may take some time since, by default, all distributions are tried,
    # but you can manually provide a smaller set of distributions
    f.summary()

.. image:: http://pythonhosted.org/fitter/_images/index-1.png
    :target: http://pythonhosted.org/fitter/_images/index-1.png
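To make the "sum of squared errors" criterion more concrete, here is a rough sketch of the idea using only NumPy and SciPy. It is not fitter's actual implementation: the helper name ``rank_by_sse``, the choice of 100 histogram bins, and the comparison at bin centres are all assumptions made for illustration.

```python
import numpy as np
from scipy import stats


def rank_by_sse(data, names, bins=100):
    """Rank scipy.stats distributions by sum of squared errors (lowest first).

    Hypothetical helper for illustration only; not part of fitter.
    """
    # Density-normalised histogram, so bar heights are comparable to a PDF.
    heights, edges = np.histogram(data, bins=bins, density=True)
    centers = (edges[:-1] + edges[1:]) / 2.0

    errors = {}
    for name in names:
        dist = getattr(stats, name)
        params = dist.fit(data)           # maximum-likelihood fit
        pdf = dist.pdf(centers, *params)  # fitted PDF at the bin centres
        errors[name] = float(np.sum((pdf - heights) ** 2))
    return sorted(errors.items(), key=lambda item: item[1])


# Same gamma parameters as the example above, with fewer points to keep it fast.
data = stats.gamma.rvs(2, loc=1.5, scale=2, size=5000, random_state=0)
ranking = rank_by_sse(data, ["gamma", "norm", "uniform"])
for name, sse in ranking:
    print(f"{name:<8} {sse:.5f}")
```

The list is sorted by increasing error, so the first entry is the candidate whose fitted density follows the histogram most closely. Since the data are drawn from a gamma distribution, the gamma candidate would be expected to rank well here.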

See the `online <http://fitter.readthedocs.io/>`_ documentation for details.

Changelog
##################

========= ==========================================================================
Version   Description
========= ==========================================================================
1.3.0     * parallel process implemented https://github.com/cokelaer/fitter/pull/25
            thanks to @arsenyinfo 
1.2.3     * remove verbose arguments in Fitter class. Using the logging module
            instead
          * the Fitter.fit has now a progress bar
          * add a standalone application called … fitter (see the doc)
1.2.2     was not released
1.2.1     adding new class called histfit (see documentation)
1.2       * Fixed the version. Previous version switched from 
            1.0.9 to 1.1.11. To start a fresh version, we increase to 1.2.0
          * Merged pull request required by bioconda
          * Merged pull request related to implementation of 
            AIC/BIC/KL criteria (https://github.com/cokelaer/fitter/pull/19). 
            This also fixes https://github.com/cokelaer/fitter/issues/9
          * Implement two functions to get all distributions, or a list of 
            common distributions to help users decreasing computational time
            (https://github.com/cokelaer/fitter/issues/20). Also added a FAQS 
            section.
          * travis tested Python 3.6 and 3.7 (not 3.5 anymore)
1.1       * Fixed deprecated warning
          * fitter is now in readthedocs at fitter.readthedocs.io
1.0.9     * https://github.com/cokelaer/fitter/pull/8 and 11
            PR https://github.com/cokelaer/fitter/pull/8
1.0.6     * summary() now returns the dataframe (instead of printing it)
1.0.5      https://github.com/cokelaer/fitter/issues
1.0.2     add manifest to fix missing source in the pypi repository.
========= ==========================================================================



