All Projects → statsmodels → Statsmodels

statsmodels / Statsmodels

Licence: other
Statsmodels: statistical modeling and econometrics in Python

Programming Languages

python
139335 projects - #7 most used programming language
AGS Script
88 projects
cython
566 projects
HTML
75241 projects
r
7636 projects
matlab
3953 projects

Projects that are alternatives of or similar to Statsmodels

hdfe
No description or website provided.
Stars: ✭ 22 (-99.68%)
Mutual labels:  statistics, econometrics
Probability Theory
A quick introduction to all most important concepts of Probability Theory, only freshman level of mathematics needed as prerequisite.
Stars: ✭ 25 (-99.64%)
Mutual labels:  statistics, econometrics
yt-channels-DS-AI-ML-CS
A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (-85.03%)
Mutual labels:  statistics, data-analysis
Histogram
Fast multi-dimensional generalized histogram with convenient interface for C++14
Stars: ✭ 243 (-96.5%)
Mutual labels:  statistics, data-analysis
Scikit Mobility
scikit-mobility: mobility analysis in Python
Stars: ✭ 339 (-95.11%)
Mutual labels:  statistics, data-analysis
Kdepy
Kernel Density Estimation in Python
Stars: ✭ 244 (-96.48%)
Mutual labels:  statistics, data-analysis
growthbook
Open Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (-66.23%)
Mutual labels:  statistics, data-analysis
Morpheus Core
The foundational library of the Morpheus data science framework
Stars: ✭ 203 (-97.07%)
Mutual labels:  statistics, data-analysis
Xlearn
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Stars: ✭ 2,968 (-57.2%)
Mutual labels:  statistics, data-analysis
NNS
Nonlinear Nonparametric Statistics
Stars: ✭ 26 (-99.63%)
Mutual labels:  statistics, econometrics
Datascience
Curated list of Python resources for data science.
Stars: ✭ 3,051 (-56.01%)
Mutual labels:  statistics, data-analysis
Gonum
Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
Stars: ✭ 5,384 (-22.36%)
Mutual labels:  statistics, data-analysis
Dabest Python
Data Analysis with Bootstrapped ESTimation
Stars: ✭ 231 (-96.67%)
Mutual labels:  statistics, data-analysis
tempo
API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (-96.94%)
Mutual labels:  data-analysis, timeseries-analysis
Tablesaw
Java dataframe and visualization library
Stars: ✭ 2,785 (-59.84%)
Mutual labels:  statistics, data-analysis
Data-Analyst-Nanodegree
Kai Sheng Teh - Udacity Data Analyst Nanodegree
Stars: ✭ 42 (-99.39%)
Mutual labels:  statistics, data-analysis
Collapse
Advanced and Fast Data Transformation in R
Stars: ✭ 184 (-97.35%)
Mutual labels:  statistics, data-analysis
Data Science Live Book
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-97.22%)
Mutual labels:  statistics, data-analysis
fairlens
Identify bias and measure fairness of your data
Stars: ✭ 51 (-99.26%)
Mutual labels:  statistics, data-analysis
R
Exercises (incl. analyses) with R language (math+statistics)
Stars: ✭ 462 (-93.34%)
Mutual labels:  statistics, data-analysis

PyPI Version Conda Version License Azure CI Build Status Codecov Coverage Coveralls Coverage PyPI - Downloads Conda downloads

About statsmodels

statsmodels is a Python package that provides a complement to scipy for statistical computations including descriptive statistics and estimation and inference for statistical models.

Documentation

The documentation for the latest release is at

https://www.statsmodels.org/stable/

The documentation for the development version is at

https://www.statsmodels.org/dev/

Recent improvements are highlighted in the release notes

https://www.statsmodels.org/stable/release/

Backups of documentation are available at https://statsmodels.github.io/stable/ and https://statsmodels.github.io/dev/.

Main Features

  • Linear regression models:
    • Ordinary least squares
    • Generalized least squares
    • Weighted least squares
    • Least squares with autoregressive errors
    • Quantile regression
    • Recursive least squares
  • Mixed Linear Model with mixed effects and variance components
  • GLM: Generalized linear models with support for all of the one-parameter exponential family distributions
  • Bayesian Mixed GLM for Binomial and Poisson
  • GEE: Generalized Estimating Equations for one-way clustered or longitudinal data
  • Discrete models:
    • Logit and Probit
    • Multinomial logit (MNLogit)
    • Poisson and Generalized Poisson regression
    • Negative Binomial regression
    • Zero-Inflated Count models
  • RLM: Robust linear models with support for several M-estimators.
  • Time Series Analysis: models for time series analysis
    • Complete StateSpace modeling framework
      • Seasonal ARIMA and ARIMAX models
      • VARMA and VARMAX models
      • Dynamic Factor models
      • Unobserved Component models
    • Markov switching models (MSAR), also known as Hidden Markov Models (HMM)
    • Univariate time series analysis: AR, ARIMA
    • Vector autoregressive models, VAR and structural VAR
    • Vector error correction model, VECM
    • exponential smoothing, Holt-Winters
    • Hypothesis tests for time series: unit root, cointegration and others
    • Descriptive statistics and process models for time series analysis
  • Survival analysis:
    • Proportional hazards regression (Cox models)
    • Survivor function estimation (Kaplan-Meier)
    • Cumulative incidence function estimation
  • Multivariate:
    • Principal Component Analysis with missing data
    • Factor Analysis with rotation
    • MANOVA
    • Canonical Correlation
  • Nonparametric statistics: Univariate and multivariate kernel density estimators
  • Datasets: Datasets used for examples and in testing
  • Statistics: a wide range of statistical tests
    • diagnostics and specification tests
    • goodness-of-fit and normality tests
    • functions for multiple testing
    • various additional statistical tests
  • Imputation with MICE, regression on order statistic and Gaussian imputation
  • Mediation analysis
  • Graphics includes plot functions for visual analysis of data and model results
  • I/O
    • Tools for reading Stata .dta files, but pandas has a more recent version
    • Table output to ascii, latex, and html
  • Miscellaneous models
  • Sandbox: statsmodels contains a sandbox folder with code in various stages of development and testing which is not considered "production ready". This covers among others
    • Generalized method of moments (GMM) estimators
    • Kernel regression
    • Various extensions to scipy.stats.distributions
    • Panel data models
    • Information theoretic measures

How to get it

The main branch on GitHub is the most up to date code

https://www.github.com/statsmodels/statsmodels

Source download of release tags are available on GitHub

https://github.com/statsmodels/statsmodels/tags

Binaries and source distributions are available from PyPi

https://pypi.org/project/statsmodels/

Binaries can be installed in Anaconda

conda install statsmodels

Installing from sources

See INSTALL.txt for requirements or see the documentation

https://statsmodels.github.io/dev/install.html

Contributing

Contributions in any form are welcome, including:

  • Documentation improvements
  • Additional tests
  • New features to existing models
  • New models

https://www.statsmodels.org/stable/dev/test_notes

for instructions on installing statsmodels in editable mode.

License

Modified BSD (3-clause)

Discussion and Development

Discussions take place on the mailing list

https://groups.google.com/group/pystatsmodels

and in the issue tracker. We are very interested in feedback about usability and suggestions for improvements.

Bug Reports

Bug reports can be submitted to the issue tracker at

https://github.com/statsmodels/statsmodels/issues

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].