AlexIoannides / Pymc Example Project

Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.

Example PyMC3 Project for Bayesian Data Analysis

This notebook contains the code required to conduct a Bayesian data analysis on data collected from a set of multiple-lot online auction events executed in European markets over the course of a year. We will build interpretable models of the average Return-on-Reserve (RoR), using variables that describe various facets of multiple-lot online auction events, e.g. the average number of bidders per lot, the number of lots offered, etc.

The ultimate aim of this endeavour is for this notebook (and its surrounding directory structure) to serve as a template workflow for conducting an end-to-end Bayesian data analysis using PyMC3. It includes many helper functions for automating otherwise tedious tasks (e.g. interacting with Theano to score models on test data) and examples of how data pre-processing can be integrated with Scikit-Learn.

The overall approach has been heavily inspired by the book 'Statistical Rethinking - a Bayesian Course with Examples in R and Stan', by Richard McElreath http://xcelab.net/rm/statistical-rethinking/, which is the most significant book on statistics and modelling that I have read during a career spanning more than a decade. Credit also goes to 'Bayesian Data Analysis', by Andrew Gelman & Co. http://www.stat.columbia.edu/~gelman/book/, which I refer to on an on-going basis.

Project Dependencies

We use pipenv for managing project dependencies and Python environments (i.e. virtual environments). All of the direct package dependencies required to run the code (e.g. NumPy for arrays/tensors and Pandas for DataFrames), as well as all the packages used during development (e.g. IPython and JupyterLab as the chosen development environment), are described in the Pipfile. Their precise downstream dependencies are described in Pipfile.lock.
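To make the role of the lock file concrete, the sketch below lists the pinned versions recorded in a lock file. It assumes the usual Pipfile.lock layout, a JSON document with `default` and `develop` sections. The sample content, the helper name `pinned_versions` and the version numbers are illustrative and are not taken from this project's lock file.

```python
import json

# Illustrative stand-in for a Pipfile.lock. The real file is generated by
# Pipenv; the package versions below are made up for this example.
SAMPLE_LOCK = """
{
  "_meta": {},
  "default": {
    "numpy": {"version": "==1.0.0"},
    "pandas": {"version": "==1.0.0"}
  },
  "develop": {
    "ipython": {"version": "==1.0.0"}
  }
}
"""


def pinned_versions(lock_text, section="default"):
    """Return {package: pinned version} for one section of a lock file."""
    lock = json.loads(lock_text)
    return {
        name: entry.get("version", "unpinned")
        for name, entry in lock.get(section, {}).items()
    }


print(pinned_versions(SAMPLE_LOCK))             # runtime dependencies
print(pinned_versions(SAMPLE_LOCK, "develop"))  # development-only dependencies
```

To inspect a real lock file, the same function could be given the text of Pipfile.lock read from the project's root directory.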

Installing Pipenv

To get started with Pipenv, first install it. Assuming that there is a global version of Python available on your system and on the PATH, this can be achieved by running the following command,

pip3 install pipenv

Pipenv is also available to install from many non-Python package managers. For example, on OS X it can be installed using the Homebrew package manager, with the following terminal command,

brew install pipenv

For more information, including advanced configuration options, see the official pipenv documentation.

Installing this Project's Dependencies

Make sure that you're in the project's root directory (the same one in which the Pipfile resides), and then run,

pipenv install --dev

This will install all of the direct project dependencies as well as the development dependencies (the latter a consequence of the --dev flag).

Running Python, IPython and JupyterLab from the Project's Virtual Environment

In order to continue development in a Python environment that precisely mimics the one the project was initially developed with, use Pipenv from the command line as follows,

pipenv run python3

The python3 command could just as well be ipython3 or JupyterLab, for example,

pipenv run jupyter lab

This will start JupyterLab with a default Python 3 kernel that includes all of the direct and development project dependencies. This is how we recommend using the notebooks within this project.
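A quick way to confirm that a kernel really sees the project's dependencies is to ask the interpreter whether each package can be located. The following is a minimal sketch that could be run in a notebook cell. The helper `missing_packages` is hypothetical, and the import names in `EXPECTED` are assumptions based on the packages mentioned in this README rather than a copy of the Pipfile.

```python
from importlib.util import find_spec

# Import names assumed for the packages this README mentions.
EXPECTED = ["numpy", "pandas", "sklearn", "pymc3", "theano"]


def missing_packages(names):
    """Return the names that the current interpreter cannot locate."""
    # find_spec looks a top-level package up without importing it and
    # returns None when the package is not installed.
    return [name for name in names if find_spec(name) is None]


print(missing_packages(EXPECTED))
```

An empty list suggests that the kernel is running from the Pipenv-managed environment. Any names that are printed point to packages that still need installing.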

Pipenv Shells

Prepending pipenv run to every command that you want to run within your Pipenv-managed virtual environment can get very tedious. This can be avoided by entering a Pipenv-managed shell,

pipenv shell

which is equivalent to 'activating' the virtual environment. Any command will now be executed within the virtual environment. Use exit to leave the shell session.
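Whether a session is actually running inside the virtual environment can be checked from Python itself. This is a minimal sketch that relies on the standard `sys.prefix` and `sys.base_prefix` attributes, which differ inside environments created by `venv` and recent versions of `virtualenv`. The helper name `in_virtualenv` is hypothetical, and very old `virtualenv` releases may not be detected this way.

```python
import sys


def in_virtualenv():
    """Return True when the interpreter runs inside a virtual environment."""
    # Inside a virtual environment sys.prefix points at the environment,
    # while sys.base_prefix points at the Python installation it was built from.
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


print("virtual environment active:", in_virtualenv())
print("interpreter:", sys.executable)
```

Running this after pipenv shell, or via pipenv run python3, should report an interpreter path that sits inside the Pipenv-managed environment.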
