All Projects → CamDavidsonPilon → Lifetimes

CamDavidsonPilon / Lifetimes

Licence: mit
Lifetime value in Python

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Lifetimes

Learn Julia The Hard Way
Learn Julia the hard way!
Stars: ✭ 679 (-37.25%)
Mutual labels:  data-science, statistics
25daysinmachinelearning
I will update this repository to learn Machine learning with python with statistics content and materials
Stars: ✭ 53 (-95.1%)
Mutual labels:  data-science, statistics
Statistical Rethinking With Python And Pymc3
Python/PyMC3 port of the examples in " Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath
Stars: ✭ 713 (-34.1%)
Mutual labels:  data-science, statistics
Smile
Statistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (+400.18%)
Mutual labels:  data-science, statistics
Mlj.jl
A Julia machine learning framework
Stars: ✭ 982 (-9.24%)
Mutual labels:  data-science, statistics
Boltons
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Stars: ✭ 5,671 (+424.12%)
Mutual labels:  data-science, statistics
Looper
A resource list for causality in statistics, data science and physics
Stars: ✭ 23 (-97.87%)
Mutual labels:  data-science, statistics
Edward
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
Stars: ✭ 4,674 (+331.98%)
Mutual labels:  data-science, statistics
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+669.78%)
Mutual labels:  data-science, statistics
Socrat
A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-97.6%)
Mutual labels:  data-science, statistics
Imbalanced Learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+419.13%)
Mutual labels:  data-science, statistics
Ppd599
USC urban data science course series with Python and Jupyter
Stars: ✭ 1,062 (-1.85%)
Mutual labels:  data-science, statistics
Probabilistic Programming And Bayesian Methods For Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Stars: ✭ 23,912 (+2109.98%)
Mutual labels:  data-science, statistics
Cracking The Data Science Interview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Stars: ✭ 672 (-37.89%)
Mutual labels:  data-science, statistics
Facebook data analyzer
Analyze facebook copy of your data with ruby language. Download zip file from facebook and get info about friends ranking by message, vocabulary, contacts, friends added statistics and more
Stars: ✭ 515 (-52.4%)
Mutual labels:  data-science, statistics
Awesome Python Data Science
Probably the best curated list of data science software in Python.
Stars: ✭ 812 (-24.95%)
Mutual labels:  data-science, statistics
Teaching
Teaching Materials for Dr. Waleed A. Yousef
Stars: ✭ 435 (-59.8%)
Mutual labels:  data-science, statistics
Dataframe Go
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Stars: ✭ 487 (-54.99%)
Mutual labels:  data-science, statistics
Blogr
Scripts + data to recreate analyses published on http://benjaminlmoore.wordpress.com and http://blm.io
Stars: ✭ 23 (-97.87%)
Mutual labels:  data-science, statistics
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (-0.55%)
Mutual labels:  data-science, statistics

Measuring users is hard. Lifetimes makes it easy.

Inactively Maintained PyPI version Documentation Status Build Status Coverage Status

Read me first: Latest on the lifetimes project

👋 This project has moved to "maintenance-mode". I won't be adding new features, improvements, or even answering issues on this project (but perhaps the occasional bug fix). Why? I don't use lifetimes anymore, nor do I keep up with the literature around RFM. I would love to see a successor library that elevates RFM even further (and please include covariates!)

Introduction

Lifetimes can be used to analyze your users based on a few assumption:

  1. Users interact with you when they are "alive".
  2. Users under study may "die" after some period of time.

I've quoted "alive" and "die" as these are the most abstract terms: feel free to use your own definition of "alive" and "die" (they are used similarly to "birth" and "death" in survival analysis). Whenever we have individuals repeating occurrences, we can use Lifetimes to help understand user behaviour.

Applications

If this is too abstract, consider these applications:

  • Predicting how often a visitor will return to your website. (Alive = visiting. Die = decided the website wasn't for them)
  • Understanding how frequently a patient may return to a hospital. (Alive = visiting. Die = maybe the patient moved to a new city, or became deceased.)
  • Predicting individuals who have churned from an app using only their usage history. (Alive = logins. Die = removed the app)
  • Predicting repeat purchases from a customer. (Alive = actively purchasing. Die = became disinterested with your product)
  • Predicting the lifetime value of your customers

Specific Application: Customer Lifetime Value

As emphasized by P. Fader and B. Hardie, understanding and acting on customer lifetime value (CLV) is the most important part of your business's sales efforts. And (apparently) everyone is doing it wrong (Prof. Fader's Video Lecture). Lifetimes is a Python library to calculate CLV for you.

Installation

pip install lifetimes

Contributing

Please refer to the Contributing Guide before creating any Pull Requests. It will make life easier for everyone.

Documentation and tutorials

Official documentation

Questions? Comments? Requests?

Please create an issue in the lifetimes repository.

Main Articles

  1. Probably, the seminal article of Non-Contractual CLV is Counting Your Customers: Who Are They and What Will They Do Next?, by David C. Schmittlein, Donald G. Morrison and Richard Colombo. Despite it being paid, it is worth the read. The relevant information will eventually end up in this library's documentation though.
  2. The other (more recent) paper is “Counting Your Customers” the Easy Way: An Alternative to the Pareto/NBD Model, by Peter Fader, Bruce Hardie and Ka Lok Lee.

More Information

  1. Roberto Medri did a nice presentation on CLV at Etsy.
  2. Papers, lots of papers.
  3. R implementation is called BTYD (Buy 'Til You Die).
  4. Bruce Hardie's Website, especially his notes, is full of useful and essential explanations, many of which are featured in this library.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].