Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.

Stars: ✭ 5,671 (+424.12%)

Mutual labels: data-science, statistics

Looper

A resource list for causality in statistics, data science and physics

Stars: ✭ 23 (-97.87%)

Mutual labels: data-science, statistics

Edward

A probabilistic programming language in TensorFlow. Deep generative models, variational inference.

Stars: ✭ 4,674 (+331.98%)

Mutual labels: data-science, statistics

Pandas Profiling

Create HTML profiling reports from pandas DataFrame objects

Stars: ✭ 8,329 (+669.78%)

Mutual labels: data-science, statistics

Socrat

A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization

Stars: ✭ 26 (-97.6%)

Mutual labels: data-science, statistics

Imbalanced Learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

Stars: ✭ 5,617 (+419.13%)

Mutual labels: data-science, statistics

Ppd599

USC urban data science course series with Python and Jupyter

Stars: ✭ 1,062 (-1.85%)

Mutual labels: data-science, statistics

Probabilistic Programming And Bayesian Methods For Hackers

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

Stars: ✭ 23,912 (+2109.98%)

Mutual labels: data-science, statistics

Cracking The Data Science Interview

A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

Stars: ✭ 672 (-37.89%)

Mutual labels: data-science, statistics

Facebook data analyzer

Analyze facebook copy of your data with ruby language. Download zip file from facebook and get info about friends ranking by message, vocabulary, contacts, friends added statistics and more

Stars: ✭ 515 (-52.4%)

Mutual labels: data-science, statistics

Awesome Python Data Science

Probably the best curated list of data science software in Python.

Stars: ✭ 812 (-24.95%)

Mutual labels: data-science, statistics

Teaching

Teaching Materials for Dr. Waleed A. Yousef

Stars: ✭ 435 (-59.8%)

Mutual labels: data-science, statistics

Dataframe Go

DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration

Stars: ✭ 487 (-54.99%)

Mutual labels: data-science, statistics

Blogr

Scripts + data to recreate analyses published on http://benjaminlmoore.wordpress.com and http://blm.io

Stars: ✭ 23 (-97.87%)

Mutual labels: data-science, statistics

Pycm

Multi-class confusion matrix library in Python

Stars: ✭ 1,076 (-0.55%)

Mutual labels: data-science, statistics

View All Similar Projects ➔

Measuring users is hard. Lifetimes makes it easy.

Read me first: Latest on the lifetimes project

👋 This project has moved to "maintenance-mode". I won't be adding new features, improvements, or even answering issues on this project (but perhaps the occasional bug fix). Why? I don't use lifetimes anymore, nor do I keep up with the literature around RFM. I would love to see a successor library that elevates RFM even further (and please include covariates!)

Introduction

Lifetimes can be used to analyze your users based on a few assumption:

Users interact with you when they are "alive".
Users under study may "die" after some period of time.

I've quoted "alive" and "die" as these are the most abstract terms: feel free to use your own definition of "alive" and "die" (they are used similarly to "birth" and "death" in survival analysis). Whenever we have individuals repeating occurrences, we can use Lifetimes to help understand user behaviour.

Applications

If this is too abstract, consider these applications:

Predicting how often a visitor will return to your website. (Alive = visiting. Die = decided the website wasn't for them)
Understanding how frequently a patient may return to a hospital. (Alive = visiting. Die = maybe the patient moved to a new city, or became deceased.)
Predicting individuals who have churned from an app using only their usage history. (Alive = logins. Die = removed the app)
Predicting repeat purchases from a customer. (Alive = actively purchasing. Die = became disinterested with your product)
Predicting the lifetime value of your customers

Specific Application: Customer Lifetime Value

As emphasized by P. Fader and B. Hardie, understanding and acting on customer lifetime value (CLV) is the most important part of your business's sales efforts. And (apparently) everyone is doing it wrong (Prof. Fader's Video Lecture). Lifetimes is a Python library to calculate CLV for you.

Installation

pip install lifetimes

Contributing

Please refer to the Contributing Guide before creating any Pull Requests. It will make life easier for everyone.

Documentation and tutorials

Official documentation

Questions? Comments? Requests?

Please create an issue in the lifetimes repository.

Main Articles

Probably, the seminal article of Non-Contractual CLV is Counting Your Customers: Who Are They and What Will They Do Next?, by David C. Schmittlein, Donald G. Morrison and Richard Colombo. Despite it being paid, it is worth the read. The relevant information will eventually end up in this library's documentation though.
The other (more recent) paper is “Counting Your Customers” the Easy Way: An Alternative to the Pareto/NBD Model, by Peter Fader, Bruce Hardie and Ka Lok Lee.

More Information

Roberto Medri did a nice presentation on CLV at Etsy.
Papers, lots of papers.
R implementation is called BTYD (Buy 'Til You Die).
Bruce Hardie's Website, especially his notes, is full of useful and essential explanations, many of which are featured in this library.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 1,082

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (148) 🔗