retentioneering / Retentioneering Tools

Licence: other

Retentioneering: product analytics, data-driven customer journey map optimization, marketing analytics, web analytics, transaction analytics, graph visualization, and behavioral segmentation with customer segments in Python. Open-source analytics, predictive analytics over clickstream, sentiment analysis, A/B tests, machine learning, and Markov chain Monte Carlo simulations, extending Pandas, NetworkX, and Scikit-learn.

Programming Languages

python

Labels

machine-learning library data-visualization pandas segmentation machinelearning business-intelligence predictive-modeling web-analytics

Projects that are alternatives of or similar to Retentioneering Tools

Edaviz

edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab

Stars: ✭ 220 (-24.4%)

Mutual labels: pandas, data-visualization

Datacamp Python Data Science Track

All the slides, accompanying code and exercises all stored in this repo. 🎈

Stars: ✭ 250 (-14.09%)

Mutual labels: pandas, machinelearning

Deepgraph

Analyze Data with Pandas-based Networks. Documentation:

Stars: ✭ 232 (-20.27%)

Mutual labels: pandas, data-visualization

Py Quantmod

Powerful financial charting library based on R's Quantmod | http://py-quantmod.readthedocs.io/en/latest/

Stars: ✭ 155 (-46.74%)

Mutual labels: pandas, data-visualization

Data-Scientist-In-Python

This repository contains notes and projects of Data scientist track from dataquest course work.

Stars: ✭ 23 (-92.1%)

Mutual labels: pandas, machinelearning

Dtale

Visualizer for pandas data structures

Stars: ✭ 2,864 (+884.19%)

Mutual labels: pandas, data-visualization

Code

Compilation of R and Python programming codes on the Data Professor YouTube channel.

Stars: ✭ 287 (-1.37%)

Mutual labels: pandas, machinelearning

Dat8

General Assembly's 2015 Data Science course in Washington, DC

Stars: ✭ 1,516 (+420.96%)

Mutual labels: pandas, data-visualization

fal

do more with dbt. fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.

Stars: ✭ 567 (+94.85%)

Mutual labels: pandas, machinelearning

Data-Science-Resources

A guide to getting started with Data Science and ML.

Stars: ✭ 17 (-94.16%)

Mutual labels: pandas, machinelearning

Dtale Desktop

Build a data visualization dashboard with simple snippets of python code

Stars: ✭ 128 (-56.01%)

Mutual labels: pandas, data-visualization

Data Science Hacks

Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.

Stars: ✭ 273 (-6.19%)

Mutual labels: pandas, data-visualization

Data Science For Marketing Analytics

Achieve your marketing goals with the data analytics power of Python

Stars: ✭ 127 (-56.36%)

Mutual labels: pandas, data-visualization

Dexplot

Simple plotting library that wraps Matplotlib and integrated with DataFrames

Stars: ✭ 208 (-28.52%)

Mutual labels: pandas, data-visualization

Pbpython

Code, Notebooks and Examples from Practical Business Python

Stars: ✭ 1,724 (+492.44%)

Mutual labels: pandas, data-visualization

Orange3

🍊 📊 💡 Orange: Interactive data analysis

Stars: ✭ 3,152 (+983.16%)

Mutual labels: pandas, data-visualization

Sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

Stars: ✭ 1,851 (+536.08%)

Mutual labels: pandas, data-visualization

Seaborn Tutorial

This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.

Stars: ✭ 114 (-60.82%)

Mutual labels: pandas, data-visualization

Missingno

Missing data visualization module for Python.

Stars: ✭ 3,019 (+937.46%)

Mutual labels: pandas, data-visualization

prosto

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

Stars: ✭ 54 (-81.44%)

Mutual labels: pandas, business-intelligence

What is Retentioneering?

Retentioneering is a Python framework and library that helps product and marketing analysts process and analyze clickstreams, event streams, trajectories, and event logs. You can segment users or clients (agents) and build ML pipelines that predict an agent's category or the probability of a target event from historical data.

In a common scenario, you start with raw data from a Google Analytics BigQuery stream, or any similar stream of events with timestamps for each user, and use Retentioneering to explore user behavior in that data. It can reveal many more insights than funnel analytics because it automatically builds behavioral segments and their patterns, highlighting which events and patterns affect your conversion rates, retention, and revenue.

Retentioneering extends Pandas, NetworkX, and Scikit-learn for in-depth processing of event-sequence data. Specifically, it provides an environment for in-depth analysis of customer journey maps, bringing behavior-driven user segmentation and machine learning pipelines to product analytics.

The most recent release is Retentioneering 2.0.0. It brings major updates over 1.0.x and is not backward compatible with previous releases because of major syntax changes. These improvements provide the architecture and a solid foundation for further updates and rapid development of analytical tools. Please update, leave your feedback, and stay tuned.

Changelog

This is the new major release, Retentioneering 2.0. The changelog is available here.

Complete documentation is available here.

Installation

Option 1. Run directly in Google Colab. Open Google Colab and click File -> “New notebook”. In a code cell, run the following to install Retentioneering (the same command also works in a Jupyter notebook):

!pip3 install retentioneering

Option 2. Install Retentioneering from PyPI:

pip3 install retentioneering

Option 3. Install Retentioneering directly from the source:

git clone https://github.com/retentioneering/retentioneering-tools
cd retentioneering-tools
python3 setup.py install
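
Whichever option you choose, it can help to confirm that the package is visible to the interpreter you are actually running. The snippet below is a minimal sketch that uses only the Python standard library (Python 3.8+); the helper name installed_version is hypothetical and is not part of Retentioneering.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of `package`, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Prints a version string if the install succeeded, otherwise None.
print(installed_version("retentioneering"))
```

If this prints None inside a notebook, the install most likely went to a different Python environment than the one the notebook kernel uses.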

Quick start

Start using Retentioneering for clickstream analysis, or open this notebook directly in Google Colab to run it with sample data.

Suggested first steps:

import retentioneering

# load sample user behavior data as a pandas dataframe:
data = retentioneering.datasets.load_simple_shop()

# update config to pass column names:
retentioneering.config.update({
    'user_col': 'user_id',
    'event_col': 'event',
    'event_time_col': 'timestamp',
})

Above, we imported a sample dataset: a regular pandas dataframe containing raw user behavior data from a hypothetical website or app, as a sequence of records {'user_id', 'event', 'timestamp'}. We then passed those column names to retentioneering.config. Now, let's plot the graph to visualize user behavior from the dataset (read more about graphs here):

data.rete.plot_graph(norm_type='node',
                     weight_col='user_id',
                     thresh=0.2,
                     targets={'payment_done': 'green',
                              'lost': 'red'})

Here we obtain a high-level graph of user activity, where the weight of edge A --> B is the percentage of users who transition to event B out of all users who reached event A. (Note that edges with small weights are thresholded to avoid visual clutter; read more in the documentation.)
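
To make the edge weight definition concrete, here is a minimal pandas sketch that computes, for each edge A --> B, the share of users who moved from A to B among all users who reached A. It is an illustration on made-up data, not the library's implementation; the function name edge_weights and the toy dataframe are assumptions, and the exact thresholding rule used by plot_graph may differ from the simple filter shown at the end.

```python
import pandas as pd

# Toy clickstream in the same three-column format (values are made up).
events = pd.DataFrame({
    "user_id": [1, 1, 1, 2, 2, 3, 3, 3],
    "event": ["main", "catalog", "cart",
              "main", "catalog",
              "main", "cart", "payment_done"],
    "timestamp": pd.to_datetime([
        "2020-01-01 10:00", "2020-01-01 10:01", "2020-01-01 10:02",
        "2020-01-01 11:00", "2020-01-01 11:05",
        "2020-01-01 12:00", "2020-01-01 12:03", "2020-01-01 12:04",
    ]),
})


def edge_weights(df, user_col="user_id", event_col="event",
                 time_col="timestamp"):
    """Share of users going A --> B among all users who reached A."""
    df = df.sort_values([user_col, time_col])
    # Pair every event with the next event of the same user.
    next_event = df.groupby(user_col)[event_col].shift(-1)
    pairs = df.assign(next_event=next_event).dropna(subset=["next_event"])
    # Unique users per edge, and unique users per source event.
    users_on_edge = pairs.groupby([event_col, "next_event"])[user_col].nunique()
    users_at_source = df.groupby(event_col)[user_col].nunique()
    sources = users_on_edge.index.get_level_values(event_col)
    weights = users_on_edge / users_at_source.reindex(sources).to_numpy()
    return weights.rename("weight")


weights = edge_weights(events)
# Keep only edges at or above a threshold (an assumed, simplified rule).
visible = weights[weights >= 0.4]
print(visible)
```

In this toy data all three users reach main and two of them continue to catalog, so the main --> catalog edge gets a weight of 2/3, while main --> cart (1/3) falls below the threshold and is hidden.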

To automatically find distinct behavioral patterns we can cluster users from the dataset based on their behavior (read more about behavioral clustering here):

data.rete.get_clusters(method='kmeans',
                       n_clusters=8,
                       ngram_range=(1,2),
                       plot_type='cluster_bar',
                       targets=['payment_done','cart']);
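
The ngram_range=(1,2) argument suggests that each user's event sequence is described by counts of single events and of pairs of consecutive events before clustering. The sketch below illustrates that idea in plain Python. It assumes the argument follows the usual n-gram convention; the helper event_ngrams is hypothetical, and the features the library actually builds may differ.

```python
from collections import Counter


def event_ngrams(sequence, ngram_range=(1, 2)):
    """Count n-grams of consecutive events for n in the given range."""
    lo, hi = ngram_range
    counts = Counter()
    for n in range(lo, hi + 1):
        for i in range(len(sequence) - n + 1):
            counts[" ".join(sequence[i:i + n])] += 1
    return counts


# One user's ordered events (made-up example).
features = event_ngrams(["main", "catalog", "main", "cart"])
print(features)
```

Users whose count vectors look alike would then end up close to each other in the feature space that a method such as k-means clusters.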

Users with similar behavior are grouped in the same cluster. A cluster with a low conversion rate can point to a systematic problem in the product: a specific behavior pattern that does not lead to product goals. The resulting user segments can be explored further to understand the problematic pattern. In the example above, for instance, cluster 4 has a low conversion rate to purchase but a high conversion rate to cart visits. To look closer, we can filter that cluster and plot its graph:

clus_4 = data.rete.filter_cluster(4)
clus_4.rete.plot_graph(thresh=0.1,
                       weight_col='user_id',
                       targets={'lost': 'red',
                                'payment_done': 'green'})
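
The per-cluster conversion rates discussed above can be sketched with plain pandas: for each cluster, take the share of its users who reached each target event at least once. This is an illustration under stated assumptions, not the library's code. The cluster labels are invented, and conversion_by_cluster is a hypothetical helper.

```python
import pandas as pd

# Toy event log (made up); timestamps are omitted because order is not needed.
events = pd.DataFrame({
    "user_id": [1, 1, 2, 2, 3, 3, 4],
    "event": ["main", "cart", "main", "payment_done", "cart", "lost", "main"],
})

# Hypothetical cluster label for each user (index = user_id).
clusters = pd.Series({1: 0, 2: 1, 3: 0, 4: 1}, name="cluster")


def conversion_by_cluster(df, clusters, targets,
                          user_col="user_id", event_col="event"):
    """Share of users in each cluster who reached each target event."""
    reached = pd.DataFrame(
        {
            # True for users with at least one occurrence of the target.
            t: clusters.index.isin(df.loc[df[event_col] == t, user_col])
            for t in targets
        },
        index=clusters.index,
    )
    return reached.groupby(clusters).mean()


rates = conversion_by_cluster(events, clusters, ["payment_done", "cart"])
print(rates)
```

In this toy example cluster 0 resembles the pattern described above: every user in it visits the cart, but none completes a payment.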

To explore more features, please see the documentation.

Step-by-step guides

Contributing

This is a community-driven open-source project in active development. Contributions, new ideas, bug reports, bug fixes, and documentation improvements are all very welcome.

Retentioneering now provides several open-source solutions for data-driven product analytics and web analytics. Please check out this repository for a JS library that tracks mutations of website elements: https://github.com/retentioneering/retentioneering-dom-observer

Apps are better with math! :) Retentioneering is a research laboratory, an analytics methodology, and a set of open-source tools founded by Maxim Godzi and Anatoly Zaytsev in 2015. Please feel free to contact us at [email protected] if you have any questions regarding this repo.

Stars: ✭ 291
