All Projects → amphibian-dev → Toad

amphibian-dev / Toad

Licence: mit
ESC Team's scorecard tools

Programming Languages

python
139335 projects - #7 most used programming language
python3
1442 projects

Projects that are alternatives of or similar to Toad

Python practice of data analysis and mining
《Python数据分析与挖掘实战》随书源码与数据
Stars: ✭ 172 (-16.91%)
Mutual labels:  data-analysis
Gradio
Create UIs for your machine learning model in Python in 3 minutes
Stars: ✭ 4,358 (+2005.31%)
Mutual labels:  data-analysis
Tad
A desktop application for viewing and analyzing tabular data
Stars: ✭ 2,275 (+999.03%)
Mutual labels:  data-analysis
Ida
Introduction to Data Analysis, using R (2013)
Stars: ✭ 180 (-13.04%)
Mutual labels:  data-analysis
Redata
Monitoring system for data teams. Computing health checks on data, visualizing and alerting on them.
Stars: ✭ 181 (-12.56%)
Mutual labels:  data-analysis
Zebras
Data analysis library for JavaScript built with Ramda
Stars: ✭ 192 (-7.25%)
Mutual labels:  data-analysis
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-17.39%)
Mutual labels:  data-analysis
Awkward 1.0
Manipulate JSON-like data with NumPy-like idioms.
Stars: ✭ 203 (-1.93%)
Mutual labels:  data-analysis
Volbx
Graphical tool for data manipulation written in C++/Qt
Stars: ✭ 187 (-9.66%)
Mutual labels:  data-analysis
Data Science Notebook
📖 每一个伟大的思想和行动都有一个微不足道的开始
Stars: ✭ 196 (-5.31%)
Mutual labels:  data-analysis
Collapse
Advanced and Fast Data Transformation in R
Stars: ✭ 184 (-11.11%)
Mutual labels:  data-analysis
Goaccess
GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
Stars: ✭ 14,096 (+6709.66%)
Mutual labels:  data-analysis
Data Science Live Book
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-6.76%)
Mutual labels:  data-analysis
Matplotlib Doc Zh
📖 [译] Matplotlib 用户指南
Stars: ✭ 178 (-14.01%)
Mutual labels:  data-analysis
Discovery
Frontend framework for rapid data (JSON) analysis, sharable serverless reports and dashboards
Stars: ✭ 199 (-3.86%)
Mutual labels:  data-analysis
Eegrunt
A Collection Python EEG (+ ECG) Analysis Utilities for OpenBCI and Muse
Stars: ✭ 171 (-17.39%)
Mutual labels:  data-analysis
Klib
Easy to use Python library of customized functions for cleaning and analyzing data.
Stars: ✭ 192 (-7.25%)
Mutual labels:  data-analysis
Morpheus Core
The foundational library of the Morpheus data science framework
Stars: ✭ 203 (-1.93%)
Mutual labels:  data-analysis
Python Novice Inflammation
Programming with Python
Stars: ✭ 199 (-3.86%)
Mutual labels:  data-analysis
Fuck illness
写疾病数据分析用到的所有东西
Stars: ✭ 195 (-5.8%)
Mutual labels:  data-analysis

TOAD

PyPi version Python version Build Status Downloads Status

Toad is dedicated to facilitating model development process, especially for a scorecard. It provides intuitive functions of the entire process, from EDA, feature engineering and selection etc. to results validation and scorecard transformation. Its key functionality streamlines the most critical and time-consuming process such as feature selection and fine binning.

Toad 是专为工业界模型开发设计的Python工具包,特别针对评分卡的开发。Toad 的功能覆盖了建模全流程,从 EDA、特征工程、特征筛选 到 模型验证和评分卡转化。Toad 的主要功能极大简化了建模中最重要最费时的流程,即特征筛选和分箱。

Install and Upgrade · 安装与升级

Pip

pip install toad # to install
pip install -U toad # to upgrade

Conda

conda install toad --channel conda-forge # to install
conda install -U toad --channel conda-forge # to upgrade

Source code

python setup.py install

Key features · 主要功能

The following showcases some of the most popular features of toad, for more detailed demonstrations and user guidance, please refer to the tutorials.

以下部分简单介绍了toad最受欢迎的一些功能,具体的使用方法和使用教程,请详见文档部分。

  • Simple IV calculation for all features · 一键算IV:
toad.quality(data,'target',iv_only=True)
  • Preliminary selection based on criteria · 根据特定条件的初步变量筛选;
  • and stepwise feature selection (with optimised algorithm) · 优化过的逐步回归:
selected_data = toad.selection.select(data,target = 'target', empty = 0.5, iv = 0.02, corr = 0.7, return_drop=True, exclude=['ID','month'])

final_data = toad.selection.stepwise(data_woe,target = 'target', estimator='ols', direction = 'both', criterion = 'aic', exclude = to_drop)
  • Reliable fine binning with visualisation · 分箱及可视化:
# Chi-squared fine binning
c = toad.transform.Combiner()
c.fit(data_selected.drop(to_drop, axis=1), y = 'target', method = 'chi', min_samples = 0.05) 
print(c.export())

# Visualisation to check binning results 
col = 'feature_name'
bin_plot(c.transform(data_selected[[col,'target']], labels=True), x=col, target='target')
  • Intuitive model results presentation · 模型结果展示:
toad.metrics.KS_bucket(pred_proba, final_data['target'], bucket=10, method = 'quantile')
  • One-click scorecard transformation · 评分卡转化:
card = toad.ScoreCard(
    combiner = c,
    transer = transer,
    class_weight = 'balanced',
    C=0.1,
    base_score = 600,
    base_odds = 35 ,
    pdo = 60,
    rate = 2
)

card.fit(final_data[col], final_data['target'])
print(card.export())

Documents · 文档

Community · 社区

We welcome public feedback and new PRs. We hold a WeChat group for questions and suggestions.

欢迎各位提PR,同时我们有toad使用交流的微信群,欢迎询问加群。


Dedicated by The ESC Team

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].