All Projects → data-describe → Data Describe

data-describe / Data Describe

Licence: other
data⎰describe: Pythonic EDA Accelerator for Data Science

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Data Describe

Complete Life Cycle Of A Data Science Project
Complete-Life-Cycle-of-a-Data-Science-Project
Stars: ✭ 140 (-47.96%)
Mutual labels:  analysis, data-science, exploratory-data-analysis, eda
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+588.1%)
Mutual labels:  data-science, exploratory-data-analysis, eda
Dapy
Easy-to-use data analysis / manipulation framework for humans
Stars: ✭ 523 (+94.42%)
Mutual labels:  analysis, data-science, pypi
100 Days Of Ml Code
A day to day plan for this challenge. Covers both theoritical and practical aspects
Stars: ✭ 172 (-36.06%)
Mutual labels:  data-science, exploratory-data-analysis, eda
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+2996.28%)
Mutual labels:  data-science, exploratory-data-analysis, eda
Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+2059.11%)
Mutual labels:  data-science, exploratory-data-analysis, eda
Dataprep
DataPrep — The easiest way to prepare data in Python
Stars: ✭ 639 (+137.55%)
Mutual labels:  data-science, exploratory-data-analysis, eda
Xda
R package for exploratory data analysis
Stars: ✭ 112 (-58.36%)
Mutual labels:  data-science, exploratory-data-analysis
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+336.8%)
Mutual labels:  data-science, eda
Finviz
Unofficial API for finviz.com
Stars: ✭ 493 (+83.27%)
Mutual labels:  analysis, pypi
Multiqc
Aggregate results from bioinformatics analyses across many samples into a single report.
Stars: ✭ 708 (+163.2%)
Mutual labels:  analysis, pypi
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Stars: ✭ 63 (-76.58%)
Mutual labels:  analysis, data-science
Dream3d
Data Analysis program and framework for materials science data analytics, based on the managing framework SIMPL framework.
Stars: ✭ 73 (-72.86%)
Mutual labels:  analysis, data-science
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-59.48%)
Mutual labels:  data-science, exploratory-data-analysis
Kaggle Competitions
There are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (-68.03%)
Mutual labels:  data-science, exploratory-data-analysis
Sparkora
Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-81.04%)
Mutual labels:  exploratory-data-analysis, eda
Audioowl
Fast and simple music and audio analysis using RNN in Python 🕵️‍♀️ 🥁
Stars: ✭ 151 (-43.87%)
Mutual labels:  analysis, pypi
leila
Librería para la evaluación de calidad de datos, e interacción con el portal de datos.gov.co
Stars: ✭ 56 (-79.18%)
Mutual labels:  exploratory-data-analysis, eda
Lux
Python API for Intelligent Visual Data Discovery
Stars: ✭ 787 (+192.57%)
Mutual labels:  data-science, exploratory-data-analysis
Exploratory Data Analysis Visualization Python
Data analysis and visualization with PyData ecosystem: Pandas, Matplotlib Numpy, and Seaborn
Stars: ✭ 78 (-71%)
Mutual labels:  exploratory-data-analysis, eda

PyPI status PyPI license Downloads

PyPI version shields.io PyPI pyversions codecov

data ⎰ describe

data-describe is a Python toolkit for Exploratory Data Analysis (EDA). It aims to accelerate data exploration and analysis by providing automated and polished analysis widgets.

For more examples of data-describe in action, see the Quick Start Tutorial.

Main Features

data-describe implements the following basic features:

Feature Description
Data Summary Curated data summary
Data Heatmap Data variation and missingness heatmap
Correlation Matrix Correlation heatmaps with categorical support
Distribution Plots Generate histograms, violin plots, bar charts
Scatterplots Generate scatterplots and evaluate with scatterplot diagnostics
Cluster Analysis Automated clustering and plotting
Feature Ranking Evaluate feature importance using tree models

Extended Features

data-describe is always looking to elevate the standard for Exploratory Data Analysis. Here are just a few that are implemented:

  • Dimensionality Reduction Methods
  • Sensitive Data (PII) Redaction
  • Text Pre-processing / Topic Modeling
  • Big Data Support

Installation

data-describe can be installed using pip:

pip install data-describe

Getting Started

import data_describe as dd
help(dd)

See the User Guide for more information.

Project Status

data-describe is currently in beta status.

Contributing

data-describe welcomes contributions from the community.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].