data-describe / Data Describe
Licence: other
data⎰describe: Pythonic EDA Accelerator for Data Science
Stars: ✭ 269
Programming Languages
python
139335 projects - #7 most used programming language
Projects that are alternatives of or similar to Data Describe
Complete Life Cycle Of A Data Science Project
Complete-Life-Cycle-of-a-Data-Science-Project
Stars: ✭ 140 (-47.96%)
Mutual labels: analysis, data-science, exploratory-data-analysis, eda
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+588.1%)
Mutual labels: data-science, exploratory-data-analysis, eda
Dapy
Easy-to-use data analysis / manipulation framework for humans
Stars: ✭ 523 (+94.42%)
Mutual labels: analysis, data-science, pypi
100 Days Of Ml Code
A day to day plan for this challenge. Covers both theoritical and practical aspects
Stars: ✭ 172 (-36.06%)
Mutual labels: data-science, exploratory-data-analysis, eda
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+2996.28%)
Mutual labels: data-science, exploratory-data-analysis, eda
Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+2059.11%)
Mutual labels: data-science, exploratory-data-analysis, eda
Dataprep
DataPrep — The easiest way to prepare data in Python
Stars: ✭ 639 (+137.55%)
Mutual labels: data-science, exploratory-data-analysis, eda
Xda
R package for exploratory data analysis
Stars: ✭ 112 (-58.36%)
Mutual labels: data-science, exploratory-data-analysis
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+336.8%)
Mutual labels: data-science, eda
Multiqc
Aggregate results from bioinformatics analyses across many samples into a single report.
Stars: ✭ 708 (+163.2%)
Mutual labels: analysis, pypi
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Stars: ✭ 63 (-76.58%)
Mutual labels: analysis, data-science
Dream3d
Data Analysis program and framework for materials science data analytics, based on the managing framework SIMPL framework.
Stars: ✭ 73 (-72.86%)
Mutual labels: analysis, data-science
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-59.48%)
Mutual labels: data-science, exploratory-data-analysis
Kaggle Competitions
There are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (-68.03%)
Mutual labels: data-science, exploratory-data-analysis
Sparkora
Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-81.04%)
Mutual labels: exploratory-data-analysis, eda
Audioowl
Fast and simple music and audio analysis using RNN in Python 🕵️♀️ 🥁
Stars: ✭ 151 (-43.87%)
Mutual labels: analysis, pypi
leila
Librería para la evaluación de calidad de datos, e interacción con el portal de datos.gov.co
Stars: ✭ 56 (-79.18%)
Mutual labels: exploratory-data-analysis, eda
Lux
Python API for Intelligent Visual Data Discovery
Stars: ✭ 787 (+192.57%)
Mutual labels: data-science, exploratory-data-analysis
Exploratory Data Analysis Visualization Python
Data analysis and visualization with PyData ecosystem: Pandas, Matplotlib Numpy, and Seaborn
Stars: ✭ 78 (-71%)
Mutual labels: exploratory-data-analysis, eda
data ⎰ describe
data-describe is a Python toolkit for Exploratory Data Analysis (EDA). It aims to accelerate data exploration and analysis by providing automated and polished analysis widgets.
For more examples of data-describe in action, see the Quick Start Tutorial.
Main Features
data-describe implements the following basic features:
Feature | Description |
---|---|
Data Summary | Curated data summary |
Data Heatmap | Data variation and missingness heatmap |
Correlation Matrix | Correlation heatmaps with categorical support |
Distribution Plots | Generate histograms, violin plots, bar charts |
Scatterplots | Generate scatterplots and evaluate with scatterplot diagnostics |
Cluster Analysis | Automated clustering and plotting |
Feature Ranking | Evaluate feature importance using tree models |
Extended Features
data-describe is always looking to elevate the standard for Exploratory Data Analysis. Here are just a few that are implemented:
- Dimensionality Reduction Methods
- Sensitive Data (PII) Redaction
- Text Pre-processing / Topic Modeling
- Big Data Support
Installation
data-describe can be installed using pip:
pip install data-describe
Getting Started
import data_describe as dd
help(dd)
See the User Guide for more information.
Project Status
data-describe is currently in beta status.
Contributing
data-describe welcomes contributions from the community.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].