1122 open source projects that are alternatives to or similar to Dataprep

Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+808.92%)
Complete Life Cycle Of A Data Science Project
Complete-Life-Cycle-of-a-Data-Science-Project
Stars: ✭ 140 (-78.09%)
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+1203.44%)
100 Days Of Ml Code
A day-to-day plan for this challenge. Covers both theoretical and practical aspects
Stars: ✭ 172 (-73.08%)
Data Describe
data⎰describe: Pythonic EDA Accelerator for Data Science
Stars: ✭ 269 (-57.9%)
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+189.67%)
Exploratory Data Analysis Visualization Python
Data analysis and visualization with the PyData ecosystem: Pandas, Matplotlib, NumPy, and Seaborn
Stars: ✭ 78 (-87.79%)
Mutual labels:  exploratory-data-analysis, eda
leila
Library for data quality assessment and interaction with the datos.gov.co data portal
Stars: ✭ 56 (-91.24%)
Mutual labels:  exploratory-data-analysis, eda
Hn so analysis
Is there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causality
Stars: ✭ 94 (-85.29%)
Mutual labels:  exploratory-data-analysis, eda
Sparkora
Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-92.02%)
Mutual labels:  exploratory-data-analysis, eda
Inspectdf
🛠️ 📊 Tools for Exploring and Comparing Data Frames
Stars: ✭ 195 (-69.48%)
Mutual labels:  exploratory-data-analysis, eda
skimpy
skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console.
Stars: ✭ 236 (-63.07%)
Mutual labels:  exploratory-data-analysis, eda
olliePy
OlliePy is a Python package that helps data scientists explore their data and evaluate and analyse their machine learning experiments, using the power and structure of modern web applications. The data scientist only needs to provide the data and any required information, and OlliePy generates the rest.
Stars: ✭ 46 (-92.8%)
Mutual labels:  exploratory-data-analysis, eda
Lux
Python API for Intelligent Visual Data Discovery
Stars: ✭ 787 (+23.16%)
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+83.88%)
Mutual labels:  data-science, eda
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-82.94%)
Data Science Your Way
Ways of doing Data Science Engineering and Machine Learning in R and Python
Stars: ✭ 530 (-17.06%)
Kaggle Competitions
There are plenty of courses and tutorials that can help you learn machine learning from scratch, but here on GitHub I want to solve some Kaggle competitions as a comprehensive workflow with Python packages. After reading, you can use this workflow as a template to solve other real problems.
Stars: ✭ 86 (-86.54%)
Autoeda Resources
A list of software and papers related to automatic and fast Exploratory Data Analysis
Stars: ✭ 268 (-58.06%)
Mutual labels:  exploratory-data-analysis, eda
Scattertext
Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+169.48%)
Mutual labels:  exploratory-data-analysis, eda
Ditching Excel For Python
Functionalities in Excel translated to Python
Stars: ✭ 172 (-73.08%)
Mutual labels:  exploratory-data-analysis, eda
Xda
R package for exploratory data analysis
Stars: ✭ 112 (-82.47%)
Code
Compilation of R and Python code from the Data Professor YouTube channel.
Stars: ✭ 287 (-55.09%)
Dataexplorer
Automate Data Exploration and Treatment
Stars: ✭ 362 (-43.35%)
Mutual labels:  data-science, eda
Pygam
[HELP REQUESTED] Generalized Additive Models in Python
Stars: ✭ 569 (-10.95%)
Mutual labels:  data-science
Dist Keras
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (-4.07%)
Mutual labels:  data-science
Alphapy
Automated Machine Learning [AutoML] with Python, scikit-learn, Keras, XGBoost, LightGBM, and CatBoost
Stars: ✭ 564 (-11.74%)
Mutual labels:  data-science
Baikal
A graph-based functional API for building complex scikit-learn pipelines.
Stars: ✭ 573 (-10.33%)
Mutual labels:  data-science
Elki
ELKI Data Mining Toolkit
Stars: ✭ 613 (-4.07%)
Mutual labels:  data-science
Datasets For Recommender Systems
A repository of high-quality, topic-centric public data sources for Recommender Systems (RS)
Stars: ✭ 564 (-11.74%)
Mutual labels:  data-science
Boltons
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Stars: ✭ 5,671 (+787.48%)
Mutual labels:  data-science
Data Analysis And Machine Learning Projects
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Stars: ✭ 5,166 (+708.45%)
Mutual labels:  data-science
Sigma coding youtube
This is a collection of all the code that can be found on my YouTube channel Sigma Coding.
Stars: ✭ 611 (-4.38%)
Mutual labels:  data-science
Pachyderm
Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+730.2%)
Mutual labels:  data-science
Data Science Portfolio
Portfolio of data science projects completed by me for academic, self learning, and hobby purposes.
Stars: ✭ 559 (-12.52%)
Mutual labels:  data-science
Httplog
Log outgoing HTTP requests in Ruby
Stars: ✭ 633 (-0.94%)
Mutual labels:  apis
Dataproofer
A proofreader for your data
Stars: ✭ 628 (-1.72%)
Mutual labels:  data-science
Book sample
another book on data science
Stars: ✭ 611 (-4.38%)
Mutual labels:  data-science
Nipype
Workflows and interfaces for neuroimaging packages
Stars: ✭ 557 (-12.83%)
Mutual labels:  data-science
Baby Names Analysis
Data ETL & Analysis on the dataset 'Baby Names from Social Security Card Applications - National Data'.
Stars: ✭ 557 (-12.83%)
Mutual labels:  eda
Moviegeek
A django website used in the book Practical Recommender Systems to illustrate how recommender algorithms can be implemented.
Stars: ✭ 608 (-4.85%)
Mutual labels:  data-science
Data Science With Ruby
Practical Data Science with Ruby based tools.
Stars: ✭ 549 (-14.08%)
Mutual labels:  data-science
Probabilistic Programming And Bayesian Methods For Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Stars: ✭ 23,912 (+3642.1%)
Mutual labels:  data-science
Lazydata
Lazydata: Scalable data dependencies for Python projects
Stars: ✭ 627 (-1.88%)
Mutual labels:  data-science
Fusesoc
Package manager and build abstraction tool for FPGA/ASIC development
Stars: ✭ 607 (-5.01%)
Mutual labels:  eda
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-15.34%)
Mutual labels:  data-science
Intro To Python
An intro to Python & programming for wanna-be data scientists
Stars: ✭ 536 (-16.12%)
Mutual labels:  data-science
Smile
Statistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (+746.95%)
Mutual labels:  data-science
Stanford Cs 230 Deep Learning
VIP cheatsheets for Stanford's CS 230 Deep Learning
Stars: ✭ 5,149 (+705.79%)
Mutual labels:  data-science
Feature Selection
Feature selector based on a self-selected algorithm, loss function, and validation method
Stars: ✭ 534 (-16.43%)
Mutual labels:  data-science
Speech Emotion Analyzer
The neural network model can detect five different emotions in male and female speech audio. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-0.94%)
Mutual labels:  data-science
Datascience Box
Data Science Course in a Box
Stars: ✭ 629 (-1.56%)
Mutual labels:  data-science
Nfstream
NFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-2.66%)
Mutual labels:  data-science
Datasheets
Read data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (-7.2%)
Mutual labels:  data-science
Awesome Twitter Data
A list of Twitter datasets and related resources.
Stars: ✭ 533 (-16.59%)
Mutual labels:  data-science
Pdpipe
Easy pipelines for pandas DataFrames.
Stars: ✭ 590 (-7.67%)
Mutual labels:  data-science
Interpretable machine learning with python
Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Stars: ✭ 530 (-17.06%)
Mutual labels:  data-science
Lets Plot
An open-source plotting library for statistical data.
Stars: ✭ 531 (-16.9%)
Mutual labels:  data-science
Matrixprofile Ts
A Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile
Stars: ✭ 621 (-2.82%)
Mutual labels:  data-science
Mongo Spark
The MongoDB Spark Connector
Stars: ✭ 588 (-7.98%)
Mutual labels:  connector