Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+1203.44%)
100 Days Of Ml CodeA day to day plan for this challenge. Covers both theoritical and practical aspects
Stars: ✭ 172 (-73.08%)
Data Describedata⎰describe: Pythonic EDA Accelerator for Data Science
Stars: ✭ 269 (-57.9%)
SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+189.67%)
leilaLibrería para la evaluación de calidad de datos, e interacción con el portal de datos.gov.co
Stars: ✭ 56 (-91.24%)
Hn so analysisIs there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causality
Stars: ✭ 94 (-85.29%)
SparkoraPowerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-92.02%)
Inspectdf🛠️ 📊 Tools for Exploring and Comparing Data Frames
Stars: ✭ 195 (-69.48%)
skimpyskimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Stars: ✭ 236 (-63.07%)
olliePyOlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning experiments by utilising the power and structure of modern web applications. The data scientist only needs to provide the data and any required information and OlliePy will generate the rest.
Stars: ✭ 46 (-92.8%)
LuxPython API for Intelligent Visual Data Discovery
Stars: ✭ 787 (+23.16%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-82.94%)
Data Science Your WayWays of doing Data Science Engineering and Machine Learning in R and Python
Stars: ✭ 530 (-17.06%)
Kaggle CompetitionsThere are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (-86.54%)
Autoeda ResourcesA list of software and papers related to automatic and fast Exploratory Data Analysis
Stars: ✭ 268 (-58.06%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+169.48%)
XdaR package for exploratory data analysis
Stars: ✭ 112 (-82.47%)
CodeCompilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (-55.09%)
DataexplorerAutomate Data Exploration and Treatment
Stars: ✭ 362 (-43.35%)
Pygam[HELP REQUESTED] Generalized Additive Models in Python
Stars: ✭ 569 (-10.95%)
Dist KerasDistributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (-4.07%)
AlphapyAutomated Machine Learning [AutoML] with Python, scikit-learn, Keras, XGBoost, LightGBM, and CatBoost
Stars: ✭ 564 (-11.74%)
BaikalA graph-based functional API for building complex scikit-learn pipelines.
Stars: ✭ 573 (-10.33%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (-4.07%)
Boltons🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Stars: ✭ 5,671 (+787.48%)
Sigma coding youtubeThis is a collection of all the code that can be found on my YouTube channel Sigma Coding.
Stars: ✭ 611 (-4.38%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+730.2%)
Data Science PortfolioPortfolio of data science projects completed by me for academic, self learning, and hobby purposes.
Stars: ✭ 559 (-12.52%)
HttplogLog outgoing HTTP requests in ruby
Stars: ✭ 633 (-0.94%)
DataprooferA proofreader for your data
Stars: ✭ 628 (-1.72%)
Book sampleanother book on data science
Stars: ✭ 611 (-4.38%)
NipypeWorkflows and interfaces for neuroimaging packages
Stars: ✭ 557 (-12.83%)
Baby Names AnalysisData ETL & Analysis on the dataset 'Baby Names from Social Security Card Applications - National Data'.
Stars: ✭ 557 (-12.83%)
MoviegeekA django website used in the book Practical Recommender Systems to illustrate how recommender algorithms can be implemented.
Stars: ✭ 608 (-4.85%)
LazydataLazydata: Scalable data dependencies for Python projects
Stars: ✭ 627 (-1.88%)
FusesocPackage manager and build abstraction tool for FPGA/ASIC development
Stars: ✭ 607 (-5.01%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-15.34%)
Intro To PythonAn intro to Python & programming for wanna-be data scientists
Stars: ✭ 536 (-16.12%)
SmileStatistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (+746.95%)
Feature SelectionFeatures selector based on the self selected-algorithm, loss function and validation method
Stars: ✭ 534 (-16.43%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-0.94%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-2.66%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (-7.2%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (-7.67%)
Interpretable machine learning with pythonExamples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Stars: ✭ 530 (-17.06%)
Lets PlotAn open-source plotting library for statistical data.
Stars: ✭ 531 (-16.9%)
Matrixprofile TsA Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile
Stars: ✭ 621 (-2.82%)
Mongo SparkThe MongoDB Spark Connector
Stars: ✭ 588 (-7.98%)