Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+15615.09%)
leilaLibrería para la evaluación de calidad de datos, e interacción con el portal de datos.gov.co
Stars: ✭ 56 (+5.66%)
DQLabThis is a repository for storing and sharing data resulting from working on projects and materials in DQLab
Stars: ✭ 39 (-26.42%)
DatavisualizationTutorials on visualizing data using python packages like bokeh, plotly, seaborn and igraph
Stars: ✭ 234 (+341.51%)
Edarfexploratory data analysis using random forests
Stars: ✭ 62 (+16.98%)
Data Science Your WayWays of doing Data Science Engineering and Machine Learning in R and Python
Stars: ✭ 530 (+900%)
Autoeda ResourcesA list of software and papers related to automatic and fast Exploratory Data Analysis
Stars: ✭ 268 (+405.66%)
100 Days Of Ml CodeA day to day plan for this challenge. Covers both theoritical and practical aspects
Stars: ✭ 172 (+224.53%)
olliePyOlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning experiments by utilising the power and structure of modern web applications. The data scientist only needs to provide the data and any required information and OlliePy will generate the rest.
Stars: ✭ 46 (-13.21%)
Kaggle CompetitionsThere are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (+62.26%)
Data-ScienceUsing Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-71.7%)
DataprepDataPrep — The easiest way to prepare data in Python
Stars: ✭ 639 (+1105.66%)
soda-sparkSoda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (+9.43%)
VisdatPreliminary Exploratory Visualisation of Data
Stars: ✭ 377 (+611.32%)
Inspectdf🛠️ 📊 Tools for Exploring and Comparing Data Frames
Stars: ✭ 195 (+267.92%)
kushner eb5 censusJared Kushner and his partners used a program meant for job-starved areas to build a luxury skyscraper
Stars: ✭ 49 (-7.55%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (+198.11%)
THE-SPARKS-FOUNDATION📌 This repo. Contains Basic - Advance level Machine learning / business analysis Projects. 👨💻
Stars: ✭ 87 (+64.15%)
MetaOmGraphMetaOmGraph: a workbench for interactive exploratory data analysis of large expression datasets
Stars: ✭ 30 (-43.4%)
skimpyskimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Stars: ✭ 236 (+345.28%)
NBiNBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (+92.45%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+3149.06%)
adenineADENINE: A Data ExploratioN PipelINE
Stars: ✭ 15 (-71.7%)
Hn so analysisIs there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causality
Stars: ✭ 94 (+77.36%)
kanaSingle cell analysis in the browser
Stars: ✭ 81 (+52.83%)
LuxPython API for Intelligent Visual Data Discovery
Stars: ✭ 787 (+1384.91%)
KdepyKernel Density Estimation in Python
Stars: ✭ 244 (+360.38%)
contessaEasy way to define, execute and store quality rules for your data.
Stars: ✭ 17 (-67.92%)
XdaR package for exploratory data analysis
Stars: ✭ 112 (+111.32%)
furnitureThe furniture R package contains table1 for publication-ready simple and stratified descriptive statistics, tableC for publication-ready correlation matrixes, and other tables #rstats
Stars: ✭ 43 (-18.87%)
MusicmoodA machine learning approach to classify songs by mood.
Stars: ✭ 388 (+632.08%)
Lotteryprediction🌝 Lottery prediction besides of following "law of proability","Probability: Independent Events", there are still "Saying "a Tail is due", or "just one more go, my luck is due to change" is called The Gambler's Fallacy" existed.
Stars: ✭ 202 (+281.13%)
CodeCompilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (+441.51%)
hacksawExtra tidyverse-like functionality
Stars: ✭ 33 (-37.74%)
Data Describedata⎰describe: Pythonic EDA Accelerator for Data Science
Stars: ✭ 269 (+407.55%)
MiradorTool for visual exploration of complex data.
Stars: ✭ 186 (+250.94%)
DenseNet-MURA-PyTorchImplementation of DenseNet model on Standford's MURA dataset using PyTorch
Stars: ✭ 59 (+11.32%)
penguin-datalayer-collectA data layer quality monitoring and validation module, this solution is part of the Raft Suite ecosystem.
Stars: ✭ 19 (-64.15%)
Edge2GuardCode for PerCom Workshop paper title 'Edge2Guard: Botnet Attacks Detecting Offline Models for Resource-Constrained IoT Devices'
Stars: ✭ 16 (-69.81%)
data-inspectorData Inspector is an open-source python library that brings 15++ types of different functions to make EDA, data cleaning easier.
Stars: ✭ 38 (-28.3%)
FIFA-2019-AnalysisThis is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (-47.17%)
Fraud-AnalysisInsurance fraud claims analysis project
Stars: ✭ 37 (-30.19%)
learnrExploratory, Inferential and Predictive data analysis. Feel free to show your ❤️ by giving a star ⭐
Stars: ✭ 64 (+20.75%)
SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+3392.45%)
Data-Science-101Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Stars: ✭ 19 (-64.15%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (+105.66%)
rl tradingNo description or website provided.
Stars: ✭ 14 (-73.58%)
loonA Toolkit for Interactive Statistical Data Visualization
Stars: ✭ 45 (-15.09%)
hive compared bqhive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.
Stars: ✭ 27 (-49.06%)