SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+9642.11%)
Data Forge JsJavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (+631.58%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+43736.84%)
whyqddata wrangling simplicity, complete audit transparency, and at speed
Stars: ✭ 16 (-15.79%)
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+4989.47%)
pandas-workshopAn introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+747.37%)
Pandas SummaryAn extension to pandas dataframes describe function.
Stars: ✭ 361 (+1800%)
Data-Wrangling-with-PythonSimplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (+373.68%)
Ai Learn人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Stars: ✭ 4,387 (+22989.47%)
SiubaPython library for using dplyr like syntax with pandas and SQL
Stars: ✭ 605 (+3084.21%)
PrettypandasA Pandas Styler class for making beautiful tables
Stars: ✭ 376 (+1878.95%)
PracticalMachineLearningA collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+215.79%)
Mlcourse.aiOpen Machine Learning Course
Stars: ✭ 7,963 (+41810.53%)
100 Pandas Puzzles100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Stars: ✭ 1,382 (+7173.68%)
PbpythonCode, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+8973.68%)
Pyda 2e Zh📖 [译] 利用 Python 进行数据分析 · 第 2 版
Stars: ✭ 866 (+4457.89%)
Seaborn TutorialThis repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (+500%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (+557.89%)
DatscanDatScan is an initiative to build an open-source CMS that will have the capability to solve any problem using data Analysis just with the help of various modules and a vast standardized module library
Stars: ✭ 13 (-31.58%)
Pandas DatareaderExtract data from a wide range of Internet sources into a pandas DataFrame.
Stars: ✭ 2,183 (+11389.47%)
DtaleVisualizer for pandas data structures
Stars: ✭ 2,864 (+14973.68%)
kobe-every-shot-everA Los Angeles Times analysis of Every shot in Kobe Bryant's NBA career
Stars: ✭ 66 (+247.37%)
ZatZeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (+1494.74%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+1336.84%)
Pydata Notebook利用Python进行数据分析 第二版 (2017) 中文翻译笔记
Stars: ✭ 4,300 (+22531.58%)
PandastableTable analysis in Tkinter using pandas DataFrames.
Stars: ✭ 376 (+1878.95%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (+4257.89%)
fairlensIdentify bias and measure fairness of your data
Stars: ✭ 51 (+168.42%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+1136.84%)
PandasFlexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Stars: ✭ 32,029 (+168473.68%)
Pydata Pandas WorkshopMaterial for my PyData Jupyter & Pandas Workshops, I'm also available for personal in-house trainings on request
Stars: ✭ 65 (+242.11%)
KodiakEnhance your feature engineering workflow with Kodiak
Stars: ✭ 20 (+5.26%)
Pandas VideosJupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+8931.58%)
Dat8General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+7878.95%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (+184.21%)
Awkward 1.0Manipulate JSON-like data with NumPy-like idioms.
Stars: ✭ 203 (+968.42%)
MissingnoMissing data visualization module for Python.
Stars: ✭ 3,019 (+15789.47%)
Data-Analyst-NanodegreeThis repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Stars: ✭ 13 (-31.58%)
ZebrasData analysis library for JavaScript built with Ramda
Stars: ✭ 192 (+910.53%)
Data Analysis主要是爬虫与数据分析项目总结,外加建模与机器学习,模型的评估。
Stars: ✭ 142 (+647.37%)
DeepgraphAnalyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+1121.05%)
Edavizedaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
Stars: ✭ 220 (+1057.89%)
skimpyskimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Stars: ✭ 236 (+1142.11%)
LuxPython API for Intelligent Visual Data Discovery
Stars: ✭ 787 (+4042.11%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (+473.68%)
CodeCompilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (+1410.53%)
XdaR package for exploratory data analysis
Stars: ✭ 112 (+489.47%)
tempoAPI for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (+1015.79%)
GreyNSightsPrivacy-Preserving Data Analysis using Pandas
Stars: ✭ 18 (-5.26%)
validadaAnother library for defensive data analysis.
Stars: ✭ 29 (+52.63%)
Dtale DesktopBuild a data visualization dashboard with simple snippets of python code
Stars: ✭ 128 (+573.68%)