DatatestTools for test driven data-wrangling and data validation.
QsacnpjPacote que trata e organiza os dados do Cadastro Nacional da Pessoa Jurídica (CNPJ)
SjmiscData transformation and utility functions for R
Data Forge JsJavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
HypertoolsA Python toolbox for gaining geometric insights into high-dimensional data
Uc R.github.ioMain repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Moderndive bookStatistical Inference via Data Science: A ModernDive into R and the Tidyverse
ProseMicrosoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.
SqawkLike Awk but with SQL and table joins
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
mimirData-ish exploration through SQL+Uncertainty
foofahFoofah: programming-by-example data transformation program synthesizer
Chapter-2Code examples for Chapter 2 of Data Wrangling with JavaScript
xploreA python package built for data scientist/analysts, AI/ML engineers for exploring features of a dataset in minimal number of lines of code for quick analysis before data wrangling and feature extraction.
pandas-workshopAn introductory workshop on pandas with notebooks and exercises for following along.
Data-Science-101Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
whyqddata wrangling simplicity, complete audit transparency, and at speed
pyrefineExecute OpenRefine JSON scripts without OpenRefine (or Java)
Data-Analyst-NanodegreeThis repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
qsvCSVs sliced, diced & analyzed.
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark