Data Forge JsJavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (-1.42%)
HypertoolsA Python toolbox for gaining geometric insights into high-dimensional data
Stars: ✭ 1,678 (+1090.07%)
Uc R.github.ioMain repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.
Stars: ✭ 76 (-46.1%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+5950.35%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+599.29%)
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+585.82%)
Moderndive bookStatistical Inference via Data Science: A ModernDive into R and the Tidyverse
Stars: ✭ 527 (+273.76%)
ProseMicrosoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.
Stars: ✭ 470 (+233.33%)
SqawkLike Awk but with SQL and table joins
Stars: ✭ 263 (+86.52%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-61.7%)
mimirData-ish exploration through SQL+Uncertainty
Stars: ✭ 26 (-81.56%)
foofahFoofah: programming-by-example data transformation program synthesizer
Stars: ✭ 24 (-82.98%)
Chapter-2Code examples for Chapter 2 of Data Wrangling with JavaScript
Stars: ✭ 16 (-88.65%)
xploreA python package built for data scientist/analysts, AI/ML engineers for exploring features of a dataset in minimal number of lines of code for quick analysis before data wrangling and feature extraction.
Stars: ✭ 21 (-85.11%)
pandas-workshopAn introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+14.18%)
Data-Science-101Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Stars: ✭ 19 (-86.52%)
Data-Wrangling-with-PythonSimplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (-36.17%)
whyqddata wrangling simplicity, complete audit transparency, and at speed
Stars: ✭ 16 (-88.65%)
pyrefineExecute OpenRefine JSON scripts without OpenRefine (or Java)
Stars: ✭ 25 (-82.27%)
Data-Analyst-NanodegreeThis repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Stars: ✭ 13 (-90.78%)
qsvCSVs sliced, diced & analyzed.
Stars: ✭ 438 (+210.64%)
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+858.16%)
DatatestTools for test driven data-wrangling and data validation.
Stars: ✭ 238 (+68.79%)
R Ecology LessonData Analysis and Visualization in R for Ecologists
Stars: ✭ 218 (+54.61%)
QsacnpjPacote que trata e organiza os dados do Cadastro Nacional da Pessoa Jurídica (CNPJ)
Stars: ✭ 187 (+32.62%)