SjmiscData transformation and utility functions for R
Stars: ✭ 141 (-24.6%)
Data Forge JsJavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (-25.67%)
HypertoolsA Python toolbox for gaining geometric insights into high-dimensional data
Stars: ✭ 1,678 (+797.33%)
Uc R.github.ioMain repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.
Stars: ✭ 76 (-59.36%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+4462.03%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+427.27%)
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+417.11%)
Moderndive bookStatistical Inference via Data Science: A ModernDive into R and the Tidyverse
Stars: ✭ 527 (+181.82%)
ProseMicrosoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.
Stars: ✭ 470 (+151.34%)
SqawkLike Awk but with SQL and table joins
Stars: ✭ 263 (+40.64%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-71.12%)
mimirData-ish exploration through SQL+Uncertainty
Stars: ✭ 26 (-86.1%)
foofahFoofah: programming-by-example data transformation program synthesizer
Stars: ✭ 24 (-87.17%)
Chapter-2Code examples for Chapter 2 of Data Wrangling with JavaScript
Stars: ✭ 16 (-91.44%)
xploreA python package built for data scientist/analysts, AI/ML engineers for exploring features of a dataset in minimal number of lines of code for quick analysis before data wrangling and feature extraction.
Stars: ✭ 21 (-88.77%)
pandas-workshopAn introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (-13.9%)
Data-Science-101Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Stars: ✭ 19 (-89.84%)
Data-Wrangling-with-PythonSimplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (-51.87%)
whyqddata wrangling simplicity, complete audit transparency, and at speed
Stars: ✭ 16 (-91.44%)
pyrefineExecute OpenRefine JSON scripts without OpenRefine (or Java)
Stars: ✭ 25 (-86.63%)
Data-Analyst-NanodegreeThis repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Stars: ✭ 13 (-93.05%)
qsvCSVs sliced, diced & analyzed.
Stars: ✭ 438 (+134.22%)
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+622.46%)
DatatestTools for test driven data-wrangling and data validation.
Stars: ✭ 238 (+27.27%)
R Ecology LessonData Analysis and Visualization in R for Ecologists
Stars: ✭ 218 (+16.58%)