Top 43 data-wrangling open source projects

Datatest
Tools for test-driven data wrangling and data validation.
R Ecology Lesson
Data Analysis and Visualization in R for Ecologists
Qsacnpj
Package that processes and organizes data from the Cadastro Nacional da Pessoa Jurídica (CNPJ), Brazil's national registry of legal entities
Sjmisc
Data transformation and utility functions for R
Data Forge Js
JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
Uc R.github.io
Main repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Cracking The Data Science Interview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Moderndive book
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse
Prose
The Microsoft Program Synthesis using Examples SDK is a framework of technologies for automatically generating programs from input-output examples. This repo includes samples and sample data for the SDK.
Sqawk
Like Awk but with SQL and table joins
prosto
Prosto is a data processing toolkit that radically changes how data is processed by relying heavily on functions and operations with functions, as an alternative to map-reduce and join-groupby
mimir
Data-ish exploration through SQL+Uncertainty
advanced-data-wrangling-in-R-legacy
Advanced-data-wrangling-in-R, Workshop
Chapter-2
Code examples for Chapter 2 of Data Wrangling with JavaScript
xplore
A Python package for data scientists, analysts, and AI/ML engineers to explore the features of a dataset in a minimal number of lines of code, for quick analysis before data wrangling and feature extraction.
Data-Science-101
Notes and tutorials on how to use Python, pandas, seaborn, NumPy, Matplotlib, and SciPy for data science.
whyqd
Simple data wrangling with complete audit transparency, at speed
pyrefine
Execute OpenRefine JSON scripts without OpenRefine (or Java)