Hypertools: A Python toolbox for gaining geometric insights into high-dimensional data
Stars: ✭ 1,678 (+6612%)
Mutual labels: data-wrangling
Sjmisc: Data transformation and utility functions for R
Stars: ✭ 141 (+464%)
Mutual labels: data-wrangling
Datatest: Tools for test-driven data wrangling and data validation.
Stars: ✭ 238 (+852%)
Mutual labels: data-wrangling
openrefine-client: The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the command line interface (CLI) and is distributed as a convenient one-file-executable (Windows, Linux, Mac). It is also available via Docker Hub, PyPI and Binder.
Stars: ✭ 67 (+168%)
Mutual labels: openrefine
Python Ecology Lesson: Data Analysis and Visualization in Python for Ecologists
Stars: ✭ 116 (+364%)
Mutual labels: data-wrangling
Data Cleaning 101: Data Cleaning Libraries with Python
Stars: ✭ 243 (+872%)
Mutual labels: data-wrangling
qsv: CSVs sliced, diced & analyzed.
Stars: ✭ 438 (+1652%)
Mutual labels: data-wrangling
Web Database Analytics: Web scraping and related analytics using Python tools
Stars: ✭ 175 (+600%)
Mutual labels: data-wrangling
R Ecology Lesson: Data Analysis and Visualization in R for Ecologists
Stars: ✭ 218 (+772%)
Mutual labels: data-wrangling
openrefine-docker: OpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files for automated builds.
Stars: ✭ 19 (-24%)
Mutual labels: openrefine
Data Forge Js: JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (+456%)
Mutual labels: data-wrangling
sql-ecology-lesson: Data Management with SQL for Ecologists
Stars: ✭ 37 (+48%)
Mutual labels: data-wrangling
R Novice Gapminder: R for Reproducible Scientific Analysis
Stars: ✭ 127 (+408%)
Mutual labels: data-wrangling
conciliator: OpenRefine reconciliation services for VIAF, ORCID, and Open Library, plus a framework for creating more.
Stars: ✭ 95 (+280%)
Mutual labels: openrefine
Data-Analyst-Nanodegree: This repo consists of the projects that I completed as part of Udacity's Data Analyst Nanodegree curriculum.
Stars: ✭ 13 (-48%)
Mutual labels: data-wrangling
openrefine-batch: Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a Python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+204%)
Mutual labels: openrefine
optimus: 🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+5304%)
Mutual labels: data-wrangling