101 open source projects that are alternatives to or similar to foofah

optimus

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Stars: ✭ 1,351 (+5529.17%)

Mutual labels: data-transformation, data-wrangling, data-preparation, data-cleaning

allie

🤖 A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).

Stars: ✭ 93 (+287.5%)

Mutual labels: data-transformation, data-cleaning

Udacity-Data-Analyst-Nanodegree

Repository for the projects needed to complete the Data Analyst Nanodegree.

Stars: ✭ 31 (+29.17%)

Mutual labels: data-wrangling, data-cleaning

Data Forge Ts

The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.

Stars: ✭ 967 (+3929.17%)

Mutual labels: data-wrangling, data-cleaning

bumblebee

🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)

Stars: ✭ 120 (+400%)

Mutual labels: data-preparation, data-cleaning

prosto

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

Stars: ✭ 54 (+125%)

Mutual labels: data-wrangling, data-preparation

Optimus

🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

Stars: ✭ 986 (+4008.33%)

Mutual labels: data-wrangling, data-cleaning

qsv

CSVs sliced, diced & analyzed.

Stars: ✭ 438 (+1725%)

Mutual labels: data-wrangling

whyqd

data wrangling simplicity, complete audit transparency, and at speed

Stars: ✭ 16 (-33.33%)

Mutual labels: data-wrangling

php-serializer

Serialize PHP variables, including objects, in any format. Supports unserializing them too.

Stars: ✭ 47 (+95.83%)

Mutual labels: data-transformation

pycsvw

A tool to read CSV files with CSVW metadata and transform them into other formats.

Stars: ✭ 32 (+33.33%)

Mutual labels: data-transformation

sql-ecology-lesson

Data Management with SQL for Ecologists

Stars: ✭ 37 (+54.17%)

Mutual labels: data-wrangling

r-novice-inflammation

Programming with R

Stars: ✭ 142 (+491.67%)

Mutual labels: data-wrangling

datapackage-m

Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel

Stars: ✭ 26 (+8.33%)

Mutual labels: data-transformation

OpenRefine-ecology-lesson

Data Cleaning with OpenRefine for Ecologists

Stars: ✭ 20 (-16.67%)

Mutual labels: data-cleaning

HoloClean-Legacy-deprecated

A Machine Learning System for Data Enrichment.

Stars: ✭ 75 (+212.5%)

Mutual labels: data-cleaning

reskit

A library for creating and curating reproducible pipelines for scientific and industrial machine learning

Stars: ✭ 27 (+12.5%)

Mutual labels: data-preparation

gallia-core

A schema-aware Scala library for data transformation

Stars: ✭ 44 (+83.33%)

Mutual labels: data-transformation

R-Learning-Journey

Some of the projects I made when starting to learn R for Data Science at the university

Stars: ✭ 19 (-20.83%)

Mutual labels: data-cleaning

dry-transformer

Data transformation toolkit

Stars: ✭ 59 (+145.83%)

Mutual labels: data-transformation

pyrefine

Execute OpenRefine JSON scripts without OpenRefine (or Java)

Stars: ✭ 25 (+4.17%)

Mutual labels: data-wrangling

data-algorithms-with-spark

O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian

Stars: ✭ 34 (+41.67%)

Mutual labels: data-transformation

serializer-benchmark

A PHP benchmark application to compare PHP serializer libraries

Stars: ✭ 14 (-41.67%)

Mutual labels: data-transformation

objectiv-analytics

Powerful product analytics for data teams, with full control over data & models.

Stars: ✭ 399 (+1562.5%)

Mutual labels: data-cleaning

Data Cleaning 101

Data Cleaning Libraries with Python

Stars: ✭ 243 (+912.5%)

Mutual labels: data-wrangling

R Ecology Lesson

Data Analysis and Visualization in R for Ecologists

Stars: ✭ 218 (+808.33%)

Mutual labels: data-wrangling

daany

Daany - .NET DAta ANalYtics library with implementations of DataFrame, time series decomposition, and the linear algebra routines BLAS and LAPACK.

Stars: ✭ 49 (+104.17%)

Mutual labels: data-transformation

fastverse

An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R

Stars: ✭ 123 (+412.5%)

Mutual labels: data-transformation

Chapter-2

Code examples for Chapter 2 of Data Wrangling with JavaScript

Stars: ✭ 16 (-33.33%)

Mutual labels: data-wrangling

pandas-workshop

An introductory workshop on pandas with notebooks and exercises for following along.

Stars: ✭ 161 (+570.83%)

Mutual labels: data-wrangling

Data-Analyst-Nanodegree

This repo consists of the projects that I completed as part of Udacity's Data Analyst Nanodegree curriculum.

Stars: ✭ 13 (-45.83%)

Mutual labels: data-wrangling

Web Database Analytics

Web scraping and related analytics using Python tools

Stars: ✭ 175 (+629.17%)

Mutual labels: data-wrangling

bamboolib binder template

bamboolib - template for creating your own binder notebook

Stars: ✭ 19 (-20.83%)

Mutual labels: data-transformation

Data-Wrangling-with-Python

Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices

Stars: ✭ 90 (+275%)

Mutual labels: data-wrangling

Cleaner.jl

A toolbox of simple solutions for common data cleaning problems.

Stars: ✭ 21 (-12.5%)

Mutual labels: data-cleaning

wrangler

Wrangler Transform: A DMD system for transforming Big Data

Stars: ✭ 63 (+162.5%)

Mutual labels: data-transformation

FIFA-2019-Analysis

This is a project based on the FIFA World Cup 2019 that analyzes the performance and efficiency of teams, players, countries, and related aspects using data analysis and data visualization

Stars: ✭ 28 (+16.67%)

Mutual labels: data-cleaning

Semantic-Bus

object flow treatment, data transformation

Stars: ✭ 49 (+104.17%)

Mutual labels: data-transformation

tutorials

Short programming tutorials pertaining to data analysis.

Stars: ✭ 14 (-41.67%)

Mutual labels: data-transformation

dynamic.yaml

DEPRECATED: YAML-based data transformations

Stars: ✭ 14 (-41.67%)

Mutual labels: data-transformation

LDWizard

A generic framework for simplifying the creation of linked data.

Stars: ✭ 17 (-29.17%)

Mutual labels: data-transformation

richflow

A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.

Stars: ✭ 17 (-29.17%)

Mutual labels: data-transformation

clojure-dsl-resources

A curated list of Clojure resources for dealing with domain-specific languages.

Stars: ✭ 99 (+312.5%)

Mutual labels: data-transformation

errorlocate

Find and replace erroneous fields in data using validation rules

Stars: ✭ 19 (-20.83%)

Mutual labels: data-cleaning

Data Forge Js

JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.

Stars: ✭ 139 (+479.17%)

Mutual labels: data-wrangling

xplore

A Python package built for data scientists, analysts, and AI/ML engineers for exploring the features of a dataset in a minimal number of lines of code, for quick analysis before data wrangling and feature extraction.

Stars: ✭ 21 (-12.5%)

Mutual labels: data-wrangling

Datatest

Tools for test driven data-wrangling and data validation.

Stars: ✭ 238 (+891.67%)

Mutual labels: data-wrangling

zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Stars: ✭ 655 (+2629.17%)

Mutual labels: data-transformation

Qsacnpj

Package that processes and organizes data from the Cadastro Nacional da Pessoa Jurídica (CNPJ), Brazil's national registry of legal entities