All Projects → foofah → Similar Projects or Alternatives

101 Open source projects that are alternatives of or similar to foofah

optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+5529.17%)
allie
🤖 A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).
Stars: ✭ 93 (+287.5%)
Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Stars: ✭ 31 (+29.17%)
Mutual labels:  data-wrangling, data-cleaning
Data Forge Ts
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+3929.17%)
Mutual labels:  data-wrangling, data-cleaning
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+400%)
Mutual labels:  data-preparation, data-cleaning
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (+125%)
Mutual labels:  data-wrangling, data-preparation
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+4008.33%)
Mutual labels:  data-wrangling, data-cleaning
qsv
CSVs sliced, diced & analyzed.
Stars: ✭ 438 (+1725%)
Mutual labels:  data-wrangling
whyqd
data wrangling simplicity, complete audit transparency, and at speed
Stars: ✭ 16 (-33.33%)
Mutual labels:  data-wrangling
php-serializer
Serialize PHP variables, including objects, in any format. Support to unserialize it too.
Stars: ✭ 47 (+95.83%)
Mutual labels:  data-transformation
pycsvw
A tool to read CSV files with CSVW metadata and transform them into other formats.
Stars: ✭ 32 (+33.33%)
Mutual labels:  data-transformation
sql-ecology-lesson
Data Management with SQL for Ecologists
Stars: ✭ 37 (+54.17%)
Mutual labels:  data-wrangling
r-novice-inflammation
Programming with R
Stars: ✭ 142 (+491.67%)
Mutual labels:  data-wrangling
datapackage-m
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
Stars: ✭ 26 (+8.33%)
Mutual labels:  data-transformation
OpenRefine-ecology-lesson
Data Cleaning with OpenRefine for Ecologists
Stars: ✭ 20 (-16.67%)
Mutual labels:  data-cleaning
HoloClean-Legacy-deprecated
A Machine Learning System for Data Enrichment.
Stars: ✭ 75 (+212.5%)
Mutual labels:  data-cleaning
reskit
A library for creating and curating reproducible pipelines for scientific and industrial machine learning
Stars: ✭ 27 (+12.5%)
Mutual labels:  data-preparation
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (+83.33%)
Mutual labels:  data-transformation
R-Learning-Journey
Some of the projects i made when starting to learn R for Data Science at the university
Stars: ✭ 19 (-20.83%)
Mutual labels:  data-cleaning
dry-transformer
Data transformation toolkit
Stars: ✭ 59 (+145.83%)
Mutual labels:  data-transformation
pyrefine
Execute OpenRefine JSON scripts without OpenRefine (or Java)
Stars: ✭ 25 (+4.17%)
Mutual labels:  data-wrangling
data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Stars: ✭ 34 (+41.67%)
Mutual labels:  data-transformation
serializer-benchmark
A PHP benchmark application to compare PHP serializer libraries
Stars: ✭ 14 (-41.67%)
Mutual labels:  data-transformation
objectiv-analytics
Powerful product analytics for data teams, with full control over data & models.
Stars: ✭ 399 (+1562.5%)
Mutual labels:  data-cleaning
Data Cleaning 101
Data Cleaning Libraries with Python
Stars: ✭ 243 (+912.5%)
Mutual labels:  data-wrangling
R Ecology Lesson
Data Analysis and Visualization in R for Ecologists
Stars: ✭ 218 (+808.33%)
Mutual labels:  data-wrangling
daany
Daany - .NET DAta ANalYtics .NET library with the implementation of DataFrame, Time series decompositions and Linear Algebra routines BLASS and LAPACK.
Stars: ✭ 49 (+104.17%)
Mutual labels:  data-transformation
fastverse
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
Stars: ✭ 123 (+412.5%)
Mutual labels:  data-transformation
Chapter-2
Code examples for Chapter 2 of Data Wrangling with JavaScript
Stars: ✭ 16 (-33.33%)
Mutual labels:  data-wrangling
pandas-workshop
An introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+570.83%)
Mutual labels:  data-wrangling
Data-Analyst-Nanodegree
This repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Stars: ✭ 13 (-45.83%)
Mutual labels:  data-wrangling
Web Database Analytics
Web scrapping and related analytics using Python tools
Stars: ✭ 175 (+629.17%)
Mutual labels:  data-wrangling
bamboolib binder template
bamboolib - template for creating your own binder notebook
Stars: ✭ 19 (-20.83%)
Mutual labels:  data-transformation
Data-Wrangling-with-Python
Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (+275%)
Mutual labels:  data-wrangling
Cleaner.jl
A toolbox of simple solutions for common data cleaning problems.
Stars: ✭ 21 (-12.5%)
Mutual labels:  data-cleaning
wrangler
Wrangler Transform: A DMD system for transforming Big Data
Stars: ✭ 63 (+162.5%)
Mutual labels:  data-transformation
FIFA-2019-Analysis
This is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (+16.67%)
Mutual labels:  data-cleaning
Semantic-Bus
object flow treatment, data transformation
Stars: ✭ 49 (+104.17%)
Mutual labels:  data-transformation
tutorials
Short programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-41.67%)
Mutual labels:  data-transformation
dynamic.yaml
DEPRECATED: YAML-based data transformations
Stars: ✭ 14 (-41.67%)
Mutual labels:  data-transformation
LDWizard
A generic framework for simplifying the creation of linked data.
Stars: ✭ 17 (-29.17%)
Mutual labels:  data-transformation
richflow
A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.
Stars: ✭ 17 (-29.17%)
Mutual labels:  data-transformation
clojure-dsl-resources
A curated list of Clojure resources for dealing with domain-specific languages.
Stars: ✭ 99 (+312.5%)
Mutual labels:  data-transformation
errorlocate
Find and replace erroneous fields in data using validation rules
Stars: ✭ 19 (-20.83%)
Mutual labels:  data-cleaning
Data Forge Js
JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (+479.17%)
Mutual labels:  data-wrangling
xplore
A python package built for data scientist/analysts, AI/ML engineers for exploring features of a dataset in minimal number of lines of code for quick analysis before data wrangling and feature extraction.
Stars: ✭ 21 (-12.5%)
Mutual labels:  data-wrangling
Datatest
Tools for test driven data-wrangling and data validation.
Stars: ✭ 238 (+891.67%)
Mutual labels:  data-wrangling
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+2629.17%)
Mutual labels:  data-transformation
Qsacnpj
Pacote que trata e organiza os dados do Cadastro Nacional da Pessoa Jurídica (CNPJ)
Stars: ✭ 187 (+679.17%)
Mutual labels:  data-wrangling
advanced-data-wrangling-in-R-legacy
Advanced-data-wrangling-in-R, Workshop
Stars: ✭ 14 (-41.67%)
Mutual labels:  data-wrangling
Sjmisc
Data transformation and utility functions for R
Stars: ✭ 141 (+487.5%)
Mutual labels:  data-wrangling
exemplary-ml-pipeline
Exemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-4.17%)
Mutual labels:  data-cleaning
Data-Science-101
Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Stars: ✭ 19 (-20.83%)
Mutual labels:  data-wrangling
Hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
Stars: ✭ 1,678 (+6891.67%)
Mutual labels:  data-wrangling
naas
⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+812.5%)
Mutual labels:  data-transformation
R Novice Gapminder
R for Reproducible Scientific Analysis
Stars: ✭ 127 (+429.17%)
Mutual labels:  data-wrangling
Python Ecology Lesson
Data Analysis and Visualization in Python for Ecologists
Stars: ✭ 116 (+383.33%)
Mutual labels:  data-wrangling
machine-learning-data-pipeline
Pipeline module for parallel real-time data processing for machine learning models development and production purposes.
Stars: ✭ 22 (-8.33%)
Mutual labels:  data-preparation
pipe envy
Elixir style pipe operator for Ruby
Stars: ✭ 46 (+91.67%)
Mutual labels:  data-transformation
cq
Clojure Command-line Data Processor for JSON, YAML, EDN, XML and more
Stars: ✭ 111 (+362.5%)
Mutual labels:  data-transformation
1-60 of 101 similar projects