All Projects → xplore → Similar Projects or Alternatives

54 Open source projects that are alternatives of or similar to xplore

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

Stars: ✭ 54 (+157.14%)

Mutual labels: data-wrangling, data-preprocessing

pandas-workshop

An introductory workshop on pandas with notebooks and exercises for following along.

Stars: ✭ 161 (+666.67%)

Mutual labels: data-wrangling

Stock-Trading-Using-Machine-Learning

A comprehensive approach for stock trading implemented using Neural Network and Reinforcement Learning separately.

Stars: ✭ 20 (-4.76%)

Mutual labels: data-preprocessing

Data-Science-101

Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.

Stars: ✭ 19 (-9.52%)

Mutual labels: data-wrangling

SMMT

Social Media Mining Toolkit (SMMT) main repository

Stars: ✭ 116 (+452.38%)

Mutual labels: data-preprocessing

Data-Wrangling-with-Python

Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices

Stars: ✭ 90 (+328.57%)

Mutual labels: data-wrangling

whyqd

data wrangling simplicity, complete audit transparency, and at speed

Stars: ✭ 16 (-23.81%)

Mutual labels: data-wrangling

timit-preprocessor

Extract mfcc vectors and phones from TIMIT dataset

Stars: ✭ 14 (-33.33%)

Mutual labels: data-preprocessing

Udacity-Data-Analyst-Nanodegree

Repository for the projects needed to complete the Data Analyst Nanodegree.

Stars: ✭ 31 (+47.62%)

Mutual labels: data-wrangling

modelscript

REPO MOVED TO https://github.com/repetere/jsonstack-data - Data Science and Machine learning in JavaScript

Stars: ✭ 40 (+90.48%)

Mutual labels: data-preprocessing

pyrefine

Execute OpenRefine JSON scripts without OpenRefine (or Java)

Stars: ✭ 25 (+19.05%)

Mutual labels: data-wrangling

Data-Analyst-Nanodegree

This repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.

Stars: ✭ 13 (-38.1%)

Mutual labels: data-wrangling

machine-learning-data-pipeline

Pipeline module for parallel real-time data processing for machine learning models development and production purposes.

Stars: ✭ 22 (+4.76%)

Mutual labels: data-preprocessing

sciblox

sciblox - Easier Data Science and Machine Learning

Stars: ✭ 48 (+128.57%)

Mutual labels: data-preprocessing

sql-novice-survey

Databases and SQL

Stars: ✭ 59 (+180.95%)

Mutual labels: data-wrangling

nuts-ml

Flow-based data pre-processing for deep learning

Stars: ✭ 32 (+52.38%)

Mutual labels: data-preprocessing

sql-ecology-lesson

Data Management with SQL for Ecologists

Stars: ✭ 37 (+76.19%)

Mutual labels: data-wrangling

r-novice-inflammation

Programming with R

Stars: ✭ 142 (+576.19%)

Mutual labels: data-wrangling

qsv

CSVs sliced, diced & analyzed.

Stars: ✭ 438 (+1985.71%)

Mutual labels: data-wrangling

optimus

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Stars: ✭ 1,351 (+6333.33%)

Mutual labels: data-wrangling

Data Cleaning 101

Data Cleaning Libraries with Python

Stars: ✭ 243 (+1057.14%)

Mutual labels: data-wrangling

Datatest

Tools for test driven data-wrangling and data validation.

Stars: ✭ 238 (+1033.33%)

Mutual labels: data-wrangling

R Ecology Lesson

Data Analysis and Visualization in R for Ecologists

Stars: ✭ 218 (+938.1%)

Mutual labels: data-wrangling

Qsacnpj

Pacote que trata e organiza os dados do Cadastro Nacional da Pessoa Jurídica (CNPJ)

Stars: ✭ 187 (+790.48%)

Mutual labels: data-wrangling

Web Database Analytics

Web scrapping and related analytics using Python tools

Stars: ✭ 175 (+733.33%)

Mutual labels: data-wrangling

Sjmisc

Data transformation and utility functions for R

Stars: ✭ 141 (+571.43%)

Mutual labels: data-wrangling

Data Forge Js

JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.

Stars: ✭ 139 (+561.9%)

Mutual labels: data-wrangling

Hypertools

A Python toolbox for gaining geometric insights into high-dimensional data

Stars: ✭ 1,678 (+7890.48%)

Mutual labels: data-wrangling

R Novice Gapminder

R for Reproducible Scientific Analysis

Stars: ✭ 127 (+504.76%)

Mutual labels: data-wrangling

Python Ecology Lesson

Data Analysis and Visualization in Python for Ecologists

Stars: ✭ 116 (+452.38%)

Mutual labels: data-wrangling

Python Novice Gapminder

Plotting and Programming in Python

Stars: ✭ 109 (+419.05%)

Mutual labels: data-wrangling

R Raster Vector Geospatial

Introduction to Geospatial Raster and Vector Data with R

Stars: ✭ 76 (+261.9%)

Mutual labels: data-wrangling

Uc R.github.io

Main repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.

Stars: ✭ 76 (+261.9%)

Mutual labels: data-wrangling

Data Science Best Resources

Carefully curated resource links for data science in one place

Stars: ✭ 1,104 (+5157.14%)

Mutual labels: data-wrangling

Openrefine

OpenRefine is a free, open source power tool for working with messy data and improving it

Stars: ✭ 8,531 (+40523.81%)

Mutual labels: data-wrangling

Optimus

🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

Stars: ✭ 986 (+4595.24%)

Mutual labels: data-wrangling

Data Forge Ts

The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.

Stars: ✭ 967 (+4504.76%)

Mutual labels: data-wrangling

Cracking The Data Science Interview

A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

Stars: ✭ 672 (+3100%)

Mutual labels: data-wrangling

Moderndive book

Statistical Inference via Data Science: A ModernDive into R and the Tidyverse

Stars: ✭ 527 (+2409.52%)

Mutual labels: data-wrangling

Prose

Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.

Stars: ✭ 470 (+2138.1%)

Mutual labels: data-wrangling

Sqawk

Like Awk but with SQL and table joins

Stars: ✭ 263 (+1152.38%)

Mutual labels: data-wrangling

mimir

Data-ish exploration through SQL+Uncertainty

Stars: ✭ 26 (+23.81%)

Mutual labels: data-wrangling

The-Data-Visualization-Workshop

A New, Interactive Approach to Learning Data Visualization