All Projects → HoloClean-Legacy-deprecated → Similar Projects or Alternatives

50 Open source projects that are alternatives of or similar to HoloClean-Legacy-deprecated

optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+1701.33%)
Mutual labels:  data-cleaning
R-Learning-Journey
Some of the projects i made when starting to learn R for Data Science at the university
Stars: ✭ 19 (-74.67%)
Mutual labels:  data-cleaning
watson-discovery-food-reviews
Combine Watson Knowledge Studio and Watson Discovery to discover customer sentiment from product reviews
Stars: ✭ 36 (-52%)
Mutual labels:  data-enrichment
Miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Stars: ✭ 4,633 (+6077.33%)
Mutual labels:  data-cleaning
Voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Stars: ✭ 236 (+214.67%)
Mutual labels:  data-cleaning
Klib
Easy to use Python library of customized functions for cleaning and analyzing data.
Stars: ✭ 192 (+156%)
Mutual labels:  data-cleaning
Machine Learning Workflow With Python
This is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (+109.33%)
Mutual labels:  data-cleaning
Cleanlab
The standard package for machine learning with noisy labels, finding mislabeled data, and uncertainty quantification. Works with most datasets and models.
Stars: ✭ 2,526 (+3268%)
Mutual labels:  data-cleaning
Datamaid
An R package for data screening
Stars: ✭ 120 (+60%)
Mutual labels:  data-cleaning
Pandas Videos
Jupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+2188%)
Mutual labels:  data-cleaning
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+1921.33%)
Mutual labels:  data-cleaning
Refinr
Cluster and merge similar char values: an R implementation of Open Refine clustering algorithms
Stars: ✭ 91 (+21.33%)
Mutual labels:  data-cleaning
Bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (pandas, dask, cuDF, dask-cuDF and PySpark)
Stars: ✭ 86 (+14.67%)
Mutual labels:  data-cleaning
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+1466.67%)
Mutual labels:  data-cleaning
Clean
Fast and Easy Data Cleaning (in R)
Stars: ✭ 49 (-34.67%)
Mutual labels:  data-cleaning
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1214.67%)
Mutual labels:  data-cleaning
Janitor
simple tools for data cleaning in R
Stars: ✭ 981 (+1208%)
Mutual labels:  data-cleaning
Drugs Recommendation Using Reviews
Analyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (-53.33%)
Mutual labels:  data-cleaning
Data Forge Ts
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+1189.33%)
Mutual labels:  data-cleaning
Moodle Local datacleaner
Reduce, filter, and anonymize moodle data for non-prod environments
Stars: ✭ 12 (-84%)
Mutual labels:  data-cleaning
Boltzmannclean
Fill missing values in Pandas DataFrames using Restricted Boltzmann Machines
Stars: ✭ 23 (-69.33%)
Mutual labels:  data-cleaning
Pandera
A light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+574.67%)
Mutual labels:  data-cleaning
Nonechucks
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Stars: ✭ 304 (+305.33%)
Mutual labels:  data-cleaning
Validate
Professional data validation for the R environment
Stars: ✭ 268 (+257.33%)
Mutual labels:  data-cleaning
Dirty cat
Encoding methods for dirty categorical variables
Stars: ✭ 259 (+245.33%)
Mutual labels:  data-cleaning
covid-19-data-cleanup
Scripts to cleanup data from https://github.com/CSSEGISandData/COVID-19
Stars: ✭ 25 (-66.67%)
Mutual labels:  data-cleaning
nepali-translator
Neural Machine Translation on the Nepali-English language pair
Stars: ✭ 29 (-61.33%)
Mutual labels:  data-cleaning
allie
🤖 A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).
Stars: ✭ 93 (+24%)
Mutual labels:  data-cleaning
foofah
Foofah: programming-by-example data transformation program synthesizer
Stars: ✭ 24 (-68%)
Mutual labels:  data-cleaning
OpenRefine-ecology-lesson
Data Cleaning with OpenRefine for Ecologists
Stars: ✭ 20 (-73.33%)
Mutual labels:  data-cleaning
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+60%)
Mutual labels:  data-cleaning
Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Stars: ✭ 31 (-58.67%)
Mutual labels:  data-cleaning
errorlocate
Find and replace erroneous fields in data using validation rules
Stars: ✭ 19 (-74.67%)
Mutual labels:  data-cleaning
objectiv-analytics
Powerful product analytics for data teams, with full control over data & models.
Stars: ✭ 399 (+432%)
Mutual labels:  data-cleaning
exemplary-ml-pipeline
Exemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-69.33%)
Mutual labels:  data-cleaning
Cleaner.jl
A toolbox of simple solutions for common data cleaning problems.
Stars: ✭ 21 (-72%)
Mutual labels:  data-cleaning
FIFA-2019-Analysis
This is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (-62.67%)
Mutual labels:  data-cleaning
Openvino
OpenVINO™ Toolkit repository
Stars: ✭ 2,858 (+3710.67%)
Mutual labels:  inference-engine
gaze-estimation-with-laser-sparking
Deep learning based gaze estimation demo with a fun feature :-)
Stars: ✭ 32 (-57.33%)
Mutual labels:  inference-engine
Torsten
library of C++ functions that support applications of Stan in Pharmacometrics
Stars: ✭ 38 (-49.33%)
Mutual labels:  inference-engine
daisykit
Daisykit is an easy AI toolkit for software engineers to integrate pretrained AI models and pipelines into their projects. - with NCNN, OpenCV, Python wrappers
Stars: ✭ 22 (-70.67%)
Mutual labels:  inference-engine
BMW-IntelOpenVINO-Detection-Inference-API
This is a repository for a No-Code object detection inference API using the OpenVINO. It's supported on both Windows and Linux Operating systems.
Stars: ✭ 66 (-12%)
Mutual labels:  inference-engine
exper
experimental rule-based programming formalism (under construction)
Stars: ✭ 14 (-81.33%)
Mutual labels:  inference-engine
opencv-python-inference-engine
Wrapper package for OpenCV with Inference Engine python bindings.
Stars: ✭ 32 (-57.33%)
Mutual labels:  inference-engine
IDVerification
"Very simple but works well" Computer Vision based ID verification solution provided by LibraX.
Stars: ✭ 44 (-41.33%)
Mutual labels:  inference-engine
lisp-inference
An Inference Engine based on Propositional Calculus written in Common Lisp
Stars: ✭ 36 (-52%)
Mutual labels:  inference-engine
r2inference
RidgeRun Inference Framework
Stars: ✭ 22 (-70.67%)
Mutual labels:  inference-engine
tensorrt-ssd-easy
No description or website provided.
Stars: ✭ 32 (-57.33%)
Mutual labels:  inference-engine
pomagma
An inference engine for extensional untyped λ-calculus
Stars: ✭ 15 (-80%)
Mutual labels:  inference-engine
planer
Powerful Light Artificial NEuRon inference framework for CNN
Stars: ✭ 52 (-30.67%)
Mutual labels:  inference-engine
1-50 of 50 similar projects