
1488 open source projects that are alternatives to or similar to Chain.jl

Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-33.05%)
Mutual labels:  data-science, data-analysis, pipeline
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+4068.64%)
Mutual labels:  data-science, data-analysis, pipeline
Socrat
A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-77.97%)
Mutual labels:  data-science, data-analysis
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+735.59%)
Mutual labels:  data-science, data-analysis
Python data analysis and mining action
Code notes for the book "Python Data Analysis and Mining in Action"
Stars: ✭ 1,027 (+770.34%)
Mutual labels:  data-science, data-analysis
Loandefault Prediction
Lending Club Loan data analysis
Stars: ✭ 113 (-4.24%)
Mutual labels:  data-science, data-analysis
Resources
PyMC3 educational resources
Stars: ✭ 930 (+688.14%)
Mutual labels:  data-science, data-analysis
Mlcourse.ai
Open Machine Learning Course
Stars: ✭ 7,963 (+6648.31%)
Mutual labels:  data-science, data-analysis
Janitor
simple tools for data cleaning in R
Stars: ✭ 981 (+731.36%)
Mutual labels:  data-science, data-analysis
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-3.39%)
Mutual labels:  data-science, data-analysis
Mlbox
MLBox is a powerful Automated Machine Learning python library.
Stars: ✭ 1,199 (+916.1%)
Mutual labels:  data-science, pipeline
Hyperlearn
50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+920.34%)
Mutual labels:  data-science, data-analysis
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-4.24%)
Mutual labels:  data-science, data-analysis
Dataframe
C++ DataFrame for statistical, financial, and ML analysis, written in modern C++ using native types and continuous memory storage, with no pointers involved
Stars: ✭ 828 (+601.69%)
Mutual labels:  data-science, data-analysis
Scikit Learn
scikit-learn: machine learning in Python
Stars: ✭ 48,322 (+40850.85%)
Mutual labels:  data-science, data-analysis
Football Data
football (soccer) datasets
Stars: ✭ 18 (-84.75%)
Mutual labels:  data-science, data-analysis
Ai Expert Roadmap
Roadmap to becoming an Artificial Intelligence Expert in 2021
Stars: ✭ 15,441 (+12985.59%)
Mutual labels:  data-science, data-analysis
Steppy Toolkit
Curated set of transformers that make your work with steppy faster and more effective 🔭
Stars: ✭ 21 (-82.2%)
Mutual labels:  data-science, pipeline
Xda
R package for exploratory data analysis
Stars: ✭ 112 (-5.08%)
Mutual labels:  data-science, data-analysis
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+623.73%)
Mutual labels:  data-science, data-analysis
Datacomparer
dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-50.85%)
Mutual labels:  data-science, data-analysis
Graphia
A visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-43.22%)
Mutual labels:  data-science, data-analysis
Tsrepr
TSrepr: R package for time series representations
Stars: ✭ 75 (-36.44%)
Mutual labels:  data-science, data-analysis
Datacamp
🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-41.53%)
Mutual labels:  data-science, data-analysis
Fklearn
fklearn: Functional Machine Learning
Stars: ✭ 1,305 (+1005.93%)
Mutual labels:  data-science, data-analysis
Bayesian Cognitive Modeling In Pymc3
PyMC3 code for Lee and Wagenmakers' Bayesian Cognitive Modeling: A Practical Course
Stars: ✭ 93 (-21.19%)
Mutual labels:  data-science, data-analysis
Blurr
Data transformations for the ML era
Stars: ✭ 96 (-18.64%)
Mutual labels:  data-science, pipeline
Awesome Python Data Science
Probably the best curated list of data science software in Python.
Stars: ✭ 812 (+588.14%)
Mutual labels:  data-science, data-analysis
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+496.61%)
Mutual labels:  data-science, data-analysis
Skdata
Python tools for data analysis
Stars: ✭ 16 (-86.44%)
Mutual labels:  data-science, data-analysis
Dataproofer
A proofreader for your data
Stars: ✭ 628 (+432.2%)
Mutual labels:  data-science, data-analysis
Model Describer
model-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-81.36%)
Mutual labels:  data-science, data-analysis
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-84.75%)
Mutual labels:  data-science, data-analysis
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+36030.51%)
Mutual labels:  data-science, data-analysis
Nfstream
NFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (+427.12%)
Mutual labels:  data-science, data-analysis
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+1468.64%)
Mutual labels:  data-science, data-analysis
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+6958.47%)
Mutual labels:  data-science, data-analysis
Art Data Science
The Art of Data Science
Stars: ✭ 32 (-72.88%)
Mutual labels:  data-science, data-analysis
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+632.2%)
Mutual labels:  data-science, data-analysis
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-9.32%)
Mutual labels:  data-science, data-analysis
Mlj.jl
A Julia machine learning framework
Stars: ✭ 982 (+732.2%)
Mutual labels:  data-science, pipeline
Mathematicavsr
Example projects, code, and documents for comparing Mathematica with R.
Stars: ✭ 41 (-65.25%)
Mutual labels:  data-science, data-analysis
Elki
ELKI Data Mining Toolkit
Stars: ✭ 613 (+419.49%)
Mutual labels:  data-science, data-analysis
Openrefine
OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+7129.66%)
Mutual labels:  data-science, data-analysis
Drake Examples
Example workflows for the drake R package
Stars: ✭ 57 (-51.69%)
Mutual labels:  data-science, pipeline
Awesome Business Intelligence
Actively curated list of awesome BI tools. PRs welcome!
Stars: ✭ 1,157 (+880.51%)
Mutual labels:  data-science, data-analysis
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+811.86%)
Mutual labels:  data-science, data-analysis
Dream3d
Data analysis program and framework for materials science data analytics, based on the SIMPL managing framework.
Stars: ✭ 73 (-38.14%)
Mutual labels:  data-science, data-analysis
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+895.76%)
Mutual labels:  data-science, data-analysis
Tiledb
The Universal Storage Engine
Stars: ✭ 1,072 (+808.47%)
Mutual labels:  data-science, data-analysis
Drake
An R-focused pipeline toolkit for reproducibility and high-performance computing
Stars: ✭ 1,301 (+1002.54%)
Mutual labels:  data-science, pipeline
Flyte
Accelerate your ML and data workflows to production. Flyte is a production-grade orchestration system for your data and ML workloads. It has been battle-tested at Lyft, Spotify, Freenome, and others, and is truly open source.
Stars: ✭ 1,242 (+952.54%)
Mutual labels:  data-science, data-analysis
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1033.9%)
Mutual labels:  data-science, data-analysis
Dex
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+949.15%)
Mutual labels:  data-science, data-analysis
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-7.63%)
Mutual labels:  data-science, data-analysis
Imbalanced Learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+4660.17%)
Mutual labels:  data-science, data-analysis
Pdpipe
Easy pipelines for pandas DataFrames.
Stars: ✭ 590 (+400%)
Mutual labels:  data-science, pipeline
Data Science Lunch And Learn
Resources for weekly Data Science Lunch & Learns
Stars: ✭ 49 (-58.47%)
Mutual labels:  data-science, data-analysis
Gopup
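Data interfaces: Baidu, Google, Toutiao, and Weibo indices, macroeconomic data, interest rate data, currency exchange rates, Qianlima and unicorn companies, Xinwen Lianbo transcripts, film and TV box office data, university lists, epidemic data…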
Stars: ✭ 1,229 (+941.53%)
Mutual labels:  data-science, data-analysis
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+1184.75%)
Mutual labels:  data-science, data-analysis