All Projects → Xda → Similar Projects or Alternatives

1177 Open source projects that are alternatives of or similar to Xda

Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+7336.61%)
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+1552.68%)
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-2.68%)
Ai Expert Roadmap
Roadmap to becoming an Artificial Intelligence Expert in 2021
Stars: ✭ 15,441 (+13686.61%)
Mutual labels:  data-science, data-analysis
Data Analysis And Machine Learning Projects
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Stars: ✭ 5,166 (+4512.5%)
Mutual labels:  data-science, data-analysis
Dataprep
DataPrep — The easiest way to prepare data in Python
Stars: ✭ 639 (+470.54%)
Resources
PyMC3 educational resources
Stars: ✭ 930 (+730.36%)
Mutual labels:  data-science, data-analysis
Model Describer
model-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-80.36%)
Mutual labels:  data-science, data-analysis
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+671.43%)
Mutual labels:  data-science, data-analysis
Art Data Science
The Art of Data Science
Stars: ✭ 32 (-71.43%)
Mutual labels:  data-science, data-analysis
Openrefine
OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+7516.96%)
Mutual labels:  data-science, data-analysis
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+37966.07%)
Mutual labels:  data-science, data-analysis
Data Science With Ruby
Practical Data Science with Ruby based tools.
Stars: ✭ 549 (+390.18%)
Mutual labels:  data-science, data-analysis
Dataproofer
A proofreader for your data
Stars: ✭ 628 (+460.71%)
Mutual labels:  data-science, data-analysis
Imbalanced Learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+4915.18%)
Mutual labels:  data-science, data-analysis
Skdata
Python tools for data analysis
Stars: ✭ 16 (-85.71%)
Mutual labels:  data-science, data-analysis
Football Data
football (soccer) datasets
Stars: ✭ 18 (-83.93%)
Mutual labels:  data-science, data-analysis
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-4.46%)
Mutual labels:  data-science, data-analysis
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+528.57%)
Mutual labels:  data-science, data-analysis
Data Science Lunch And Learn
Resources for weekly Data Science Lunch & Learns
Stars: ✭ 49 (-56.25%)
Mutual labels:  data-science, data-analysis
Tiledb
The Universal Storage Engine
Stars: ✭ 1,072 (+857.14%)
Mutual labels:  data-science, data-analysis
Datacomparer
dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-48.21%)
Mutual labels:  data-science, data-analysis
Datacamp
🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-38.39%)
Mutual labels:  data-science, data-analysis
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+949.11%)
Mutual labels:  data-science, data-analysis
Dream3d
Data Analysis program and framework for materials science data analytics, based on the managing framework SIMPL framework.
Stars: ✭ 73 (-34.82%)
Mutual labels:  data-science, data-analysis
Dex
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+1005.36%)
Mutual labels:  data-science, data-analysis
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+383.04%)
Mutual labels:  data-science, data-analysis
Data Science Your Way
Ways of doing Data Science Engineering and Machine Learning in R and Python
Stars: ✭ 530 (+373.21%)
Pachyderm
Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+4636.61%)
Mutual labels:  data-science, data-analysis
Rumale
Rumale is a machine learning library in Ruby
Stars: ✭ 526 (+369.64%)
Mutual labels:  data-science, data-analysis
Nfstream
NFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (+455.36%)
Mutual labels:  data-science, data-analysis
Elki
ELKI Data Mining Toolkit
Stars: ✭ 613 (+447.32%)
Mutual labels:  data-science, data-analysis
Ml Da Coursera Yandex Mipt
Machine Learning and Data Analysis Coursera Specialization from Yandex and MIPT
Stars: ✭ 108 (-3.57%)
Mutual labels:  data-science, data-analysis
Dapy
Easy-to-use data analysis / manipulation framework for humans
Stars: ✭ 523 (+366.96%)
Mutual labels:  data-science, data-analysis
Lux
Python API for Intelligent Visual Data Discovery
Stars: ✭ 787 (+602.68%)
Dataframe
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (+639.29%)
Mutual labels:  data-science, data-analysis
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-83.93%)
Mutual labels:  data-science, data-analysis
Awesome Python Data Science
Probably the best curated list of data science software in Python.
Stars: ✭ 812 (+625%)
Mutual labels:  data-science, data-analysis
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+662.5%)
Mutual labels:  data-science, data-analysis
Socrat
A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-76.79%)
Mutual labels:  data-science, data-analysis
Flyte
Accelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
Stars: ✭ 1,242 (+1008.93%)
Mutual labels:  data-science, data-analysis
Fklearn
fklearn: Functional Machine Learning
Stars: ✭ 1,305 (+1065.18%)
Mutual labels:  data-science, data-analysis
Knowledge Repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+4325%)
Mutual labels:  data-science, data-analysis
Python data analysis and mining action
《python数据分析与挖掘实战》的代码笔记
Stars: ✭ 1,027 (+816.96%)
Mutual labels:  data-science, data-analysis
Mathematicavsr
Example projects, code, and documents for comparing Mathematica with R.
Stars: ✭ 41 (-63.39%)
Mutual labels:  data-science, data-analysis
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+860.71%)
Mutual labels:  data-science, data-analysis
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+780.36%)
Mutual labels:  data-science, data-analysis
Awesome Business Intelligence
Actively curated list of awesome BI tools. PRs welcome!
Stars: ✭ 1,157 (+933.04%)
Mutual labels:  data-science, data-analysis
Graphia
A visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-40.18%)
Mutual labels:  data-science, data-analysis
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1094.64%)
Mutual labels:  data-science, data-analysis
Janitor
simple tools for data cleaning in R
Stars: ✭ 981 (+775.89%)
Mutual labels:  data-science, data-analysis
Gopup
数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+997.32%)
Mutual labels:  data-science, data-analysis
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-29.46%)
Mutual labels:  data-science, data-analysis
Kaggle Competitions
There are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (-23.21%)
Hyperlearn
50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+975%)
Mutual labels:  data-science, data-analysis
Gop
GoPlus - The Go+ language for engineering, STEM education, and data science
Stars: ✭ 7,829 (+6890.18%)
Mutual labels:  data-science, data-analysis
Awesome R
A curated list of awesome R packages, frameworks and software.
Stars: ✭ 4,858 (+4237.5%)
Mutual labels:  data-science, data-analysis
Mlcourse.ai
Open Machine Learning Course
Stars: ✭ 7,963 (+7009.82%)
Mutual labels:  data-science, data-analysis
Tsrepr
TSrepr: R package for time series representations
Stars: ✭ 75 (-33.04%)
Mutual labels:  data-science, data-analysis
Bayesian Cognitive Modeling In Pymc3
PyMC3 codes of Lee and Wagenmakers' Bayesian Cognitive Modeling - A Pratical Course
Stars: ✭ 93 (-16.96%)
Mutual labels:  data-science, data-analysis
1-60 of 1177 similar projects