CollapseAdvanced and Fast Data Transformation in R
Stars: ✭ 184 (-78.45%)
PyodA Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+495.2%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+44.96%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+56.67%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+521.19%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-87.24%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+26%)
Model Describermodel-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-97.42%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-74.47%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-87.47%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-17.56%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-90.75%)
DatasciencevmTools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-82.08%)
Data Science Live BookAn open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-77.4%)
DataprooferA proofreader for your data
Stars: ✭ 628 (-26.46%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+257.26%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (-46.84%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-83.96%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-36.65%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-79.98%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-27.17%)
Knowage ServerKnowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Stars: ✭ 276 (-67.68%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+1.17%)
Ai Learn人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Stars: ✭ 4,387 (+413.7%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (-28.22%)
Tsrepr TSrepr: R package for time series representations
Stars: ✭ 75 (-91.22%)
UrsUniversal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (-67.8%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-86.77%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-85.36%)
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-88.29%)
DeepgraphAnalyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (-72.83%)
Pydataroadopen source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (-64.64%)
BiolitmapCode for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-97.89%)
MlxtendA library of extension and helper modules for Python's data analysis and machine learning libraries.
Stars: ✭ 3,729 (+336.65%)
PretzelJavascript full-stack framework for Big Data visualisation and analysis
Stars: ✭ 26 (-96.96%)
KneedKnee point detection in Python 📈
Stars: ✭ 328 (-61.59%)
Scikit Mobilityscikit-mobility: mobility analysis in Python
Stars: ✭ 339 (-60.3%)
Pandas SummaryAn extension to pandas dataframes describe function.
Stars: ✭ 361 (-57.73%)
Quantitative NotebooksEducational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (-58.31%)
ArticlesA repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (-59.02%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-98.13%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+407.49%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (-59.25%)
DataexplorerAutomate Data Exploration and Treatment
Stars: ✭ 362 (-57.61%)
Data ScienceCollection of useful data science topics along with code and articles
Stars: ✭ 315 (-63.11%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (-54.22%)
SktimeA unified framework for machine learning with time series
Stars: ✭ 4,741 (+455.15%)
Datascience Ai Machinelearning ResourcesAlex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (-51.52%)
Ml From ScratchPython implementations of some of the fundamental Machine Learning models and algorithms from scratch.
Stars: ✭ 20,624 (+2314.99%)
Cogcomp NlpCogComp's Natural Language Processing libraries and Demos:
Stars: ✭ 410 (-51.99%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (-3.04%)
Jupyter pivottablejsDrag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (-49.88%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+2481.73%)