Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (+69%)
Model Describermodel-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-78%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (+13%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+5556%)
kmeansA simple implementation of K-means (and Bisecting K-means) clustering algorithm in Python
Stars: ✭ 18 (-82%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-57%)
PretzelJavascript full-stack framework for Big Data visualisation and analysis
Stars: ✭ 26 (-74%)
Php MlPHP-ML - Machine Learning library for PHP
Stars: ✭ 7,900 (+7800%)
Awesome Datascience📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+17420%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+754%)
Linkedingiveaway👨🏽🏫You can learn about anything over here. What Giveaways I do and why it's important in today's modern world. Are you interested in Giveaway's?🔋
Stars: ✭ 67 (-33%)
PzadКурс "Прикладные задачи анализа данных" (ВМК, МГУ имени М.В. Ломоносова)
Stars: ✭ 160 (+60%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+604%)
BiolitmapCode for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-82%)
deepvismachine learning algorithms in Swift
Stars: ✭ 54 (-46%)
Fantasy Basketball Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (+46%)
Pydataroadopen source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (+202%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+1411%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+71%)
Gwu data miningMaterials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+117%)
VerticapyVerticaPy is a Python library that exposes sci-kit like functionality to conduct data science projects on data stored in Vertica, thus taking advantage Vertica’s speed and built-in analytics and machine learning capabilities.
Stars: ✭ 59 (-41%)
DeepgraphAnalyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+132%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (+354%)
PyodA Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+4983%)
Orange3🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+3052%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+2951%)
MlxtendA library of extension and helper modules for Python's data analysis and machine learning libraries.
Stars: ✭ 3,729 (+3629%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+118%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (+37%)
Knowage ServerKnowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Stars: ✭ 276 (+176%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+441%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+1138%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (+522%)
EngsoccerdataEnglish and European soccer results 1871-2020
Stars: ✭ 615 (+515%)
AethosAutomated Data Science and Machine Learning library to optimize workflow.
Stars: ✭ 94 (-6%)
Data Science CareerCareer Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
Stars: ✭ 630 (+530%)
DataprooferA proofreader for your data
Stars: ✭ 628 (+528%)
RoughvizReusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.
Stars: ✭ 6,022 (+5922%)
Pyclusteringpyclustring is a Python, C++ data mining library.
Stars: ✭ 806 (+706%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (+513%)
SocratA Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-74%)
Papers Literature Ml Dl Rl AiHighly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Stars: ✭ 1,341 (+1241%)
AutodlAutomated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (+754%)
VectorbtUltimate Python library for time series analysis and backtesting at scale
Stars: ✭ 855 (+755%)
AilearningAiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP
Stars: ✭ 32,316 (+32216%)
ClevercsvCleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+787%)
Drugs Recommendation Using ReviewsAnalyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (-65%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+5205%)
DataconfsA list of conferences connected with data worldwide.
Stars: ✭ 36 (-64%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+764%)
Mldmпотоковый курс "Машинное обучение и анализ данных (Machine Learning and Data Mining)" на факультете ВМК МГУ имени М.В. Ломоносова
Stars: ✭ 35 (-65%)
Graph samplingGraph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (-1%)