Pydataroadopen source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (+350.75%)
Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+15538.81%)
Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (+152.24%)
Fantasy Basketball Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (+117.91%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+225.37%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+707.46%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+950.75%)
Model Describermodel-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-67.16%)
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (+49.25%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+1747.76%)
Orange3🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+4604.48%)
Gwu data miningMaterials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+223.88%)
DeepgraphAnalyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+246.27%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+4453.73%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+155.22%)
PzadКурс "Прикладные задачи анализа данных" (ВМК, МГУ имени М.В. Ломоносова)
Stars: ✭ 160 (+138.81%)
Awesome Datascience📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+26049.25%)
BiolitmapCode for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-73.13%)
TeachingTeaching Materials for Dr. Waleed A. Yousef
Stars: ✭ 435 (+549.25%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (+577.61%)
Combo(AAAI' 20) A Python Toolbox for Machine Learning Model Combination
Stars: ✭ 481 (+617.91%)
PyodA Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+7486.57%)
Facebook data analyzerAnalyze facebook copy of your data with ruby language. Download zip file from facebook and get info about friends ranking by message, vocabulary, contacts, friends added statistics and more
Stars: ✭ 515 (+668.66%)
Lets PlotAn open-source plotting library for statistical data.
Stars: ✭ 531 (+692.54%)
Mli ResourcesH2O.ai Machine Learning Interpretability Resources
Stars: ✭ 428 (+538.81%)
TurbodbcTurbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with the Python Database API Specification 2.0.
Stars: ✭ 449 (+570.15%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (+483.58%)
Best Of Ml Python🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Stars: ✭ 6,057 (+8940.3%)
Moderndive bookStatistical Inference via Data Science: A ModernDive into R and the Tidyverse
Stars: ✭ 527 (+686.57%)
SktimeA unified framework for machine learning with time series
Stars: ✭ 4,741 (+6976.12%)
Datasets For Recommender SystemsThis is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)
Stars: ✭ 564 (+741.79%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (+814.93%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (+828.36%)
EngsoccerdataEnglish and European soccer results 1871-2020
Stars: ✭ 615 (+817.91%)
DataprooferA proofreader for your data
Stars: ✭ 628 (+837.31%)
MetabaseThe simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+39904.48%)
Pyclusteringpyclustring is a Python, C++ data mining library.
Stars: ✭ 806 (+1102.99%)
Dash TableA First-Class Interactive DataTable for Dash
Stars: ✭ 382 (+470.15%)
Interpretable machine learning with pythonExamples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Stars: ✭ 530 (+691.04%)
RoughvizReusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.
Stars: ✭ 6,022 (+8888.06%)
SeabornStatistical data visualization in Python
Stars: ✭ 9,007 (+13343.28%)
SocratA Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-61.19%)
PretzelJavascript full-stack framework for Big Data visualisation and analysis
Stars: ✭ 26 (-61.19%)
VectorbtUltimate Python library for time series analysis and backtesting at scale
Stars: ✭ 855 (+1176.12%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+1174.63%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+1189.55%)
Mldmпотоковый курс "Машинное обучение и анализ данных (Machine Learning and Data Mining)" на факультете ВМК МГУ имени М.В. Ломоносова
Stars: ✭ 35 (-47.76%)
ClevercsvCleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+1223.88%)