Emutomanipulate JSON files
Stars: ✭ 180 (+1185.71%)
WellyWell handling
Stars: ✭ 168 (+1100%)
ChirpInterface to manage and centralize Google Alert information
Stars: ✭ 227 (+1521.43%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+957.14%)
kenchiA scikit-learn compatible library for anomaly detection
Stars: ✭ 36 (+157.14%)
PzadКурс "Прикладные задачи анализа данных" (ВМК, МГУ имени М.В. Ломоносова)
Stars: ✭ 160 (+1042.86%)
Prefixspan PyThe shortest yet efficient Python implementation of the sequential pattern mining algorithm PrefixSpan, closed sequential pattern mining algorithm BIDE, and generator sequential pattern mining algorithm FEAT.
Stars: ✭ 214 (+1428.57%)
Tradingview Data ScraperExtract price and indicator data from TradingView charts to create ML datasets
Stars: ✭ 203 (+1350%)
Suod(MLSys' 21) An Acceleration System for Large-scare Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)
Stars: ✭ 245 (+1650%)
Ail FrameworkAIL framework - Analysis Information Leak framework
Stars: ✭ 191 (+1264.29%)
software-analyticsA repository with my data analysis results of software artifacts
Stars: ✭ 37 (+164.29%)
LasioPython library for reading and writing well data using Log ASCII Standard (LAS) files
Stars: ✭ 234 (+1571.43%)
LightgbmA fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Stars: ✭ 13,293 (+94850%)
sugarcubeMonoidal data processes.
Stars: ✭ 32 (+128.57%)
SpypiAn (un-)ethical hacking-station based on Raspberry Pi and Python
Stars: ✭ 167 (+1092.86%)
Statistical LearningLecture Slides and R Sessions for Trevor Hastie and Rob Tibshinari's "Statistical Learning" Stanford course
Stars: ✭ 223 (+1492.86%)
Etl unicorn数据可视化, 数据挖掘, 数据处理 ETL
Stars: ✭ 156 (+1014.29%)
MatminerData mining for materials science
Stars: ✭ 251 (+1692.86%)
Rosie Pattern LanguageRosie Pattern Language (RPL) and the Rosie Pattern Engine have MOVED!
Stars: ✭ 146 (+942.86%)
Estadistica Con RApuntes personales sobre estadística, machine learning y lenguaje de programación R
Stars: ✭ 201 (+1335.71%)
InstascrapePowerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (+1342.86%)
tonicA Low Profile Component Framework – Stable, minimal, easy to audit, zero-dependencies and build-tool-free.
Stars: ✭ 747 (+5235.71%)
Pyss3A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+1264.29%)
ReaperSocial media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
Stars: ✭ 240 (+1614.29%)
imbalanced-ensembleClass-imbalanced / Long-tailed ensemble learning in Python. Modular, flexible, and extensible. | 模块化、灵活、易扩展的类别不平衡/长尾机器学习库
Stars: ✭ 199 (+1321.43%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+21692.86%)
ChefboostA Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost w/categorical features support for Python
Stars: ✭ 176 (+1157.14%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+1121.43%)
DeepgraphAnalyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+1557.14%)
Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (+1107.14%)
Pipelinethe `pipeline` shell command
Stars: ✭ 168 (+1100%)
Automlpipeline.jlA package that makes it trivial to create and evaluate machine learning pipeline architectures.
Stars: ✭ 223 (+1492.86%)
PdftabextractA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+13964.29%)
Awesome Datascience📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+125042.86%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+91064.29%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+1457.14%)
Sourced Cesource{d} Community Edition (CE)
Stars: ✭ 153 (+992.86%)
Semantic-Busobject flow treatment, data transformation
Stars: ✭ 49 (+250%)
Alimusic🎼天池阿里音乐流行趋势预测大赛,项目中涵盖了从初赛到复赛的全部核心代码。复赛的聚合数据可以在百度网盘下载,更详细的思路介绍欢迎访问我的博客。
Stars: ✭ 147 (+950%)
Gwu data miningMaterials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+1450%)
Fantasy Basketball Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (+942.86%)
Orange3🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+22414.29%)
QminerAnalytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+1371.43%)
PaperWeeklyAI📚「@MaiweiAI」Studying papers in the fields of computer vision, NLP, and machine learning algorithms every week.
Stars: ✭ 50 (+257.14%)
AsclepiusOpen Price Comparison for US Hospitals
Stars: ✭ 20 (+42.86%)
scikit-hubnessA Python package for hubness analysis and high-dimensional data mining
Stars: ✭ 41 (+192.86%)
TweetfeelsReal-time sentiment analysis in Python using twitter's streaming api
Stars: ✭ 249 (+1678.57%)
SmartproxyHTTP(S) Rotating Residential proxies - Code examples & General information
Stars: ✭ 205 (+1364.29%)