All Categories → Data Processing → data-mining

Top 285 data-mining open source projects

Research
novel deep learning research works with PaddlePaddle
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Interpretable machine learning with python
Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Feature Engineering And Feature Selection
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
Rong360
用户贷款风险预测
Combo
(AAAI' 20) A Python Toolbox for Machine Learning Model Combination
Krangl
krangl is a {K}otlin DSL for data w{rangl}ing
Cogcomp Nlp
CogComp's Natural Language Processing libraries and Demos:
Jekyll
Jekyll-based static site for The Programming Historian
Ml From Scratch
Python implementations of some of the fundamental Machine Learning models and algorithms from scratch.
Graph Adversarial Learning Literature
A curated list of adversarial attacks and defenses papers on graph-structured data.
Pyhealth
A Python Library for Health Predictive Models
Graph Fraud Detection Papers
A curated list of fraud detection papers using graph information or graph neural networks
Mlxtend
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Pm4py Core
Public repository for the PM4Py (Process Mining for Python) project.
Ai Learn
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Knowage Server
Knowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Urs
Universal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Lihang algorithms
用python和sklearn两种方法实现李航《统计学习方法》中的算法
Game Datasets
🎮 A curated list of awesome game datasets, and tools to artificial intelligence in games
2018 Dc Datagrand Textintelprocess
2018-DC-“达观杯”文本智能处理挑战赛:冠军 (1st/3131)
Data-mining-python-script
It contain various script on web crawling/ data mining of social web(RSS,facebook,twitter,Linkedin)
crowdsource-video-experiments-on-android
Crowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
twitter-analytics-wrapper
A simple Python wrapper to download tweets data from the Twitter Analytics platform. Particularly interesting for the impressions metrics that are unavailable on current Twitter API. Also works for the videos data.
JobRequirementAnalysis
📉 使用 R 语言从拉勾网看数据挖掘岗位现状
BTM-Java
A java implement of Biterm Topic Model
datamining algorithms
用python实现SVM/AdaBoost/C4.5/CART/Naïve Bayes等数据挖掘领域十大经典算法
SHAP FOLD
(Explainable AI) - Learning Non-Monotonic Logic Programs From Statistical Models Using High-Utility Itemset Mining
data-mining-course
An undergraduate course on data mining.
121-180 of 285 data-mining projects