Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.

Stars: ✭ 36 (-10%)

Mutual labels: data-mining

Asclepius

Open Price Comparison for US Hospitals

Stars: ✭ 20 (-50%)

Mutual labels: data-mining

hub-toolbox-python3

Hubness analysis and removal functions

Stars: ✭ 17 (-57.5%)

Mutual labels: data-mining

Tweetfeels

Real-time sentiment analysis in Python using twitter's streaming api

Stars: ✭ 249 (+522.5%)

Mutual labels: data-mining

Awesome Datascience

📝 An awesome Data Science repository to learn and apply for real world problems.

Stars: ✭ 17,520 (+43700%)

Mutual labels: data-mining

Python Projects

some python projects

Stars: ✭ 247 (+517.5%)

Mutual labels: data-mining

conferencias matutinas amlo

CSVs de las versiones estenográficas de las conferencias matutinas del Presidente Andres Manuel López Obrador ( Mañaneras AMLO )

Stars: ✭ 25 (-37.5%)

Mutual labels: data-mining

website-to-json

Converts website to json using jQuery selectors

Stars: ✭ 37 (-7.5%)

Mutual labels: data-mining

TextClassification

基于scikit-learn实现对新浪新闻的文本分类，数据集为100w篇文档，总计10类，测试集与训练集1:1划分。分类算法采用SVM和Bayes，其中Bayes作为baseline。

Stars: ✭ 86 (+115%)

Mutual labels: data-mining

data-mining

Resources for the Data Mining for Bussiness and Governance course.

Stars: ✭ 52 (+30%)

Mutual labels: data-mining

PaperWeeklyAI

📚「@MaiweiAI」Studying papers in the fields of computer vision, NLP, and machine learning algorithms every week.

Stars: ✭ 50 (+25%)

Mutual labels: data-mining

xforest

A super-fast and scalable Random Forest library based on fast histogram decision tree algorithm and distributed bagging framework. It can be used for binary classification, multi-label classification, and regression tasks. This library provides both Python and command line interface to users.

Stars: ✭ 20 (-50%)

Mutual labels: data-mining

sugarcube

Monoidal data processes.

Stars: ✭ 32 (-20%)

Mutual labels: data-mining

non-api-fb-scraper

Scrape public FaceBook posts from any group or user into a .csv file without needing to register for any API access

Stars: ✭ 40 (+0%)

Mutual labels: data-mining

Rule Extraction from Trees

A toolkit for extracting comprehensible rules from tree-based algorithms

Stars: ✭ 34 (-15%)

Mutual labels: data-mining

machine-learning-data-pipeline

Pipeline module for parallel real-time data processing for machine learning models development and production purposes.

Stars: ✭ 22 (-45%)

Mutual labels: data-preprocessing

Orange3

🍊 📊 💡 Orange: Interactive data analysis

Stars: ✭ 3,152 (+7780%)

Mutual labels: data-mining

bsu

🎓Repository for university labs on FAMCS, BSU

Stars: ✭ 91 (+127.5%)

Mutual labels: data-mining

hierarchical-clustering

A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.

Stars: ✭ 62 (+55%)

Mutual labels: data-mining

Reaper

Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

Stars: ✭ 240 (+500%)

Mutual labels: data-mining

Datascience

Curated list of Python resources for data science.

Stars: ✭ 3,051 (+7527.5%)

Mutual labels: data-mining

corpusexplorer2.0

Korpuslinguistik war noch nie so einfach...

Stars: ✭ 16 (-60%)

Mutual labels: data-mining

PyDREAM

Python Implementation of Decay Replay Mining (DREAM)

Stars: ✭ 22 (-45%)

Mutual labels: data-mining

dh-core

Functional data science

Stars: ✭ 123 (+207.5%)

Mutual labels: data-mining

PySPOD

A Python package for spectral proper orthogonal decomposition (SPOD).

Stars: ✭ 50 (+25%)

Mutual labels: data-mining

MetQy

Repository for R package MetQy (read related publication here: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6247936/)

Stars: ✭ 17 (-57.5%)

Mutual labels: data-mining

blinkist-m4a-downloader

Grabs all of the audio files from all of the Blinkist books

Stars: ✭ 100 (+150%)

Mutual labels: data-mining

Heart disease prediction

Heart Disease prediction using 5 algorithms

Stars: ✭ 43 (+7.5%)

Mutual labels: data-mining

Algorithmic-Trading

Algorithmic trading using machine learning.

Stars: ✭ 102 (+155%)

Mutual labels: data-mining

EasyMiner

Easy association rule mining and classification on the web

Stars: ✭ 14 (-65%)

Mutual labels: data-mining

teanaps

자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.

Stars: ✭ 91 (+127.5%)

Mutual labels: data-mining

Data-Mining-on-Social-Media

Python scripts to extract tweets and facebook posts from public users.

Stars: ✭ 99 (+147.5%)

Mutual labels: data-mining

AILA-Artificial-Intelligence-for-Legal-Assistance

Python implementations of the various methods used in FIRE 2019 conference.

Stars: ✭ 39 (-2.5%)

Mutual labels: data-mining

imbalanced-ensemble

Class-imbalanced / Long-tailed ensemble learning in Python. Modular, flexible, and extensible. | 模块化、灵活、易扩展的类别不平衡/长尾机器学习库

Stars: ✭ 199 (+397.5%)

Mutual labels: data-mining

xgboost-smote-detect-fraud

Can we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!

Stars: ✭ 59 (+47.5%)

Mutual labels: data-mining

Semantic-Bus

object flow treatment, data transformation

Stars: ✭ 49 (+22.5%)

Mutual labels: data-mining

hh research

Автоматизация поиска и исследования вакансий с сайта hh.ru (Headhunter) с помощью методов Python. Классификация данных, поиск статистических параметров.

Stars: ✭ 36 (-10%)

Mutual labels: data-mining

software-analytics

A repository with my data analysis results of software artifacts

Stars: ✭ 37 (-7.5%)

Mutual labels: data-mining

iis

Information Inference Service of the OpenAIRE system

Stars: ✭ 16 (-60%)

Mutual labels: data-mining

kenchi

A scikit-learn compatible library for anomaly detection

Stars: ✭ 36 (-10%)

Mutual labels: data-mining

2018-Tencent-Lookalike

2018-腾讯广告算法大赛-相似人群拓展(初赛)：10th/1563 (Top 0.64%)

Stars: ✭ 46 (+15%)

Mutual labels: data-mining

Matminer

Data mining for materials science

Stars: ✭ 251 (+527.5%)

Mutual labels: data-mining

data-exploration-with-apache-drill

Data Exploration with Apache Drill

Stars: ✭ 25 (-37.5%)

Mutual labels: data-mining

interpretable-ml

Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.

Stars: ✭ 17 (-57.5%)

Mutual labels: data-mining

Lasio

Python library for reading and writing well data using Log ASCII Standard (LAS) files

Stars: ✭ 234 (+485%)

Mutual labels: data-mining

Suod

(MLSys' 21) An Acceleration System for Large-scare Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)

Stars: ✭ 245 (+512.5%)

Mutual labels: data-mining

KaliIntelligenceSuite

Kali Intelligence Suite (KIS) shall aid in the fast, autonomous, central, and comprehensive collection of intelligence by executing standard penetration testing tools. The collected data is internally stored in a structured manner to allow the fast identification and visualisation of the collected information.

Stars: ✭ 58 (+45%)

Mutual labels: data-mining

Data Mining Conferences

Ranking, acceptance rate, deadline, and publication tips

Stars: ✭ 236 (+490%)

Mutual labels: data-mining

simon-frontend

💹 SIMON is powerful, flexible, open-source and easy to use machine learning knowledge discovery platform 💻

Stars: ✭ 114 (+185%)

Mutual labels: data-mining

scikit-cycling

Tools to analyze cycling data