data-miningResources for the Data Mining for Bussiness and Governance course.
Stars: ✭ 52 (+160%)
hpipeWorkflow engine for various computing systems.
Stars: ✭ 26 (+30%)
RavenRAVEN is a flexible and multi-purpose probabilistic risk analysis, uncertainty quantification, parameter optimization and data knowledge-discovering framework.
Stars: ✭ 122 (+510%)
Kddcup 20206th Solution for 2020-KDDCUP Debiasing Challenge
Stars: ✭ 118 (+490%)
chainRecMengting Wan, Julian McAuley, "Item Recommendation on Monotonic Behavior Chains", in Proc. of 2018 ACM Conference on Recommender Systems (RecSys'18), Vancouver, Canada, Oct. 2018.
Stars: ✭ 52 (+160%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (+475%)
Graph-Based-TCGraph-based framework for text classification
Stars: ✭ 24 (+20%)
BellaBella is a pure python post-exploitation data mining tool & remote administration tool for macOS. 🍎💻
Stars: ✭ 112 (+460%)
NIDS-Intrusion-DetectionSimple Implementation of Network Intrusion Detection System. KddCup'99 Data set is used for this project. kdd_cup_10_percent is used for training test. correct set is used for test. PCA is used for dimension reduction. SVM and KNN supervised algorithms are the classification algorithms of project. Accuracy : %83.5 For SVM , %80 For KNN
Stars: ✭ 45 (+125%)
Gitlogg💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'
Stars: ✭ 102 (+410%)
Graph samplingGraph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (+395%)
watson-document-classifierAugment IBM Watson Natural Language Understanding APIs with a configurable mechanism for text classification, uses Watson Studio.
Stars: ✭ 41 (+105%)
MsnoiseA Python Package for Monitoring Seismic Velocity Changes using Ambient Seismic Noise | http://www.msnoise.org
Stars: ✭ 94 (+370%)
dayderSearch lots of data sets for spurious correlations
Stars: ✭ 44 (+120%)
datartDatart is a next generation Data Visualization Open Platform
Stars: ✭ 1,042 (+5110%)
Dc Hi guides[Data Castle 算法竞赛] 精品旅行服务成单预测 final rank 11
Stars: ✭ 83 (+315%)
ml-with-text[Tutorial] Demystifying Natural Language Processing with Python
Stars: ✭ 18 (-10%)
Tsrepr TSrepr: R package for time series representations
Stars: ✭ 75 (+275%)
DataEngineeringThis repo contains commands that data engineers use in day to day work.
Stars: ✭ 47 (+135%)
Bee UniversityProject thu thập điểm chuẩn đại học 2014 - 2018 và phân tích dữ liệu
Stars: ✭ 73 (+265%)
simon-frontend💹 SIMON is powerful, flexible, open-source and easy to use machine learning knowledge discovery platform 💻
Stars: ✭ 114 (+470%)
FfbeDatamining for FFBE GL
Stars: ✭ 69 (+245%)
EvalneSource code for EvalNE, a Python library for evaluating Network Embedding methods.
Stars: ✭ 67 (+235%)
PyDREAMPython Implementation of Decay Replay Mining (DREAM)
Stars: ✭ 22 (+10%)
GendisContains an implementation (sklearn API) of the algorithm proposed in "GENDIS: GEnetic DIscovery of Shapelets" and code to reproduce all experiments.
Stars: ✭ 59 (+195%)
website-to-jsonConverts website to json using jQuery selectors
Stars: ✭ 37 (+85%)
Etherscan MlPython Data Science and Machine Learning Library for the Ethereum and ERC-20 Blockchain
Stars: ✭ 55 (+175%)
Customer-Feedback-AnalysisMulti Class Text (Feedback) Classification using CNN, GRU Network and pre trained Word2Vec embedding, word embeddings on TensorFlow.
Stars: ✭ 18 (-10%)
Php MlPHP-ML - Machine Learning library for PHP
Stars: ✭ 7,900 (+39400%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (+115%)
TorchBlocksA PyTorch-based toolkit for natural language processing
Stars: ✭ 85 (+325%)
HeliomlA book about machine learning, statistics, and data mining for heliophysics
Stars: ✭ 36 (+80%)
Drugs Recommendation Using ReviewsAnalyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (+75%)
tree-huggerA light-weight, extendable, high level, universal code parser built on top of tree-sitter
Stars: ✭ 96 (+380%)
Invoice2dataExtract structured data from PDF invoices
Stars: ✭ 943 (+4615%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+355%)
ClevercsvCleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+4335%)
Awesome Datascience📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+87500%)
comparable-text-minerComparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary translation, documents alignment, corpus information, text classification, tf-idf computation, text similarity computation, html documents cleaning
Stars: ✭ 31 (+55%)
MatminerData mining for materials science
Stars: ✭ 251 (+1155%)
Orange3🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+15660%)
xforestA super-fast and scalable Random Forest library based on fast histogram decision tree algorithm and distributed bagging framework. It can be used for binary classification, multi-label classification, and regression tasks. This library provides both Python and command line interface to users.
Stars: ✭ 20 (+0%)
monkeylearn-javaOfficial Java client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Java apps.
Stars: ✭ 23 (+15%)
HiGitClassHiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)
Stars: ✭ 58 (+190%)
CaverCaver: a toolkit for multilabel text classification.
Stars: ✭ 38 (+90%)