genieclust: Genie++ Fast and Robust Hierarchical Clustering with Noise Point Detection - for Python and R
Stars: ✭ 34 (-45.16%)
graphgrove: A framework for building (and incrementally growing) graph-based data structures used in hierarchical or DAG-structured clustering and nearest neighbor search
Stars: ✭ 29 (-53.23%)
Alink: Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Stars: ✭ 2,936 (+4635.48%)
Clustering-in-Python: Clustering methods in machine learning, with both theory and Python code for each algorithm. Algorithms include K-Means, K-Modes, hierarchical clustering, DBSCAN, and Gaussian Mixture Models (GMM). Interview questions on clustering are included at the end.
Stars: ✭ 27 (-56.45%)
Elki: ELKI Data Mining Toolkit
Stars: ✭ 613 (+888.71%)
Apriori-and-Eclat-Frequent-Itemset-Mining: Python implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions.
Stars: ✭ 36 (-41.94%)
Matrixprofile: A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Stars: ✭ 141 (+127.42%)
NIDS-Intrusion-Detection: A simple implementation of a network intrusion detection system using the KDD Cup '99 data set. kdd_cup_10_percent is used for training, and the correct set is used for testing. PCA is used for dimensionality reduction. SVM and KNN are the supervised classification algorithms. Accuracy: 83.5% for SVM, 80% for KNN.
Stars: ✭ 45 (-27.42%)
SparseLSH: A Locality Sensitive Hashing (LSH) library with an emphasis on large, high-dimensional datasets.
Stars: ✭ 127 (+104.84%)
R: All algorithms implemented in R
Stars: ✭ 294 (+374.19%)
teanaps: An open-source Python library for natural language processing and text analysis.
Stars: ✭ 91 (+46.77%)
kmeans: A simple implementation of the K-means (and Bisecting K-means) clustering algorithm in Python
Stars: ✭ 18 (-70.97%)
Pyclustering: pyclustering is a Python and C++ data mining library.
Stars: ✭ 806 (+1200%)
genie: Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-66.13%)
Data mining: The Ruby DataMining Gem is a small collection of several data mining algorithms
Stars: ✭ 10 (-83.87%)
Orange3: 🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+4983.87%)
scSeqR: This package has migrated to https://github.com/rezakj/iCellR. Please use iCellR instead of scSeqR for more functionality and updates.
Stars: ✭ 16 (-74.19%)
northstar: Single cell type annotation guided by cell atlases, with freedom to be queer
Stars: ✭ 23 (-62.9%)
EgoSplitting: A NetworkX implementation of "Ego-splitting Framework: from Non-Overlapping to Overlapping Clusters" (KDD 2017).
Stars: ✭ 78 (+25.81%)
IntroduceToEclicpseVert.x: This repository contains the code for the Vert.x examples from my articles published on platforms such as kodcu.com, Medium, and DZone. Each example's README file describes how to run it.
Stars: ✭ 27 (-56.45%)
WatsonCluster: A simple C# class using Watson TCP to enable a one-to-one high availability cluster.
Stars: ✭ 18 (-70.97%)
Asclepius: Open Price Comparison for US Hospitals
Stars: ✭ 20 (-67.74%)
consul role: Ansible role to install Consul (cluster of) server/agent
Stars: ✭ 14 (-77.42%)
kohonen-maps: Implementation of SOM and GSOM
Stars: ✭ 62 (+0%)
rabbitmq-clusterer: This project is ABANDONWARE. Use https://www.rabbitmq.com/cluster-formation.html instead.
Stars: ✭ 72 (+16.13%)
enspara: Modeling molecular ensembles with scalable data structures and parallel computing
Stars: ✭ 28 (-54.84%)
DBSCAN: C++ implementation of clustering by DBSCAN
Stars: ✭ 89 (+43.55%)
Clustering4Ever: C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (+103.23%)
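TextClassification: Text classification of Sina News using scikit-learn. The dataset contains 1 million documents in 10 categories, split 1:1 between test and training sets. The classification algorithms are SVM and Bayes, with Bayes serving as the baseline.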
Stars: ✭ 86 (+38.71%)
FixedEffectjlr: R interface for Fixed Effect Models
Stars: ✭ 20 (-67.74%)
RcppML: Rcpp Machine Learning: Fast robust NMF, divisive clustering, and more
Stars: ✭ 52 (-16.13%)
MAL-Map: Cluster and visualize relationships between anime on MyAnimeList
Stars: ✭ 201 (+224.19%)
imbalanced-ensemble: Class-imbalanced / long-tailed ensemble learning in Python. A modular, flexible, and extensible library for class-imbalanced and long-tailed machine learning.
Stars: ✭ 199 (+220.97%)
perke: A keyphrase extractor for Persian
Stars: ✭ 60 (-3.23%)
sugarcube: Monoidal data processes.
Stars: ✭ 32 (-48.39%)
conferencias matutinas amlo: CSVs of the stenographic transcripts of the morning press conferences of President Andrés Manuel López Obrador (Mañaneras AMLO)
Stars: ✭ 25 (-59.68%)
pipeComp: An R framework for pipeline benchmarking, with application to single-cell RNAseq
Stars: ✭ 38 (-38.71%)
clusterix: Visual exploration of clustered data.
Stars: ✭ 44 (-29.03%)
NNet: Algorithms for study: multi-layer perceptron, cluster graph, CNN, RNN, restricted Boltzmann machine, Bayesian network
Stars: ✭ 24 (-61.29%)
Semantic-Bus: Object flow treatment, data transformation
Stars: ✭ 49 (-20.97%)
Revisiting-Contrastive-SSL: Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]
Stars: ✭ 81 (+30.65%)
tsp-essay: A fun study of some heuristics for the Travelling Salesman Problem.
Stars: ✭ 15 (-75.81%)
scikit-hubness: A Python package for hubness analysis and high-dimensional data mining
Stars: ✭ 41 (-33.87%)
mathematics-statistics-for-data-science: Mathematical and statistical topics for performing statistical analysis and tests: linear regression, probability theory, Monte Carlo simulation, statistical sampling, bootstrapping, dimensionality reduction techniques (PCA, FA, CCA), imputation techniques, statistical tests (Kolmogorov-Smirnov), robust estimators (FastMCD), and more, in Python and R.
Stars: ✭ 56 (-9.68%)
ssdc: ssdeep cluster analysis for malware files
Stars: ✭ 24 (-61.29%)
noder: Tiny tool to transform FreeMind mind map files into dendrograms and from there to SVG
Stars: ✭ 24 (-61.29%)
EasyMiner: Easy association rule mining and classification on the web
Stars: ✭ 14 (-77.42%)
ExpressionMatrix2: Software for exploration of gene expression data from single-cell RNA sequencing.
Stars: ✭ 29 (-53.23%)
clueminer: Interactive clustering platform
Stars: ✭ 13 (-79.03%)