ffftzh / BTM-Java

Licence: other
A Java implementation of the Biterm Topic Model

Programming Languages

java
68154 projects - #9 most used programming language

Projects that are alternatives of or similar to BTM-Java

FSCNMF
An implementation of "Fusing Structure and Content via Non-negative Matrix Factorization for Embedding Information Networks".
Stars: ✭ 16 (-11.11%)
Mutual labels:  data-mining
anomalyDetection
An R package for implementing augmented network log anomaly detection procedures
Stars: ✭ 21 (+16.67%)
Mutual labels:  data-mining
SHAP FOLD
(Explainable AI) - Learning Non-Monotonic Logic Programs From Statistical Models Using High-Utility Itemset Mining
Stars: ✭ 35 (+94.44%)
Mutual labels:  data-mining
kasthack.osp
Generator of raw dumps of VK users.
Stars: ✭ 15 (-16.67%)
Mutual labels:  data-mining
advanced-text-mining
Covers natural language processing and text analysis methodology using the TEANAPS library.
Stars: ✭ 15 (-16.67%)
Mutual labels:  data-mining
genieclust
Genie++ Fast and Robust Hierarchical Clustering with Noise Point Detection - for Python and R
Stars: ✭ 34 (+88.89%)
Mutual labels:  data-mining
Network-Intrusion-Detection-Using-Machine-Learning-Techniques
Network intrusion classification using algorithms such as Support Vector Machine (SVM), Decision Tree, Naive Bayes, K-Nearest Neighbor (KNN), Logistic Regression and Random Forest.
Stars: ✭ 56 (+211.11%)
Mutual labels:  data-mining
FPGrowth-and-Apriori-algorithm-Association-Rule-Data-Mining
Implementation of FPTree-Growth and Apriori-Algorithm for finding frequent patterns in Transactional Database.
Stars: ✭ 19 (+5.56%)
Mutual labels:  data-mining
jds
Jenesis Data Store: a dynamic, cross platform, high performance, ORM data-mapper. Designed to assist in rapid development and data mining
Stars: ✭ 17 (-5.56%)
Mutual labels:  data-mining
cocoon-demo
Cocoon – a flow-based workflow automation, data mining and visual analytics tool.
Stars: ✭ 19 (+5.56%)
Mutual labels:  data-mining
evine
Interactive CLI Web Crawler
Stars: ✭ 140 (+677.78%)
Mutual labels:  data-mining
Kaggle-project-list
Summary of my projects on kaggle
Stars: ✭ 20 (+11.11%)
Mutual labels:  data-mining
data-mining-course
An undergraduate course on data mining.
Stars: ✭ 24 (+33.33%)
Mutual labels:  data-mining
genie
Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (+16.67%)
Mutual labels:  data-mining
act
Computational synthetic biology: Predicting DNA edits for bioengineering
Stars: ✭ 67 (+272.22%)
Mutual labels:  data-mining
SparseLSH
A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (+605.56%)
Mutual labels:  data-mining
popular restaurants from officials
A project to find genuinely good restaurants by analyzing the business expense records of Seoul city officials.
Stars: ✭ 22 (+22.22%)
Mutual labels:  data-mining
spmf-py
Python SPMF Wrapper 🐍 🎁
Stars: ✭ 35 (+94.44%)
Mutual labels:  data-mining
datamining algorithms
Python implementations of the ten classic data mining algorithms, including SVM, AdaBoost, C4.5, CART and Naïve Bayes.
Stars: ✭ 64 (+255.56%)
Mutual labels:  data-mining
imgur-scraper
Retrieve years of imgur.com's data without any authentication.
Stars: ✭ 26 (+44.44%)
Mutual labels:  data-mining

BTM-Java

A Java implementation of the Biterm Topic Model (BTM). The original BTM implementation is available at https://github.com/xiaohuiyan/BTM

Usage

java BTM [data_path] [topic_num] [alpha] [beta] [iter_num] [instance_num]

  • [data_path] string, path of training docs
  • [topic_num] int, number of topics
  • [alpha] double, Symmetric Dirichlet prior of P(z)
  • [beta] double, Symmetric Dirichlet prior of P(w|z)
  • [iter_num] int, number of iterations of Gibbs sampling
  • [instance_num] int, number of times to run this program

Examples

java BTM sample-data/sample-data.txt 100 0.1 0.01 2000 1

Output

  • 'model-final.theta' Doc*Topic matrix
  • 'model-final.phi' Topic*Word matrix
  • 'model-final.twords' top 20 words of each topic
  • 'model-final.wordmap' word dictionary
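
The matrix outputs can be loaded for downstream analysis. The sketch below assumes each line of 'model-final.theta' or 'model-final.phi' is one matrix row of whitespace-separated decimal numbers. That file layout is an assumption and is not stated in this README, so check an actual output file first. The class `MatrixFileReader` and its methods are hypothetical helpers, not part of BTM-Java.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper: reads a text matrix, one row per line (assumed layout). */
final class MatrixFileReader {

    /** Parses whitespace-separated doubles, skipping blank lines. */
    static double[][] read(Reader source) throws IOException {
        List<double[]> rows = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(source)) {
            String line;
            while ((line = reader.readLine()) != null) {
                line = line.trim();
                if (line.isEmpty()) {
                    continue;
                }
                String[] cells = line.split("\\s+");
                double[] row = new double[cells.length];
                for (int i = 0; i < cells.length; i++) {
                    row[i] = Double.parseDouble(cells[i]);
                }
                rows.add(row);
            }
        }
        return rows.toArray(new double[0][]);
    }

    /** Returns the index of the largest value in a non-empty row (first one on ties). */
    static int argMax(double[] row) {
        int best = 0;
        for (int i = 1; i < row.length; i++) {
            if (row[i] > row[best]) {
                best = i;
            }
        }
        return best;
    }

    public static void main(String[] args) throws IOException {
        // Expects the path to a matrix file, e.g. model-final.theta.
        double[][] theta = read(Files.newBufferedReader(Paths.get(args[0])));
        for (int doc = 0; doc < theta.length; doc++) {
            System.out.println("doc " + doc + " -> topic " + argMax(theta[doc]));
        }
    }
}
```

Run against the Doc*Topic matrix, `main` prints the highest-weight topic index for each document. The same `read` method would apply to the Topic*Word matrix if it uses the same layout.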