
BiLiangLtd / Wordsimilarity

Licence: apache-2.0
A word similarity calculation method based on the HIT (Harbin Institute of Technology) Tongyici Cilin (Extended) thesaurus

Programming Languages

python

Projects that are alternatives to, or similar to, Wordsimilarity

Final word similarity
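A word similarity calculation method that combines Tongyici Cilin (Extended) with HowNet, giving broader vocabulary coverage and more accurate results.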
Stars: ✭ 485 (+112.72%)
Mutual labels:  similarity
Consimilo
A Clojure library for querying large data-sets on similarity
Stars: ✭ 54 (-76.32%)
Mutual labels:  similarity
Tensorflow Ml Nlp
Natural language processing starting with TensorFlow and machine learning (from logistic regression to a Transformer chatbot)
Stars: ✭ 176 (-22.81%)
Mutual labels:  similarity
Python String Similarity
A library implementing different string similarity and distance measures using Python.
Stars: ✭ 546 (+139.47%)
Mutual labels:  similarity
Ml Classify Text Js
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Stars: ✭ 38 (-83.33%)
Mutual labels:  similarity
Nlp Journey
Documents, papers, and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Stars: ✭ 1,290 (+465.79%)
Mutual labels:  similarity
goodreads-toolbox
9 tools for Goodreads.com, for finding people based on the books they’ve read, finding books popular among the people you follow, following new book reviews, etc
Stars: ✭ 56 (-75.44%)
Mutual labels:  similarity
Phash
pHash - the open source perceptual hash library
Stars: ✭ 208 (-8.77%)
Mutual labels:  similarity
Computervision Recipes
Best Practices, code samples, and documentation for Computer Vision.
Stars: ✭ 8,214 (+3502.63%)
Mutual labels:  similarity
Text2vec
text2vec, Chinese text to vector. (A text vectorization tool covering word vectorization, sentence vectorization, and sentence similarity calculation.)
Stars: ✭ 155 (-32.02%)
Mutual labels:  similarity
Dssim
Image similarity comparison simulating human perception (multiscale SSIM in Rust)
Stars: ✭ 668 (+192.98%)
Mutual labels:  similarity
Node Damerau Levenshtein
Damerau-Levenshtein distance function for Node
Stars: ✭ 27 (-88.16%)
Mutual labels:  similarity
Dists
IQA: Deep Image Structure and Texture Similarity Metric
Stars: ✭ 101 (-55.7%)
Mutual labels:  similarity
Recordlinkage
A toolkit for record linkage and duplicate detection in Python
Stars: ✭ 532 (+133.33%)
Mutual labels:  similarity
Synt
Find similar functions and classes in your JavaScript/TypeScript code
Stars: ✭ 178 (-21.93%)
Mutual labels:  similarity
Macropodus
Macropodus, a natural language processing toolkit based on an Albert+BiLSTM+CRF deep learning architecture: Chinese word segmentation (CWS), part-of-speech tagging (POS), named entity recognition (NER), new word discovery, keyword extraction, text summarization, text similarity, a scientific calculator, conversion between Chinese numerals and Arabic (or Roman) numerals, Traditional/Simplified Chinese conversion, and pinyin conversion.
Stars: ✭ 309 (+35.53%)
Mutual labels:  similarity
Rltk
Record Linkage ToolKit (Find and link entities)
Stars: ✭ 71 (-68.86%)
Mutual labels:  similarity
Sensegram
Making sense embedding out of word embeddings using graph-based word sense induction
Stars: ✭ 209 (-8.33%)
Mutual labels:  similarity
Customer Chatbot
Demo of a Chinese intelligent customer-service chatbot with two parts, chitchat and professional Q&A (FAQ), with support for custom components.
Stars: ✭ 198 (-13.16%)
Mutual labels:  similarity
Html Similarity
Compare html similarity using structural and style metrics
Stars: ✭ 152 (-33.33%)
Mutual labels:  similarity

WordSimilarity

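This is a Python implementation of a word similarity calculation method based on the HIT Tongyici Cilin (Extended) thesaurus. It follows this paper:

Tian Jiule et al., 2010. 基于同义词词林的词语相似度计算方法 (A word similarity calculation method based on Tongyici Cilin). Journal of Jilin University (Information Science Edition).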

Installation

pip install WordSimilarity

Usage

from word_similarity import WordSimilarity2010

ws_tool = WordSimilarity2010()

b_a = "抄袭"  # "plagiarize"
b_b = "克隆"  # "clone"
sim_b = ws_tool.similarity(b_a, b_b)
print(b_a, b_b, '相似度为', sim_b)  # '相似度为' means "similarity is"
# 抄袭 克隆 相似度为 0.585642777645155

w_a = '人民'  # "the people"
# Candidate words to compare against w_a
sample_list = ["国民", "群众", "党群", "良民", "同志", "成年人", "市民", "亲属", "志愿者", "先锋"]

for s_a in sample_list:
    sim_a = ws_tool.similarity(w_a, s_a)
    print(w_a, s_a, '相似度为', sim_a)
# 人民 国民 相似度为 1
# 人民 群众 相似度为 0.9576614882494312
# 人民 党群 相似度为 0.8978076452338418
# 人民 良民 相似度为 0.7182461161870735
# 人民 同志 相似度为 0.6630145969121822
# 人民 成年人 相似度为 0.6306922220793977
# 人民 市民 相似度为 0.5405933332109123
# 人民 亲属 相似度为 0.36039555547394153
# 人民 志愿者 相似度为 0.22524722217121346
# 人民 先锋 相似度为 0.18019777773697077
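
The loop above prints scores in the order the candidates are listed. A common next step is to rank the candidates by score. The sketch below is one possible way to do that. The helper `rank_by_similarity` and the stand-in scorer `toy_similarity` are hypothetical names that are not part of WordSimilarity. The stand-in is a simple character-overlap (Jaccard) measure, not the Cilin-based method, and is included only so the sketch runs without the package installed.

```python
from typing import Callable, Iterable, List, Tuple


def rank_by_similarity(
    word: str,
    candidates: Iterable[str],
    similarity_fn: Callable[[str, str], float],
) -> List[Tuple[str, float]]:
    """Return (candidate, score) pairs sorted from most to least similar."""
    scored = [(c, similarity_fn(word, c)) for c in candidates]
    # sorted() is stable, so candidates with equal scores keep their input order.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def toy_similarity(a: str, b: str) -> float:
    """Hypothetical stand-in scorer: Jaccard overlap of the characters in a and b."""
    chars_a, chars_b = set(a), set(b)
    union = chars_a | chars_b
    return len(chars_a & chars_b) / len(union) if union else 0.0


# Demo with the stand-in scorer; the scores differ from the Cilin-based ones.
for candidate, score in rank_by_similarity("人民", ["先锋", "国民", "市民"], toy_similarity):
    print(candidate, round(score, 3))
```

With the package installed, the same helper should accept `ws_tool.similarity` from the usage example in place of `toy_similarity`, since it only needs a callable that takes two words and returns a number.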
        

Similar projects

Acknowledgements:

The implementation of this code owes thanks to the following authors:
