rgtjf / Semantic Texual Similarity Toolkits

Licence: MIT
Semantic Textual Similarity (STS) measures the degree of equivalence in the underlying semantics of paired snippets of text.

Semantic Textual Similarity Toolkits

This is the code submitted by the ECNU team to the SemEval STS task.
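
To make the task concrete: an STS system takes two text snippets and returns a similarity score. The following is a minimal sketch of that interface, assuming a plain bag-of-words cosine baseline. It is not the ECNU system; the function names `tokenize` and `bow_cosine` and the example sentences are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of letters/digits (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def bow_cosine(sent_a, sent_b):
    """Cosine similarity between the word-count vectors of two snippets."""
    counts_a = Counter(tokenize(sent_a))
    counts_b = Counter(tokenize(sent_b))
    # Only words present in both snippets contribute to the dot product.
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[w] * counts_b[w] for w in shared)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty snippet has no measurable overlap
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Invented example pairs, not taken from any STS dataset.
    print(bow_cosine("A man is playing a guitar.", "A man plays the guitar."))
    print(bow_cosine("A man is playing a guitar.", "The stock market fell today."))
```

A surface-level baseline like this scores the first pair higher than the second, but it cannot see that "playing" and "plays" mean the same thing, which is the kind of gap that alignment and embedding features aim to close.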

Installation

# download the repo
git clone https://github.com/rgtjf/Semantic-Texual-Similarity-Toolkits.git
# download the dataset and Stanford CoreNLP tools
sh download.sh
# run the demo
python demo.py

Results

You can configure sts_model.py to compare the performance of different features on the STSBenchmark dataset.

STSBenchmark

Methods                  Dev      Test
RF                       0.8333   0.7993
GB                       0.8356   0.8022
EN-seven                 0.8466   0.8100
----------------------   ------   ------
aligner                  0.6991   0.6379
idf_aligner              0.7969   0.7622
BOWFeature-True          0.7584   0.6472
BOWFeature-False         0.7788   0.6874
nGramOverlapFeature      0.7817   0.7453
BOWFeature               0.7639   0.6847
AlignmentFeature         0.8163   0.7748
WordEmbeddingFeature     0.8011   0.7128
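
The table does not name its metric. STS benchmarks are conventionally scored with the Pearson correlation between system scores and gold similarity scores, so the sketch below assumes that. It pairs a simple n-gram overlap feature (Jaccard overlap of word n-gram sets) with a Pearson computation. This is an illustration only: the toolkit's own nGramOverlapFeature may be defined differently, the function names are hypothetical, and the toy pairs and gold scores are invented rather than drawn from STSBenchmark.

```python
import math


def ngrams(tokens, n):
    """Set of word n-grams (as tuples) in a token list."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def ngram_overlap(sent_a, sent_b, n=1):
    """Jaccard overlap of the word n-gram sets of two snippets."""
    grams_a = ngrams(sent_a.lower().split(), n)
    grams_b = ngrams(sent_b.lower().split(), n)
    union = grams_a | grams_b
    if not union:
        return 0.0  # neither snippet is long enough to contain an n-gram
    return len(grams_a & grams_b) / len(union)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Invented toy data: (sentence 1, sentence 2, made-up gold score on a 0-5 scale).
    toy_pairs = [
        ("a man is playing a guitar", "a man is playing a guitar", 5.0),
        ("a man is playing a guitar", "a man plays the guitar", 4.0),
        ("a dog runs in the park", "a man plays the guitar", 0.5),
    ]
    predicted = [ngram_overlap(a, b, n=1) for a, b, _ in toy_pairs]
    gold = [score for _, _, score in toy_pairs]
    print(pearson(predicted, gold))
```

Because Pearson correlation is invariant to linear rescaling, the feature does not need to be mapped onto the gold scale before scoring; only how well the two sets of scores vary together matters.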

Reference

STSBenchmark board

Contacts

If you have any questions, please feel free to contact us: rgtjf1 AT 163 DOT com
