kwxBERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (-63.33%)
CheXbertCombining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT
Stars: ✭ 51 (-43.33%)
AgentOCR一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.
Stars: ✭ 98 (+8.89%)
toxicityThe world's largest social media toxicity dataset.
Stars: ✭ 135 (+50%)
Transformer-QG-on-SQuADImplement Question Generator with SOTA pre-trained Language Models (RoBERTa, BERT, GPT, BART, T5, etc.)
Stars: ✭ 28 (-68.89%)
bert attn vizVisualize BERT's self-attention layers on text classification tasks
Stars: ✭ 41 (-54.44%)
FasterTransformerTransformer related optimization, including BERT, GPT
Stars: ✭ 1,571 (+1645.56%)
korpatbert특허분야 특화된 한국어 AI언어모델 KorPatBERT
Stars: ✭ 48 (-46.67%)
NLPDataAugmentationChinese NLP Data Augmentation, BERT Contextual Augmentation
Stars: ✭ 94 (+4.44%)
TwinBertpytorch implementation of the TwinBert paper
Stars: ✭ 36 (-60%)
BertSimilarityComputing similarity of two sentences with google's BERT algorithm。利用Bert计算句子相似度。语义相似度计算。文本相似度计算。
Stars: ✭ 348 (+286.67%)
beirA Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Stars: ✭ 738 (+720%)
hugo-noticeA Hugo theme component to display nice notices
Stars: ✭ 138 (+53.33%)
LAMB Optimizer TFLAMB Optimizer for Large Batch Training (TensorFlow version)
Stars: ✭ 119 (+32.22%)
JointIDSFBERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)
Stars: ✭ 55 (-38.89%)
exams-qaA Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering
Stars: ✭ 25 (-72.22%)
HatefulUsersTwitterCode for the paper "Characterizing and Detecting Hateful Users on Twitter"
Stars: ✭ 69 (-23.33%)
ganbertEnhancing the BERT training with Semi-supervised Generative Adversarial Networks
Stars: ✭ 205 (+127.78%)
monkMonk is an elegant and lightweight WordPress translation plugin to make your content reach the world.
Stars: ✭ 15 (-83.33%)
limaThe Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Stars: ✭ 75 (-16.67%)
cmrc2019A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)
Stars: ✭ 118 (+31.11%)
R-ATRegularized Adversarial Training
Stars: ✭ 19 (-78.89%)
oreilly-bert-nlpThis repository contains code for the O'Reilly Live Online Training for BERT
Stars: ✭ 19 (-78.89%)
integreat-cmsSimplified content management back end for the Integreat App - a multilingual information platform for newcomers
Stars: ✭ 46 (-48.89%)
question generatorAn NLP system for generating reading comprehension questions
Stars: ✭ 188 (+108.89%)
i18n-language.jsi18n-language.js is Simple i18n language with Vanilla Javascript
Stars: ✭ 21 (-76.67%)
neural-ranking-kdImproving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation
Stars: ✭ 74 (-17.78%)
sketch-crowdinConnect your Sketch and Crowdin projects together
Stars: ✭ 35 (-61.11%)
CAIL法研杯CAIL2019阅读理解赛题参赛模型
Stars: ✭ 34 (-62.22%)
BERTOverflowA Pre-trained BERT on StackOverflow Corpus
Stars: ✭ 40 (-55.56%)
AliceMindALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Stars: ✭ 1,479 (+1543.33%)
KitanaQAKitanaQA: Adversarial training and data augmentation for neural question-answering models
Stars: ✭ 58 (-35.56%)
BERT-QECode and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".
Stars: ✭ 43 (-52.22%)
sisterSImple SenTence EmbeddeR
Stars: ✭ 66 (-26.67%)
backpropBackprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
Stars: ✭ 229 (+154.44%)
Fill-the-GAP[ACL-WS] 4th place solution to gendered pronoun resolution challenge on Kaggle
Stars: ✭ 13 (-85.56%)
neuro-comma🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺
Stars: ✭ 46 (-48.89%)
TraduXioA participative platform for cultural texts translators
Stars: ✭ 19 (-78.89%)
wisdomifyA BERT-based reverse dictionary of Korean proverbs
Stars: ✭ 95 (+5.56%)
mok-projectMultilingual Onscreen Keyboard Project
Stars: ✭ 27 (-70%)
Sohu20192019搜狐校园算法大赛
Stars: ✭ 26 (-71.11%)
OpenUEOpenUE是一个轻量级知识图谱抽取工具 (An Open Toolkit for Universal Extraction from Text published at EMNLP2020: https://aclanthology.org/2020.emnlp-demos.1.pdf)
Stars: ✭ 274 (+204.44%)
TabFormerCode & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)
Stars: ✭ 209 (+132.22%)
SA-BERTCIKM 2020: Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots
Stars: ✭ 71 (-21.11%)
banglabertThis repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chap…
Stars: ✭ 186 (+106.67%)
DiscEvalDiscourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-80%)