anonymisation: Anonymization of legal cases (Fr) based on Flair embeddings
Stars: ✭ 85 (+325%)
keras-bert-ner: Keras solution of Chinese NER task using BiLSTM-CRF/BiGRU-CRF/IDCNN-CRF model with Pretrained Language Model: supporting BERT/RoBERTa/ALBERT
Stars: ✭ 7 (-65%)
Kashgari: Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification; it includes Word2Vec, BERT, and GPT2 language embeddings.
Stars: ✭ 2,235 (+11075%)
neuro-comma: 🇷🇺 Production-ready punctuation restoration model for the Russian language 🇷🇺
Stars: ✭ 46 (+130%)
trove: Weakly supervised medical named entity classification
Stars: ✭ 55 (+175%)
Bert Bilstm Crf Ner: Tensorflow solution of NER task using BiLSTM-CRF model with Google BERT fine-tuning and private server services
Stars: ✭ 3,838 (+19090%)
DrFAQ: DrFAQ is a plug-and-play question answering NLP chatbot that can be generally applied to any organisation's text corpora.
Stars: ✭ 29 (+45%)
sequence tagging: Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German
Stars: ✭ 25 (+25%)
AiSpace: Better practices for deep learning model development and deployment for Tensorflow 2.0
Stars: ✭ 28 (+40%)
wisdomify: A BERT-based reverse dictionary of Korean proverbs
Stars: ✭ 95 (+375%)
OpenUE: OpenUE is a lightweight knowledge graph extraction toolkit (An Open Toolkit for Universal Extraction from Text, published at EMNLP2020: https://aclanthology.org/2020.emnlp-demos.1.pdf)
Stars: ✭ 274 (+1270%)
Ner Bert Pytorch: PyTorch solution of named entity recognition task using Google AI's pre-trained BERT model.
Stars: ✭ 249 (+1145%)
Pytorch ner bilstm cnn crf: End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF, implemented in PyTorch
Stars: ✭ 249 (+1145%)
LAMB Optimizer TF: LAMB Optimizer for Large Batch Training (TensorFlow version)
Stars: ✭ 119 (+495%)
Malaya: Natural Language Toolkit for Bahasa Malaysia, https://malaya.readthedocs.io/
Stars: ✭ 239 (+1095%)
Bert ner: NER with BERT
Stars: ✭ 240 (+1100%)
Webstruct: NER toolkit for HTML data
Stars: ✭ 230 (+1050%)
FinBERT-QA: Financial Domain Question Answering with pre-trained BERT Language Model
Stars: ✭ 70 (+250%)
gpl: Powerful unsupervised domain adaptation method for dense retrieval. Requires only an unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
Stars: ✭ 216 (+980%)
Ner Datasets: Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
Stars: ✭ 220 (+1000%)
npo classifier: Automated coding using machine-learning and remapping the U.S. nonprofit sector: A guide and benchmark
Stars: ✭ 18 (-10%)
neural name tagging: Code for "Reliability-aware Dynamic Feature Composition for Name Tagging" (ACL2019)
Stars: ✭ 39 (+95%)
KoBERT-NER: NER Task with KoBERT (with Naver NLP Challenge dataset)
Stars: ✭ 76 (+280%)
TwinBert: PyTorch implementation of the TwinBert paper
Stars: ✭ 36 (+80%)
ai explore: Fundamentals of machine learning and deep learning; implementations of recommender-system and NLP algorithms
Stars: ✭ 54 (+170%)
Cool-NLPCV: Some Cool NLP and CV Repositories and Solutions (a collection of open-source solutions, datasets, tools, and learning materials for common NLP tasks)
Stars: ✭ 143 (+615%)
cmrc2019: A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)
Stars: ✭ 118 (+490%)
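Chinese Names Corpus: Chinese personal name corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Usable for Chinese word segmentation and person-name entity recognition.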
Stars: ✭ 3,053 (+15165%)
Kaleido-BERT: (CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.
Stars: ✭ 252 (+1160%)
bert attn viz: Visualize BERT's self-attention layers on text classification tasks
Stars: ✭ 41 (+105%)
Pytorch Bert Crf Ner: Korean named entity recognizer built with KoBERT and CRF (BERT+CRF based Named Entity Recognition model for Korean)
Stars: ✭ 236 (+1080%)
Nlp Tools: 😋 This project implements Chinese word segmentation, part-of-speech tagging, and named entity recognition (NER) with BiLSTM+CRF in Tensorflow.
Stars: ✭ 225 (+1025%)
sister: SImple SenTence EmbeddeR
Stars: ✭ 66 (+230%)
py-lingualytics: A text analytics library with support for codemixed data
Stars: ✭ 36 (+80%)
Spacy Lookup: Named Entity Recognition based on dictionaries
Stars: ✭ 212 (+960%)
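Monpa: MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition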
Stars: ✭ 203 (+915%)
Dataturks: ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.
Stars: ✭ 200 (+900%)
AliceMind: ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Stars: ✭ 1,479 (+7295%)
NLPDataAugmentation: Chinese NLP Data Augmentation, BERT Contextual Augmentation
Stars: ✭ 94 (+370%)
bert-sentiment: Fine-grained Sentiment Classification Using BERT
Stars: ✭ 49 (+145%)
Bertner: ChineseNER based on BERT, with BiLSTM+CRF layer
Stars: ✭ 195 (+875%)
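Marktool: A general-purpose, web-based text annotation tool. It supports large-scale entity annotation, relation annotation, event annotation, text classification, automatic annotation based on dictionary matching and regular-expression matching, and standard-name annotation for normalization, as well as iterative annotation of text and nested annotation of entities. Annotation schemas are customizable and, within the same task type, can be "created once and reused many times". Hierarchical entity sets expand the number of entity types, and a newly designed, efficient annotation method improves user experience and annotation efficiency. The tool also adds a review step in which annotations from multiple annotators can be checked for consistency and adjusted, improving the accuracy and reliability of the annotated corpus.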
Stars: ✭ 190 (+850%)
vietnamese-roberta: A Robustly Optimized BERT Pretraining Approach for Vietnamese
Stars: ✭ 22 (+10%)
Persian Ner: A large annotated corpus for Persian named entity recognition
Stars: ✭ 183 (+815%)
PhoNER COVID19: COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
Stars: ✭ 55 (+175%)
NEMO: Neural Modeling for Named Entities and Morphology (Hebrew NER)
Stars: ✭ 25 (+25%)
Bert Sklearn: a sklearn wrapper for Google's BERT model
Stars: ✭ 182 (+810%)