
sberbank-ai / Ner Bert

License: MIT
BERT-NER (ner-bert) with Google BERT: https://github.com/google-research.

Programming Languages

python
139335 projects - #7 most used programming language
python3
1442 projects

Projects that are alternatives to or similar to Ner Bert

Deep learning nlp
Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP
Stars: ✭ 407 (+20.06%)
Mutual labels:  jupyter-notebook, attention, nmt
Bert Sklearn
a sklearn wrapper for Google's BERT model
Stars: ✭ 182 (-46.31%)
Mutual labels:  jupyter-notebook, transfer-learning, ner
Sru Deeplearning Workshop
A 12-hour deep learning course with the Keras framework
Stars: ✭ 66 (-80.53%)
Mutual labels:  jupyter-notebook, classification, transfer-learning
Nlp Journey
Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Stars: ✭ 1,290 (+280.53%)
Mutual labels:  classification, attention, ner
Skin Cancer Image Classification
Skin cancer classification using Inceptionv3
Stars: ✭ 16 (-95.28%)
Mutual labels:  jupyter-notebook, classification, transfer-learning
Machine Learning
My Attempt(s) In The World Of ML/DL....
Stars: ✭ 78 (-76.99%)
Mutual labels:  jupyter-notebook, classification, attention
Keras transfer cifar10
Object classification with CIFAR-10 using transfer learning
Stars: ✭ 120 (-64.6%)
Mutual labels:  jupyter-notebook, classification, transfer-learning
Nemo
NeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+987.02%)
Mutual labels:  jupyter-notebook, nmt
Jddc solution 4th
4th place solution of the 2018 JDDC competition
Stars: ✭ 235 (-30.68%)
Mutual labels:  jupyter-notebook, attention
Malaya
Natural Language Toolkit for bahasa Malaysia, https://malaya.readthedocs.io/
Stars: ✭ 239 (-29.5%)
Mutual labels:  jupyter-notebook, ner
MoeFlow
Repository for anime characters recognition website, powered by TensorFlow
Stars: ✭ 113 (-66.67%)
Mutual labels:  classification, transfer-learning
Deeppicar
Deep Learning Autonomous Car based on Raspberry Pi, SunFounder PiCar-V Kit, TensorFlow, and Google's EdgeTPU Co-Processor
Stars: ✭ 242 (-28.61%)
Mutual labels:  jupyter-notebook, transfer-learning
verseagility
Ramp up your custom natural language processing (NLP) task, allowing you to bring your own data, use your preferred frameworks and bring models into production.
Stars: ✭ 23 (-93.22%)
Mutual labels:  classification, ner
Timeseries fastai
fastai V2 implementation of Timeseries classification papers.
Stars: ✭ 221 (-34.81%)
Mutual labels:  jupyter-notebook, classification
Doc Han Att
Hierarchical Attention Networks for Chinese Sentiment Classification
Stars: ✭ 206 (-39.23%)
Mutual labels:  jupyter-notebook, attention
Pytorch Bert Crf Ner
A Korean named entity recognizer built with KoBERT and CRF (BERT+CRF based named entity recognition model for Korean)
Stars: ✭ 236 (-30.38%)
Mutual labels:  jupyter-notebook, ner
Graph attention pool
Attention over nodes in Graph Neural Networks using PyTorch (NeurIPS 2019)
Stars: ✭ 186 (-45.13%)
Mutual labels:  jupyter-notebook, attention
Pytorch Seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
Stars: ✭ 3,418 (+908.26%)
Mutual labels:  jupyter-notebook, attention
neuralBlack
A multi-class brain tumor classifier using a convolutional neural network, achieving 99% accuracy via transfer learning with Python and the PyTorch deep learning framework
Stars: ✭ 36 (-89.38%)
Mutual labels:  classification, transfer-learning
NTUA-slp-nlp
💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (-94.4%)
Mutual labels:  attention, transfer-learning

0. Papers

There are two solutions based on this architecture.

  1. The solution and paper for the BSNLP 2019 ACL workshop multilingual shared task.
  2. The second-place solution and paper for the Dialogue AGRR-2019 task.

1. Description

This repository contains a solution to the NER task based on a PyTorch reimplementation of Google's TensorFlow repository for the BERT model, which was released together with the paper BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding by Jacob Devlin, Ming-Wei Chang, Kenton Lee and Kristina Toutanova.

This implementation can load any pre-trained TensorFlow checkpoint for BERT (in particular Google's pre-trained models).
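For example, a Google TensorFlow checkpoint can be converted to PyTorch weights up front. This is a minimal sketch using the Hugging Face transformers package rather than this repository's own tooling; the paths are placeholders, and the conversion step itself requires TensorFlow to be installed:

from transformers import BertConfig, BertForPreTraining, load_tf_weights_in_bert

config = BertConfig.from_json_file("/path/to/bert_config.json")  # config shipped with the Google checkpoint
model = BertForPreTraining(config)
load_tf_weights_in_bert(model, config, "/path/to/bert_model.ckpt")  # reads the TF checkpoint (needs TensorFlow)
model.save_pretrained("/path/to/pytorch_dump")  # writes pytorch_model.bin + config.json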

The old version is in the "old" branch.

2. Usage

2.1 Create data

from modules.data import bert_data
data = bert_data.LearnData.create(
    train_df_path=train_df_path,
    valid_df_path=valid_df_path,
    idx2labels_path="/path/to/vocab",
    clear_cache=True
)
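The on-disk format of the train/valid files is not shown here. As a rough, hypothetical illustration (the column names and tag scheme are assumptions; check the repository's example notebooks for the real schema), each row pairs a whitespace-tokenized sentence with an aligned tag sequence:

import pandas as pd

# Hypothetical layout: one sentence per row, labels aligned token-for-token.
train_df = pd.DataFrame({
    "text": ["John lives in New York"],
    "labels": ["B_PER O O B_LOC I_LOC"],
})
train_df.to_csv("train.csv", index=False)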

2.2 Create model

from modules.models.bert_models import BERTBiLSTMAttnCRF
model = BERTBiLSTMAttnCRF.create(len(data.train_ds.idx2label))
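Judging by the results table below, the other architectures should be instantiated the same way. The class name here is inferred from the results table, so verify it against modules/models/bert_models.py:

# Class name inferred from the results table; verify against modules/models/bert_models.py.
from modules.models.bert_models import BERTBiLSTMCRF
model = BERTBiLSTMCRF.create(len(data.train_ds.idx2label))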

2.3 Create Learner

from modules.train.train import NerLearner
num_epochs = 100
learner = NerLearner(
    model, data, "/path/for/save/best/model",
    t_total=num_epochs * len(data.train_dl)  # total optimizer steps, used for the LR warmup schedule
)
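A training step is implied before prediction, since learner.load_model() below restores the best checkpoint saved during training. A minimal sketch, assuming the learner exposes a fit method (the exact signature is an assumption):

learner.fit(epochs=num_epochs)  # assumed API: trains and saves the best model to the path above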

2.4 Predict

from modules.data.bert_data import get_data_loader_for_predict
learner.load_model()  # restore the best checkpoint saved during training
dl = get_data_loader_for_predict(data, df_path="/path/to/df/for/predict")
preds = learner.predict(dl)

2.5 Evaluate

from sklearn_crfsuite.metrics import flat_classification_report
from modules.analyze_utils.utils import bert_labels2tokens, voting_choicer
from modules.analyze_utils.plot_metrics import get_bert_span_report
from modules.analyze_utils.main_metrics import precision_recall_f1


pred_tokens, pred_labels = bert_labels2tokens(dl, preds)
true_tokens, true_labels = bert_labels2tokens(dl, [x.bert_labels for x in dl.dataset])
tokens_report = flat_classification_report(true_labels, pred_labels, digits=4)
print(tokens_report)

results = precision_recall_f1(true_labels, pred_labels)
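The snippet computes results but never displays it, and the voting_choicer and get_bert_span_report imports go unused. A hedged sketch of the span-level evaluation (the (dl, preds) argument order is an assumption based on the imports above):

print(results)  # token-level precision/recall/F1

# Assumed call: get_bert_span_report presumably aggregates subtoken predictions
# into spans (likely via voting_choicer) and reports span-level metrics.
span_report = get_bert_span_report(dl, preds)
print(span_report)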

3. Results

We did not search for the best hyperparameters and obtained the following results.

Model | Dataset | Dev F1 tok | Dev F1 span | Test F1 tok | Test F1 span

OURS
M-BERTCRF-IO | FactRuEval | - | - | 0.8543 | 0.8409
M-BERTNCRF-IO | FactRuEval | - | - | 0.8637 | 0.8516
M-BERTBiLSTMCRF-IO | FactRuEval | - | - | 0.8835 | 0.8718
M-BERTBiLSTMNCRF-IO | FactRuEval | - | - | 0.8632 | 0.8510
M-BERTAttnCRF-IO | FactRuEval | - | - | 0.8503 | 0.8346
M-BERTBiLSTMAttnCRF-IO | FactRuEval | - | - | 0.8839 | 0.8716
M-BERTBiLSTMAttnNCRF-IO | FactRuEval | - | - | 0.8807 | 0.8680
M-BERTBiLSTMAttnCRF-fit_BERT-IO | FactRuEval | - | - | 0.8823 | 0.8709
M-BERTBiLSTMAttnNCRF-fit_BERT-IO | FactRuEval | - | - | 0.8583 | 0.8456
BERTBiLSTMCRF-IO | CoNLL-2003 | 0.9629 | - | 0.9221 | -
B-BERTBiLSTMCRF-IO | CoNLL-2003 | 0.9635 | - | 0.9229 | -
B-BERTBiLSTMAttnCRF-IO | CoNLL-2003 | 0.9614 | - | 0.9237 | -
B-BERTBiLSTMAttnNCRF-IO | CoNLL-2003 | 0.9631 | - | 0.9249 | -

Current SOTA
DeepPavlov-RuBERT-NER | FactRuEval | - | - | - | 0.8266
CSE | CoNLL-2003 | - | - | 0.931 | -
BERT-LARGE | CoNLL-2003 | 0.966 | - | 0.928 | -
BERT-BASE | CoNLL-2003 | 0.964 | - | 0.924 | -