
stevezheng23 / bert_extension_tf

License: Apache-2.0
BERT Extension in TensorFlow

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives to or similar to bert_extension_tf

label-studio-transformers
Label data using HuggingFace's transformers and automatically get a prediction service
Stars: ✭ 117 (+303.45%)
Mutual labels:  bert, natural-language-understanding
text2class
Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT
Stars: ✭ 15 (-48.28%)
Mutual labels:  bert, natural-language-understanding
DiscEval
Discourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-37.93%)
Mutual labels:  bert, natural-language-understanding
classy
classy is a simple-to-use library for building high-performance Machine Learning models in NLP.
Stars: ✭ 61 (+110.34%)
Mutual labels:  bert, natural-language-understanding
Fill-the-GAP
[ACL-WS] 4th place solution to gendered pronoun resolution challenge on Kaggle
Stars: ✭ 13 (-55.17%)
Mutual labels:  bert, natural-language-understanding
Transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+192113.79%)
Mutual labels:  bert, natural-language-understanding
WSDM-Cup-2019
[ACM-WSDM] 3rd place solution at WSDM Cup 2019, Fake News Classification on Kaggle.
Stars: ✭ 62 (+113.79%)
Mutual labels:  bert, natural-language-understanding
Tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Stars: ✭ 5,077 (+17406.9%)
Mutual labels:  bert, natural-language-understanding
Bert As Service
Mapping a variable-length sentence to a fixed-length vector using BERT model
Stars: ✭ 9,779 (+33620.69%)
Mutual labels:  bert, natural-language-understanding
Mt Dnn
Multi-Task Deep Neural Networks for Natural Language Understanding
Stars: ✭ 1,871 (+6351.72%)
Mutual labels:  bert, natural-language-understanding
GLUE-bert4keras
GLUE benchmark code based on bert4keras
Stars: ✭ 59 (+103.45%)
Mutual labels:  bert, natural-language-understanding
embedding study
Learning character embeddings generated by Chinese pretrained models; testing the Chinese performance of BERT and ELMo
Stars: ✭ 94 (+224.14%)
Mutual labels:  bert
TwinBert
pytorch implementation of the TwinBert paper
Stars: ✭ 36 (+24.14%)
Mutual labels:  bert
wisdomify
A BERT-based reverse dictionary of Korean proverbs
Stars: ✭ 95 (+227.59%)
Mutual labels:  bert
MHPC-Natural-Language-Processing-Lectures
This is the second part of the Deep Learning Course for the Master in High-Performance Computing (SISSA/ICTP).
Stars: ✭ 33 (+13.79%)
Mutual labels:  natural-language-understanding
bert attn viz
Visualize BERT's self-attention layers on text classification tasks
Stars: ✭ 41 (+41.38%)
Mutual labels:  bert
ERNIE-text-classification-pytorch
This repo contains a PyTorch implementation of a pretrained ERNIE model for text classification.
Stars: ✭ 49 (+68.97%)
Mutual labels:  bert
cmrc2019
A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)
Stars: ✭ 118 (+306.9%)
Mutual labels:  bert
OpenUE
OpenUE is a lightweight toolkit for knowledge extraction from text (An Open Toolkit for Universal Extraction from Text, published at EMNLP 2020: https://aclanthology.org/2020.emnlp-demos.1.pdf)
Stars: ✭ 274 (+844.83%)
Mutual labels:  bert
Kaleido-BERT
(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.
Stars: ✭ 252 (+768.97%)
Mutual labels:  bert

BERT Extension

BERT (Bidirectional Encoder Representations from Transformers) is an autoencoding pretraining method proposed by the Google AI Language team, which obtained new state-of-the-art results on 11 NLP tasks, ranging from question answering and natural language inference to sentiment analysis. BERT is designed to pretrain deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which allows it to be easily fine-tuned for downstream tasks without substantial task-specific architecture modifications. This project aims to provide extensions built on top of the current BERT release and to bring the power of BERT to other NLP tasks like NER and NLU.

Figure 1: Illustrations of fine-tuning BERT on different tasks
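
As a rough illustration of what such fine-tuning looks like for a sequence-labeling task like NER (a generic sketch in the TensorFlow 1.x style used by this project, not the repository's actual modeling code): a per-token classification layer is placed on top of BERT's final hidden states, and the whole network, including the pretrained encoder, is trained end-to-end.

import tensorflow as tf

def token_classification_head(sequence_output, num_labels, is_training, dropout_prob=0.1):
    # Generic sketch of a per-token classification head on top of the BERT
    # encoder output; not the project's actual modeling code.
    # sequence_output: float Tensor [batch_size, seq_length, hidden_size],
    # i.e. the final hidden states produced by the BERT encoder.
    if is_training:
        sequence_output = tf.nn.dropout(sequence_output, keep_prob=1.0 - dropout_prob)
    # One logit per token per label; a softmax cross-entropy loss against the
    # gold NER tags is then minimized jointly with the pretrained weights.
    logits = tf.layers.dense(sequence_output, num_labels, name="ner_logits")
    return logits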

Setting

  • Python 3.6.7
  • TensorFlow 1.13.1
  • NumPy 1.13.3

Datasets

  • CoNLL2003 is a multi-task dataset with 3 sub-tasks: POS tagging, syntactic chunking, and NER. The NER sub-task covers 4 types of named entities: persons, locations, organizations, and names of miscellaneous entities that do not belong to the previous three groups.
  • ATIS (Airline Travel Information System) is an NLU dataset in the airline travel domain. It contains 4,978 training and 893 test utterances, each classified into one of 26 intents, and each token in an utterance is labeled with one of 128 slot-filling tags in IOB format (a short example follows this list).
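
For illustration, an ATIS-style training instance pairs one intent label with per-token IOB slot tags. The field names below are only for this example, not the repository's data schema.

# Illustrative ATIS-style instance: one utterance, its intent, and IOB slot tags.
example = {
    "tokens": ["show", "flights", "from", "boston", "to", "denver", "on", "monday"],
    "intent": "atis_flight",
    "slots": ["O", "O", "O", "B-fromloc.city_name", "O",
              "B-toloc.city_name", "O", "B-depart_date.day_name"],
}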

Usage

  • Preprocess data
python prepro/prepro_conll.py \
  --data_format json \
  --input_file data/ner/conll2003/raw/eng.xxx \
  --output_file data/ner/conll2003/xxx-conll2003/xxx-conll2003.json
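
For reference, the raw CoNLL-2003 files have one token per line with whitespace-separated columns (token, POS tag, chunk tag, NER tag) and blank lines between sentences. The sketch below shows the general shape of such a raw-to-JSON conversion; the actual JSON schema is whatever prepro/prepro_conll.py produces, so the field names here ("text", "label") are illustrative only.

import json

def conll_to_json(input_file, output_file):
    # Illustrative raw-to-JSON conversion for CoNLL-2003; the real schema is
    # defined by prepro/prepro_conll.py, so treat the field names as placeholders.
    sentences, tokens, tags = [], [], []
    with open(input_file, "r", encoding="utf-8") as fin:
        for line in fin:
            line = line.strip()
            if not line or line.startswith("-DOCSTART-"):
                if tokens:
                    sentences.append({"text": " ".join(tokens), "label": " ".join(tags)})
                    tokens, tags = [], []
                continue
            columns = line.split()  # token, POS tag, chunk tag, NER tag
            tokens.append(columns[0])
            tags.append(columns[-1])
        if tokens:  # flush the last sentence
            sentences.append({"text": " ".join(tokens), "label": " ".join(tags)})
    with open(output_file, "w", encoding="utf-8") as fout:
        json.dump(sentences, fout, indent=2)
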
  • Run experiment
CUDA_VISIBLE_DEVICES=0 python run_ner.py \
    --task_name=conll2003 \
    --do_train=true \
    --do_eval=true \
    --do_predict=true \
    --do_export=true \
    --data_dir=data/ner/conll2003 \
    --vocab_file=model/cased_L-12_H-768_A-12/vocab.txt \
    --bert_config_file=model/cased_L-12_H-768_A-12/bert_config.json \
    --init_checkpoint=model/cased_L-12_H-768_A-12/bert_model.ckpt \
    --max_seq_length=128 \
    --train_batch_size=32 \
    --eval_batch_size=8 \
    --predict_batch_size=8 \
    --learning_rate=2e-5 \
    --num_train_epochs=5.0 \
    --output_dir=output/ner/conll2003/debug \
    --export_dir=output/ner/conll2003/export
  • Visualize summary
tensorboard --logdir=output/ner/conll2003
  • Setup service
docker run -p 8500:8500 \
  -v $(pwd)/output/ner/conll2003/export/xxxxx:/models/ner \
  -e MODEL_NAME=ner \
  -t tensorflow/serving
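
Once the container is running, the served model can be queried. The sketch below uses TensorFlow Serving's REST API and assumes the REST port is also published (add -p 8501:8501 to the docker command above; as written it only exposes the gRPC port 8500). The request feature names depend on the serving signature that run_ner.py exports, so inspect the SavedModel first (for example with saved_model_cli show --dir <export_dir> --all) and adjust the payload accordingly.

import requests

# Hypothetical client sketch for TensorFlow Serving's REST API; the "text"
# feature name below is an assumption, not necessarily what the exported
# serving signature expects.
SERVER_URL = "http://localhost:8501/v1/models/ner:predict"
payload = {"instances": [{"text": "EU rejects German call to boycott British lamb"}]}

response = requests.post(SERVER_URL, json=payload)
response.raise_for_status()
print(response.json())  # per-token NER predictions; format depends on the export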

Experiment

CoNLL2003-NER

Figure 2: Illustrations of fine-tuning BERT on NER task

CoNLL2003 - NER    Avg. (5-run)    Best
Precision          91.37 ± 0.33    91.87
Recall             92.37 ± 0.25    92.68
F1 Score           91.87 ± 0.28    92.27

Table 1: The test set performance of BERT-large finetuned model on CoNLL2003-NER task with setting: batch size = 16, max length = 128, learning rate = 2e-5, num epoch = 5.0

ATIS-NLU

Figure 3: Illustrations of fine-tuning BERT on NLU task

ATIS - NLU           Avg. (5-run)    Best
Accuracy - Intent    97.38 ± 0.19    97.65
F1 Score - Slot      95.61 ± 0.09    95.53

Table 2: The test set performance of BERT-large finetuned model on ATIS-NLU task with setting: batch size = 16, max length = 128, learning rate = 2e-5, num epoch = 5.0
