This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingual and Cross-lingual Reading Comprehension on Novels"

Stars: ✭ 20 (-69.7%)

Mutual labels: reading-comprehension

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

Stars: ✭ 2,317 (+3410.61%)

Mutual labels: bert

datagrand bert

2019达观杯信息提取第5名代码

Stars: ✭ 20 (-69.7%)

Mutual labels: bert

co-attention

Pytorch implementation of "Dynamic Coattention Networks For Question Answering"

Stars: ✭ 54 (-18.18%)

Mutual labels: reading-comprehension

DiscEval

Discourse Based Evaluation of Language Understanding

Stars: ✭ 18 (-72.73%)

Mutual labels: bert

PersianQA

Persian (Farsi) Question Answering Dataset (+ Models)

Stars: ✭ 114 (+72.73%)

Mutual labels: reading-comprehension

beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Stars: ✭ 738 (+1018.18%)

Mutual labels: bert

extractive rc by runtime mt

Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"

Stars: ✭ 36 (-45.45%)

Mutual labels: reading-comprehension

Romanian-Transformers

This repo is the home of Romanian Transformers.

Stars: ✭ 60 (-9.09%)

Mutual labels: bert

Transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Stars: ✭ 55,742 (+84357.58%)

Mutual labels: bert

BERTOverflow

A Pre-trained BERT on StackOverflow Corpus

Stars: ✭ 40 (-39.39%)

Mutual labels: bert

Pytorch Sentiment Analysis

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

Stars: ✭ 3,209 (+4762.12%)

Mutual labels: bert

bert for corrector

基于bert进行中文文本纠错

Stars: ✭ 199 (+201.52%)

Mutual labels: bert

Texar

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

Stars: ✭ 2,236 (+3287.88%)

Mutual labels: bert

LAMB Optimizer TF

LAMB Optimizer for Large Batch Training (TensorFlow version)

Stars: ✭ 119 (+80.3%)

Mutual labels: bert

Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Stars: ✭ 2,235 (+3286.36%)

Mutual labels: bert

mixed-language-training

Attention-Informed Mixed-Language Training for Zero-shot Cross-lingual Task-oriented Dialogue Systems (AAAI-2020)

Stars: ✭ 29 (-56.06%)

Mutual labels: cross-lingual

embedding study

中文预训练模型生成字向量学习，测试BERT，ELMO的中文效果

Stars: ✭ 94 (+42.42%)

Mutual labels: bert

Clue

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Stars: ✭ 2,425 (+3574.24%)

Mutual labels: bert

Transformer-QG-on-SQuAD

Implement Question Generator with SOTA pre-trained Language Models (RoBERTa, BERT, GPT, BART, T5, etc.)

Stars: ✭ 28 (-57.58%)

Mutual labels: bert

Awesome Bert

bert nlp papers, applications and github resources, including the newst xlnet ， BERT、XLNet 相关论文和 github 项目

Stars: ✭ 1,732 (+2524.24%)

Mutual labels: bert

iamQA

中文wiki百科QA阅读理解问答系统，使用了CCKS2016数据的NER模型和CMRC2018的阅读理解模型，还有W2V词向量搜索,使用torchserve部署

Stars: ✭ 46 (-30.3%)

Mutual labels: bert

Text Classification TF

用tf实现各种文本分类模型，并且封装restful接口，可以直接工程化

Stars: ✭ 32 (-51.52%)

Mutual labels: bert

Fast Bert

Super easy library for BERT based NLP models

Stars: ✭ 1,678 (+2442.42%)

Mutual labels: bert

banglabert

This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chap…

Stars: ✭ 186 (+181.82%)

Mutual labels: bert

Chineseglue

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

Stars: ✭ 1,548 (+2245.45%)

Mutual labels: bert

NLPDataAugmentation

Chinese NLP Data Augmentation， BERT Contextual Augmentation

Stars: ✭ 94 (+42.42%)

Mutual labels: bert

Nlp Tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

Stars: ✭ 9,895 (+14892.42%)

Mutual labels: bert

BiaffineDependencyParsing

BERT+Self-attention Encoder ; Biaffine Decoder ; Pytorch Implement

Stars: ✭ 67 (+1.52%)

Mutual labels: bert

Chinese Bert Wwm

Pre-Training with Whole Word Masking for Chinese BERT（中文BERT-wwm系列模型）

Stars: ✭ 6,357 (+9531.82%)

Mutual labels: bert

neuro-comma

🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺

Stars: ✭ 46 (-30.3%)

Mutual labels: bert

Bert Pytorch

Google AI 2018 BERT pytorch implementation

Stars: ✭ 4,642 (+6933.33%)

Mutual labels: bert

sticker2

Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot

Stars: ✭ 14 (-78.79%)

Mutual labels: bert

Bert Bilstm Crf Ner

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

Stars: ✭ 3,838 (+5715.15%)

Mutual labels: bert

wisdomify

A BERT-based reverse dictionary of Korean proverbs

Stars: ✭ 95 (+43.94%)

Mutual labels: bert

BERT-QE

Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".

Stars: ✭ 43 (-34.85%)

Mutual labels: bert

[EMNLP 2020] "T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack" by Boxin Wang, Hengzhi Pei, Boyuan Pan, Qian Chen, Shuohang Wang, Bo Li

Stars: ✭ 25 (-62.12%)

Mutual labels: bert

OpenUE

OpenUE是一个轻量级知识图谱抽取工具 (An Open Toolkit for Universal Extraction from Text published at EMNLP2020: https://aclanthology.org/2020.emnlp-demos.1.pdf)

Stars: ✭ 274 (+315.15%)

Mutual labels: bert

PIE

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Sequence Transduction": www.aclweb.org/anthology/D19-1435.pdf (EMNLP-IJCNLP 2019)

Stars: ✭ 164 (+148.48%)

Mutual labels: bert

BertSimilarity

Computing similarity of two sentences with google's BERT algorithm。利用Bert计算句子相似度。语义相似度计算。文本相似度计算。

Stars: ✭ 348 (+427.27%)

Mutual labels: bert

gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

Stars: ✭ 216 (+227.27%)

Mutual labels: bert

ChineseNER

中文NER的那些事儿

Stars: ✭ 241 (+265.15%)

Mutual labels: bert

NAG-BERT

[EACL'21] Non-Autoregressive with Pretrained Language Model

Stars: ✭ 47 (-28.79%)

Mutual labels: bert

wechsel

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.