Powerful unsupervised domain adaptation method for dense retrieval. Requires only an unlabeled corpus and yields massive improvements: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

Stars: ✭ 216 (+440%)

Mutual labels: bert

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Stars: ✭ 2,828 (+6970%)

Mutual labels: bert

bert-sentiment

Fine-grained Sentiment Classification Using BERT

Stars: ✭ 49 (+22.5%)

Mutual labels: bert

question generator

An NLP system for generating reading comprehension questions

Stars: ✭ 188 (+370%)

Mutual labels: bert

wrench

WRENCH: Weak supeRvision bENCHmark

Stars: ✭ 185 (+362.5%)

Mutual labels: sequence-labeling

CrossNER

CrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)

Stars: ✭ 87 (+117.5%)

Mutual labels: sequence-labeling

pn-summary

A well-structured summarization dataset for the Persian language!

Stars: ✭ 29 (-27.5%)

Mutual labels: bert

neural-ranking-kd

Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation

Stars: ✭ 74 (+85%)

Mutual labels: bert

DrFAQ

DrFAQ is a plug-and-play question answering NLP chatbot that can be generally applied to any organisation's text corpora.

Stars: ✭ 29 (-27.5%)

Mutual labels: bert

NLP-paper

🎨 🎨 NLP (natural language processing) tutorial 🎨🎨 https://dataxujing.github.io/NLP-paper/

Stars: ✭ 23 (-42.5%)

Mutual labels: bert

Bert-model-code-interpretation

A walkthrough of the data flow in modeling.py from the TensorFlow version of BERT

Stars: ✭ 19 (-52.5%)

Mutual labels: bert

CheXbert

Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT

Stars: ✭ 51 (+27.5%)

Mutual labels: bert

AiSpace

AiSpace: Better practices for deep learning model development and deployment for TensorFlow 2.0

Stars: ✭ 28 (-30%)

Mutual labels: bert

KitanaQA

KitanaQA: Adversarial training and data augmentation for neural question-answering models

Stars: ✭ 58 (+45%)

Mutual labels: bert

GLUE-bert4keras

GLUE benchmark code based on bert4keras

Stars: ✭ 59 (+47.5%)

Mutual labels: bert

TriB-QA

We're serious about bragging

Stars: ✭ 45 (+12.5%)

Mutual labels: bert

protonet-bert-text-classification

Fine-tune BERT for small-dataset text classification in a few-shot learning manner using ProtoNet

Stars: ✭ 28 (-30%)

Mutual labels: bert

consistency

Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models

Stars: ✭ 26 (-35%)

Mutual labels: bert

Nlp Architect

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

Stars: ✭ 2,768 (+6820%)

Mutual labels: bert

korpatbert

KorPatBERT: a Korean AI language model specialized for the patent domain

Stars: ✭ 48 (+20%)

Mutual labels: bert

Bertviz

Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)

Stars: ✭ 3,443 (+8507.5%)

Mutual labels: bert

Romanian-Transformers

This repo is the home of Romanian Transformers.

Stars: ✭ 60 (+50%)

Mutual labels: bert

TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)

Stars: ✭ 209 (+422.5%)

Mutual labels: bert

Albert zh

A Lite BERT for Self-Supervised Learning of Language Representations; a large collection of Chinese pre-trained ALBERT models

Stars: ✭ 3,500 (+8650%)

Mutual labels: bert

Pycorrector

pycorrector is a toolkit for text error correction. It implements models including Kenlm, Seq2Seq_Attention, BERT, MacBERT, ELECTRA, ERNIE, and Transformer, and works out of the box.

Stars: ✭ 2,857 (+7042.5%)

Mutual labels: bert

NAG-BERT

[EACL'21] Non-Autoregressive with Pretrained Language Model

Stars: ✭ 47 (+17.5%)

Mutual labels: bert

Awesome Sentence Embedding

A curated list of pretrained sentence and word embedding models

Stars: ✭ 1,973 (+4832.5%)

Mutual labels: bert

R-AT

Regularized Adversarial Training

Stars: ✭ 19 (-52.5%)

Mutual labels: bert

Mt Dnn

Multi-Task Deep Neural Networks for Natural Language Understanding

Stars: ✭ 1,871 (+4577.5%)

Mutual labels: bert

BiLSTM-CRF-NER-PyTorch

This repo contains a PyTorch implementation of a BiLSTM-CRF model for the named entity recognition task.

Stars: ✭ 109 (+172.5%)

Mutual labels: bilstm-crf

Roberta zh

Chinese pre-trained RoBERTa models: RoBERTa for Chinese

Stars: ✭ 1,953 (+4782.5%)

Mutual labels: bert

GEANet-BioMed-Event-Extraction

Code for the paper Biomedical Event Extraction with Hierarchical Knowledge Graphs

Stars: ✭ 52 (+30%)

Mutual labels: bert

Haystack

🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.

Stars: ✭ 3,409 (+8422.5%)

Mutual labels: bert

roberta-wwm-base-distill

This is a RoBERTa-wwm-base distilled model, which was distilled from RoBERTa-wwm by RoBERTa-wwm-large.

Stars: ✭ 61 (+52.5%)

Mutual labels: bert

Bert As Service

Mapping a variable-length sentence to a fixed-length vector using BERT model

Stars: ✭ 9,779 (+24347.5%)

Mutual labels: bert

datagrand bert

5th-place code for the information extraction task of the 2019 Datagrand Cup

Stars: ✭ 20 (-50%)

Mutual labels: bert

Nlp chinese corpus

Large Scale Chinese Corpus for NLP

Stars: ✭ 6,656 (+16540%)

Mutual labels: bert

tensorflow-ml-nlp-tf2

Practice materials for "Natural Language Processing with TensorFlow 2 and Machine Learning (from Logistic Regression to BERT and GPT-3)"

Stars: ✭ 245 (+512.5%)

Mutual labels: bert

Tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Stars: ✭ 5,077 (+12592.5%)

Mutual labels: bert

bert attn viz

Visualize BERT's self-attention layers on text classification tasks

Stars: ✭ 41 (+2.5%)

Mutual labels: bert

Ernie

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Stars: ✭ 4,659 (+11547.5%)

Mutual labels: bert

CAIL

Entry model for the reading comprehension track of the CAIL2019 (Challenge of AI in Law) competition

Stars: ✭ 34 (-15%)

Mutual labels: bert

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Stars: ✭ 1,479 (+3597.5%)

Mutual labels: bert

embedding study

A study of generating character embeddings with Chinese pre-trained models, testing how BERT and ELMo perform on Chinese