This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chap…

Stars: ✭ 186 (+481.25%)

Mutual labels: bert, xlm-roberta

AiSpace

AiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0

Stars: ✭ 28 (-12.5%)

Mutual labels: pretrained-models, bert

roberta-wwm-base-distill

this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large

Stars: ✭ 61 (+90.63%)

Mutual labels: pretrained-models, bert

Pytorch-NLU

Pytorch-NLU，一个中文文本分类、序列标注工具包，支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…

Stars: ✭ 151 (+371.88%)

Mutual labels: pretrained-models, bert

TorchBlocks

A PyTorch-based toolkit for natural language processing

Stars: ✭ 85 (+165.63%)

Mutual labels: bert

Morphos-Blade

Morphos adapter for Blade

Stars: ✭ 32 (+0%)

Mutual labels: morphology

textgo

Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!

Stars: ✭ 33 (+3.13%)

Mutual labels: bert

LightLM

高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task

Stars: ✭ 54 (+68.75%)

Mutual labels: bert

mcQA

🔮 Answering multiple choice questions with Language Models.

Stars: ✭ 23 (-28.12%)

Mutual labels: bert

classy

classy is a simple-to-use library for building high-performance Machine Learning models in NLP.

Stars: ✭ 61 (+90.63%)

Mutual labels: bert

Medi-CoQA

Conversational Question Answering on Clinical Text

Stars: ✭ 22 (-31.25%)

Mutual labels: bert

simplemma

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

Stars: ✭ 32 (+0%)

Mutual labels: lemmatization

text2text

Text2Text: Cross-lingual natural language processing and generation toolkit

Stars: ✭ 188 (+487.5%)

Mutual labels: bert

SemEval2019Task3

Code for ANA at SemEval-2019 Task 3

Stars: ✭ 41 (+28.13%)

Mutual labels: bert

bert experimental

code and supplementary materials for a series of Medium articles about the BERT model

Stars: ✭ 72 (+125%)

Mutual labels: bert

udar

UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.

Stars: ✭ 15 (-53.12%)

Mutual labels: lemmatization

rl-trained-agents

A collection of pre-trained RL agents using Stable Baselines3

Stars: ✭ 47 (+46.88%)

Mutual labels: pretrained-models

finetuner

Finetuning any DNN for better embedding on neural search tasks

Stars: ✭ 442 (+1281.25%)

Mutual labels: pretrained-models

DeepMorphy

Морфологический анализатор для русского языка на C# для .NET

Stars: ✭ 23 (-28.12%)

Mutual labels: morphology

OpenDialog

An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统，一键部署微信闲聊机器人)

Stars: ✭ 94 (+193.75%)

Mutual labels: bert

ALBERT-Pytorch

Pytorch Implementation of ALBERT(A Lite BERT for Self-supervised Learning of Language Representations)

Stars: ✭ 214 (+568.75%)

Mutual labels: bert

semantic-document-relations

Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles"

Stars: ✭ 21 (-34.37%)

Mutual labels: bert

bert quora question pairs

BERT Model Fine-tuning on Quora Questions Pairs

Stars: ✭ 28 (-12.5%)

Mutual labels: bert

TextPair

文本对关系比较 - 语义相似度、字面相似度、文本蕴含等等

Stars: ✭ 44 (+37.5%)

Mutual labels: bert

pptod

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022)

Stars: ✭ 77 (+140.63%)

Mutual labels: pretrained-models

cdQA-ui

⛔ [NOT MAINTAINED] A web interface for cdQA and other question answering systems.

Stars: ✭ 19 (-40.62%)

Mutual labels: bert

NER-FunTool

本NER项目包含多个中文数据集，模型采用BiLSTM+CRF、BERT+Softmax、BERT+Cascade、BERT+WOL等，最后用TFServing进行模型部署，线上推理和线下推理。

Stars: ✭ 56 (+75%)

Mutual labels: bert

robo-vln

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Stars: ✭ 34 (+6.25%)

Mutual labels: bert

bert tokenization for java

This is a java version of Chinese tokenization descried in BERT.

Stars: ✭ 39 (+21.88%)

Mutual labels: bert

KAREN

KAREN: Unifying Hatespeech Detection and Benchmarking

Stars: ✭ 18 (-43.75%)

Mutual labels: bert

ObjectNet

PyTorch implementation of "Pyramid Scene Parsing Network".

Stars: ✭ 15 (-53.12%)

Mutual labels: pretrained-models

bangla-bert

Bangla-Bert is a pretrained bert model for Bengali language

Stars: ✭ 41 (+28.13%)

Mutual labels: bert

KLUE

📖 Korean NLU Benchmark

Stars: ✭ 420 (+1212.5%)

Mutual labels: bert

retinal-exudates-detection

exudates detection using hybrid approach (Image Morphology & Machine Learning)

Stars: ✭ 53 (+65.63%)

Mutual labels: morphology

bern

A neural named entity recognition and multi-type normalization tool for biomedical text mining

Stars: ✭ 151 (+371.88%)

Mutual labels: bert

KWDLC

Kyoto University Web Document Leads Corpus

Stars: ✭ 64 (+100%)

Mutual labels: dependency-parsing

text2class

Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT

Stars: ✭ 15 (-53.12%)

Mutual labels: bert

WSDM-Cup-2019

[ACM-WSDM] 3rd place solution at WSDM Cup 2019, Fake News Classification on Kaggle.

Stars: ✭ 62 (+93.75%)

Mutual labels: bert

SQUAD2.Q-Augmented-Dataset

Augmented version of SQUAD 2.0 for Questions

Stars: ✭ 31 (-3.12%)

Mutual labels: bert

OpenGNT

Open Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources

Stars: ✭ 55 (+71.88%)

Mutual labels: morphology

Quality-Estimation2

机器翻译子任务-翻译质量评价-在BERT模型后面加上Bi-LSTM进行fine-tuning

Stars: ✭ 31 (-3.12%)

Mutual labels: bert

BERT-Chinese-Couplet

BERT for Chinese Couplet | BERT用于自动对对联

Stars: ✭ 19 (-40.62%)

Mutual labels: bert

golgotha

Contextualised Embeddings and Language Modelling using BERT and Friends using R

Stars: ✭ 39 (+21.88%)

Mutual labels: bert

textstem

Tools for fast text stemming & lemmatization

Stars: ✭ 36 (+12.5%)

Mutual labels: lemmatization

modular-assemblies

[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"

Stars: ✭ 98 (+206.25%)

Mutual labels: morphology

CPPE-Dataset

Code for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset

Stars: ✭ 42 (+31.25%)

Mutual labels: pretrained-models

are-16-heads-really-better-than-1

Code for the paper "Are Sixteen Heads Really Better than One?"

Stars: ✭ 128 (+300%)

Mutual labels: bert

pyrrha

A language independant post correction app for POS and lemmatization

Stars: ✭ 14 (-56.25%)

Mutual labels: lemmatization

1-60 of 368 similar projects

›

next*5