vietnamese-robertaA Robustly Optimized BERT Pretraining Approach for Vietnamese
Stars: ✭ 22 (-31.25%)
lemmaA Morphological Parser (Analyser) / Lemmatizer written in Elixir.
Stars: ✭ 45 (+40.63%)
Spacy Course👩🏫 Advanced NLP with spaCy: A free online course
Stars: ✭ 1,920 (+5900%)
Clue中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+7478.13%)
nlp-cheat-sheet-pythonNLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+115.63%)
HugsVisionHugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Stars: ✭ 154 (+381.25%)
zeyrekPython morphological analyzer for Turkish language. Partial port of ZemberekNLP.
Stars: ✭ 36 (+12.5%)
Transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+174093.75%)
sticker2Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot
Stars: ✭ 14 (-56.25%)
banglabertThis repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chap…
Stars: ✭ 186 (+481.25%)
AiSpaceAiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0
Stars: ✭ 28 (-12.5%)
roberta-wwm-base-distillthis is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large
Stars: ✭ 61 (+90.63%)
Pytorch-NLUPytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…
Stars: ✭ 151 (+371.88%)
TorchBlocksA PyTorch-based toolkit for natural language processing
Stars: ✭ 85 (+165.63%)
textgoText preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
Stars: ✭ 33 (+3.13%)
LightLM高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task
Stars: ✭ 54 (+68.75%)
mcQA🔮 Answering multiple choice questions with Language Models.
Stars: ✭ 23 (-28.12%)
classyclassy is a simple-to-use library for building high-performance Machine Learning models in NLP.
Stars: ✭ 61 (+90.63%)
Medi-CoQAConversational Question Answering on Clinical Text
Stars: ✭ 22 (-31.25%)
simplemmaSimple multilingual lemmatizer for Python, especially useful for speed and efficiency
Stars: ✭ 32 (+0%)
text2textText2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+487.5%)
bert experimentalcode and supplementary materials for a series of Medium articles about the BERT model
Stars: ✭ 72 (+125%)
udarUDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.
Stars: ✭ 15 (-53.12%)
rl-trained-agentsA collection of pre-trained RL agents using Stable Baselines3
Stars: ✭ 47 (+46.88%)
finetunerFinetuning any DNN for better embedding on neural search tasks
Stars: ✭ 442 (+1281.25%)
DeepMorphyМорфологический анализатор для русского языка на C# для .NET
Stars: ✭ 23 (-28.12%)
OpenDialogAn Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)
Stars: ✭ 94 (+193.75%)
ALBERT-PytorchPytorch Implementation of ALBERT(A Lite BERT for Self-supervised Learning of Language Representations)
Stars: ✭ 214 (+568.75%)
semantic-document-relationsImplementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles"
Stars: ✭ 21 (-34.37%)
TextPair文本对关系比较 - 语义相似度、字面相似度、文本蕴含等等
Stars: ✭ 44 (+37.5%)
pptodMulti-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022)
Stars: ✭ 77 (+140.63%)
cdQA-ui⛔ [NOT MAINTAINED] A web interface for cdQA and other question answering systems.
Stars: ✭ 19 (-40.62%)
NER-FunTool本NER项目包含多个中文数据集,模型采用BiLSTM+CRF、BERT+Softmax、BERT+Cascade、BERT+WOL等,最后用TFServing进行模型部署,线上推理和线下推理。
Stars: ✭ 56 (+75%)
robo-vlnPytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
Stars: ✭ 34 (+6.25%)
KARENKAREN: Unifying Hatespeech Detection and Benchmarking
Stars: ✭ 18 (-43.75%)
ObjectNetPyTorch implementation of "Pyramid Scene Parsing Network".
Stars: ✭ 15 (-53.12%)
bangla-bertBangla-Bert is a pretrained bert model for Bengali language
Stars: ✭ 41 (+28.13%)
KLUE📖 Korean NLU Benchmark
Stars: ✭ 420 (+1212.5%)
bernA neural named entity recognition and multi-type normalization tool for biomedical text mining
Stars: ✭ 151 (+371.88%)
KWDLCKyoto University Web Document Leads Corpus
Stars: ✭ 64 (+100%)
text2classMulti-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT
Stars: ✭ 15 (-53.12%)
WSDM-Cup-2019[ACM-WSDM] 3rd place solution at WSDM Cup 2019, Fake News Classification on Kaggle.
Stars: ✭ 62 (+93.75%)
OpenGNTOpen Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources
Stars: ✭ 55 (+71.88%)
golgothaContextualised Embeddings and Language Modelling using BERT and Friends using R
Stars: ✭ 39 (+21.88%)
textstemTools for fast text stemming & lemmatization
Stars: ✭ 36 (+12.5%)
modular-assemblies[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"
Stars: ✭ 98 (+206.25%)
CPPE-DatasetCode for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset
Stars: ✭ 42 (+31.25%)
pyrrhaA language independant post correction app for POS and lemmatization
Stars: ✭ 14 (-56.25%)