anonymisationAnonymization of legal cases (Fr) based on Flair embeddings
Stars: ✭ 85 (-94.51%)
Entity Recognition DatasetsA collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (-42.44%)
PaperrobotCode for PaperRobot: Incremental Draft Generation of Scientific Ideas
Stars: ✭ 372 (-75.97%)
instance-watcherGet notified of instances mistakenly left running across all AWS regions for a specific AWS account
Stars: ✭ 90 (-94.19%)
Transformers-TutorialsThis repository contains demos I made with the Transformers library by HuggingFace.
Stars: ✭ 2,828 (+82.69%)
mcQA🔮 Answering multiple choice questions with Language Models.
Stars: ✭ 23 (-98.51%)
roberta-wwm-base-distillA distilled RoBERTa-wwm base model, with RoBERTa-wwm large as the teacher
Stars: ✭ 61 (-96.06%)
NAG-BERT[EACL'21] Non-Autoregressive with Pretrained Language Model
Stars: ✭ 47 (-96.96%)
SER-datasetsA collection of datasets for the purpose of emotion recognition/detection in speech.
Stars: ✭ 74 (-95.22%)
Spatio-Temporal-papersA collection of recent research in areas such as new infrastructure and urban computing, including white papers, academic papers, AI labs, and datasets.
Stars: ✭ 180 (-88.37%)
Bert Bilstm Crf NerTensorFlow solution for the NER task using a BiLSTM-CRF model with Google BERT fine-tuning and private server services
Stars: ✭ 3,838 (+147.93%)
DE-LIMITDeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.
Stars: ✭ 90 (-94.19%)
OgbBenchmark datasets, data loaders, and evaluators for graph machine learning
Stars: ✭ 799 (-48.39%)
KitanaQAKitanaQA: Adversarial training and data augmentation for neural question-answering models
Stars: ✭ 58 (-96.25%)
classyclassy is a simple-to-use library for building high-performance Machine Learning models in NLP.
Stars: ✭ 61 (-96.06%)
epoxyExtra-strength glue engines for R Markdown and Quarto
Stars: ✭ 141 (-90.89%)
ChakinSimple downloader for pre-trained word vectors
Stars: ✭ 323 (-79.13%)
JointIDSFBERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)
Stars: ✭ 55 (-96.45%)
TextPairText pair relationship comparison: semantic similarity, literal similarity, textual entailment, and more
Stars: ✭ 44 (-97.16%)
Nlp TutorialNatural Language Processing Tutorial for Deep Learning Researchers
Stars: ✭ 9,895 (+539.21%)
cifairA duplicate-free variant of the CIFAR test set.
Stars: ✭ 13 (-99.16%)
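NER-FunToolThis NER project includes multiple Chinese datasets; the models use BiLSTM+CRF, BERT+Softmax, BERT+Cascade, BERT+WOL, and others, with TFServing used for model deployment and both online and offline inference.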
Stars: ✭ 56 (-96.38%)
big-data-exploration[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (-97.22%)
rs datasetsTool for autodownloading recommendation systems datasets
Stars: ✭ 22 (-98.58%)
Kevinpro-NLP-demoAll the NLP you need here. Personal implementations of some fun NLP demos, currently including PyTorch implementations of 13 NLP applications
Stars: ✭ 117 (-92.44%)
bugrepoA collection of publicly available bug reports
Stars: ✭ 93 (-93.99%)
bert saBERT sentiment analysis with TensorFlow Serving and a RESTful API
Stars: ✭ 35 (-97.74%)
Medical Datasetstracking medical datasets, with a focus on medical imaging
Stars: ✭ 296 (-80.88%)
Animal MattingGithub repository for the paper End-to-end Animal Image Matting
Stars: ✭ 363 (-76.55%)
cdQA-ui⛔ [NOT MAINTAINED] A web interface for cdQA and other question answering systems.
Stars: ✭ 19 (-98.77%)
mindsdb-examplesUsage examples for MindsDB (https://www.mindsdb.com/)
Stars: ✭ 25 (-98.39%)
awesome-sweden-datasetsA curated list of awesome datasets to use when coding for the Swedish market.
Stars: ✭ 17 (-98.9%)
oreilly-bert-nlpThis repository contains code for the O'Reilly Live Online Training for BERT
Stars: ✭ 19 (-98.77%)
Open3d MlAn extension of Open3D to address 3D Machine Learning tasks
Stars: ✭ 284 (-81.65%)
neural-ranking-kdImproving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation
Stars: ✭ 74 (-95.22%)
NSP-BERTThe code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"
Stars: ✭ 166 (-89.28%)
CheXbertCombining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT
Stars: ✭ 51 (-96.71%)
EasyprAn easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations.
Stars: ✭ 6,046 (+290.57%)
BERT-QECode and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".
Stars: ✭ 43 (-97.22%)
systematic-review-datasetsA collection of fully labeled systematic review datasets (title-abstract screening)
Stars: ✭ 25 (-98.39%)
korpatbertKorPatBERT, a Korean AI language model specialized for the patent domain
Stars: ✭ 48 (-96.9%)
Cluecorpus2020Large-scale pre-training corpus for Chinese (100 GB)
Stars: ✭ 278 (-82.04%)
AnnA Anki neuronal AppendixUsing machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity
Stars: ✭ 39 (-97.48%)
KLUE📖 Korean NLU Benchmark
Stars: ✭ 420 (-72.87%)
hard-label-attackNatural Language Attacks in a Hard Label Black Box Setting.
Stars: ✭ 26 (-98.32%)
TorchBlocksA PyTorch-based toolkit for natural language processing
Stars: ✭ 85 (-94.51%)
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-10.98%)
Persian Swear WordsA dataset of inappropriate and offensive Persian words for filtering text
Stars: ✭ 95 (-93.86%)