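Company Names Corpus
Corpus of company and organization names: company short names, abbreviations, brand terms, and enterprise names. Useful for Chinese word segmentation and organization named-entity recognition.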
Stars: ✭ 868 (+3238.46%)
Mutual labels: corpus, dataset, dict
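Chinese Names Corpus
Corpus of Chinese personal names and a name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Useful for Chinese word segmentation and person named-entity recognition.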
Stars: ✭ 3,053 (+11642.31%)
Mutual labels: corpus, dataset, dict
Dataset List
Lists of text corpora and more (mainly Japanese)
Stars: ✭ 84 (+223.08%)
Mutual labels: corpus, dataset
Coarij
Corpus of Annual Reports in Japan
Stars: ✭ 55 (+111.54%)
Mutual labels: corpus, dataset
Isic Archive Downloader
A script to download the ISIC Archive of lesion images
Stars: ✭ 153 (+488.46%)
Mutual labels: medical, dataset
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (+365.38%)
Mutual labels: corpus, dataset
Pubmed Rct
PubMed 200k RCT dataset: a large dataset for sequential sentence classification.
Stars: ✭ 101 (+288.46%)
Mutual labels: corpus, medical
Dialog corpus
Datasets for training Chinese and English dialogue (chatbot) systems
Stars: ✭ 1,662 (+6292.31%)
Mutual labels: corpus, dataset
Clue
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+9226.92%)
Mutual labels: corpus, dataset
Nlp chinese corpus
Large-scale Chinese corpus for NLP
Stars: ✭ 6,656 (+25500%)
Mutual labels: corpus, dataset
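Cluepretrainedmodels
A collection of high-quality Chinese pre-trained models: state-of-the-art large models, fastest small models, and models specialized for similarity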
Stars: ✭ 493 (+1796.15%)
Mutual labels: corpus, dataset
Fakenewscorpus
A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (+880.77%)
Mutual labels: corpus, dataset
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (+434.62%)
Mutual labels: corpus, dataset
Ua Gec
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (+315.38%)
Mutual labels: corpus, dataset
Medmnist
[ISBI'21] MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis
Stars: ✭ 338 (+1200%)
Mutual labels: medical, dataset
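Awesome chinese medical nlp
A curated list of public Chinese medical NLP resources: terminologies, corpora, word embeddings, pre-trained models, knowledge graphs, named-entity recognition, QA, information extraction, models, papers, etc.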
Stars: ✭ 623 (+2296.15%)
Mutual labels: medical, dataset