All Projects → Cluepretrainedmodels → Similar Projects or Alternatives

1173 Open source projects that are alternatives of or similar to Cluepretrainedmodels

Nlp chinese corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

Stars: ✭ 6,656 (+1250.1%)

Mutual labels: chinese, dataset, corpus, text-classification

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Stars: ✭ 2,425 (+391.89%)

Mutual labels: chinese, dataset, corpus, pretrained-models

Cluedatasetsearch

搜索所有中文NLP数据集，附常用英文NLP数据集

Stars: ✭ 2,112 (+328.4%)

Mutual labels: chinese, corpus, text-classification

Musical Onset Efficient

Supplementary information and code for the paper: An efficient deep learning model for musical onset detection

Stars: ✭ 26 (-94.73%)

Mutual labels: dataset, pretrained-models

CLUEmotionAnalysis2020

CLUE Emotion Analysis Dataset 细粒度情感分析数据集

Stars: ✭ 3 (-99.39%)

Mutual labels: corpus, chinese

🎯🗯 Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!

Stars: ✭ 678 (+37.53%)

Mutual labels: dataset, text-classification

Indonesian Nlp Resources

data resource untuk NLP bahasa indonesia

Stars: ✭ 143 (-70.99%)

Mutual labels: dataset, corpus

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

Stars: ✭ 139 (-71.81%)

Mutual labels: dataset, corpus

Chinese Names Corpus

中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。

Stars: ✭ 3,053 (+519.27%)

Mutual labels: dataset, corpus

CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition

Stars: ✭ 689 (+39.76%)

Mutual labels: chinese, dataset

Awesome Pretrained Chinese Nlp Models

Awesome Pretrained Chinese NLP Models，高质量中文预训练模型集合

Stars: ✭ 195 (-60.45%)

Mutual labels: chinese, pretrained-models

No description or website provided.

Stars: ✭ 33 (-93.31%)

Mutual labels: corpus, chinese

基于Pytorch和torchtext的自然语言处理深度学习框架。

Stars: ✭ 739 (+49.9%)

Mutual labels: chinese, text-classification

Data repository for pretrained NLP models and NLP corpora.

Stars: ✭ 622 (+26.17%)

Mutual labels: dataset, pretrained-models

Medical-Names-Corpus

医疗语料库。医疗机构名语料库。药品本位码。

Stars: ✭ 26 (-94.73%)

Mutual labels: corpus, dataset

Awesome Hungarian Nlp

A curated list of NLP resources for Hungarian

Stars: ✭ 121 (-75.46%)

Mutual labels: dataset, corpus

用于训练中英文对话系统的语料库 Datasets for Training Chatbot System

Stars: ✭ 1,662 (+237.12%)

Mutual labels: dataset, corpus

EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"

Stars: ✭ 216 (-56.19%)

Mutual labels: dataset, pretrained-models

Company Names Corpus

公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。

Stars: ✭ 868 (+76.06%)

Mutual labels: dataset, corpus

Poetry-related datasets developed by THUAIPoet (Jiuge) group.

Stars: ✭ 111 (-77.48%)

Mutual labels: chinese, corpus

Cnn Question Classification Keras

Chinese Question Classifier (Keras Implementation) on BQuLD

Stars: ✭ 28 (-94.32%)

Mutual labels: chinese, text-classification

Weibo terminater

Final Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator

Stars: ✭ 2,295 (+365.52%)

Mutual labels: chinese, corpus

BERT-chinese-text-classification-pytorch

This repo contains a PyTorch implementation of a pretrained BERT model for text classification.

Stars: ✭ 92 (-81.34%)

Mutual labels: text-classification, chinese

text-classification-cn

中文文本分类实践，基于搜狗新闻语料库，采用传统机器学习方法以及预训练模型等方法

Stars: ✭ 81 (-83.57%)

Mutual labels: text-classification, corpus

Bert Multitask Learning

BERT for Multitask Learning

Stars: ✭ 380 (-22.92%)

Mutual labels: text-classification, pretrained-models

Filipino-Text-Benchmarks

Open-source benchmark datasets and pretrained transformer models in the Filipino language.

Stars: ✭ 22 (-95.54%)

Mutual labels: text-classification, corpus

Cnn Text Classification Tf Chinese

CNN for Chinese Text Classification in Tensorflow

Stars: ✭ 237 (-51.93%)

Mutual labels: chinese, text-classification

AiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0

Stars: ✭ 28 (-94.32%)

Mutual labels: chinese, pretrained-models

An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统，一键部署微信闲聊机器人)

Stars: ✭ 94 (-80.93%)

Mutual labels: corpus, chinese

Species-Names-Corpus

物种名称语料库。植物名,动物名。

Stars: ✭ 23 (-95.33%)

Mutual labels: corpus, dataset

A dataset of millions of news articles scraped from a curated list of data sources.

Stars: ✭ 255 (-48.28%)

Mutual labels: dataset, corpus

Natural Language Processing Best Practices & Examples

Stars: ✭ 5,783 (+1073.02%)

Mutual labels: text-classification, pretrained-models

Pytorch-NLU，一个中文文本分类、序列标注工具包，支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…

Stars: ✭ 151 (-69.37%)

Mutual labels: text-classification, pretrained-models

Insuranceqa Corpus Zh

🚁 保险行业语料库，聊天机器人

Stars: ✭ 821 (+66.53%)

Mutual labels: dataset, corpus

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

Stars: ✭ 108 (-78.09%)

Mutual labels: dataset, corpus

lists of text corpus and more (mainly Japanese)

Stars: ✭ 84 (-82.96%)

Mutual labels: dataset, corpus

Gossiping Chinese Corpus

PTT 八卦版問答中文語料

Stars: ✭ 137 (-72.21%)

Mutual labels: dataset, corpus

Corpus of Annual Reports in Japan

Stars: ✭ 55 (-88.84%)

Mutual labels: dataset, corpus

HDLTex: Hierarchical Deep Learning for Text Classification

Stars: ✭ 191 (-61.26%)

Mutual labels: dataset, text-classification

Nlp bahasa resources

A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia

Stars: ✭ 158 (-67.95%)

Mutual labels: dataset, corpus

Eda nlp for chinese

An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。

Stars: ✭ 660 (+33.87%)

Mutual labels: chinese, text-classification

自然语言处理（nlp），小姜机器人（闲聊检索式chatbot），BERT句向量-相似度（Sentence Similarity），XLNET句向量-相似度（text xlnet embedding），文本分类（Text classification），实体提取（ner，bert+bilstm+crf），数据增强（text augment, data enhance），同义句同义词生成，句子主干提取（mainpart），中文汉语短文本相似度，文本特征工程，keras-http-service调用

Stars: ✭ 954 (+93.51%)

Mutual labels: chinese, text-classification

GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型

Stars: ✭ 1,066 (+116.23%)

Mutual labels: chinese, pretrained-models

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

Stars: ✭ 278 (-43.61%)

Mutual labels: chinese, corpus

中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark

Stars: ✭ 379 (-23.12%)

Mutual labels: corpus, chinese

Chinese Text Classification

Chinese-Text-Classification，Tensorflow CNN（卷积神经网络）实现的中文文本分类。QQ群：522785813，微信群二维码：http://www.tensorflownews.com/

Stars: ✭ 284 (-42.39%)

Mutual labels: chinese, text-classification

Text Classification Cnn Rnn

CNN-RNN中文文本分类，基于TensorFlow

Stars: ✭ 3,613 (+632.86%)

Mutual labels: chinese, text-classification

A collection of small corpuses of interesting data for the creation of bots and similar stuff.

Stars: ✭ 4,293 (+770.79%)

Mutual labels: corpus

Zh.javascript.info

现代 JavaScript 教程（The Modern JavaScript Tutorial）

Stars: ✭ 5,656 (+1047.26%)

Mutual labels: chinese

Deep Learning Resources

由淺入深的深度學習資源 Collection of deep learning materials for everyone

Stars: ✭ 422 (-14.4%)

Mutual labels: chinese

Visually Explore the Stanford Question Answering Dataset

Stars: ✭ 421 (-14.6%)

Mutual labels: dataset

Seq2seqchatbots

A wrapper around tensor2tensor to flexibly train, interact, and generate data for neural chatbots.

Stars: ✭ 466 (-5.48%)

Mutual labels: dataset

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Stars: ✭ 4,713 (+855.98%)

Mutual labels: pretrained-models

Text Classification Library in Keras

Stars: ✭ 421 (-14.6%)

Mutual labels: text-classification

zhparser is a PostgreSQL extension for full-text search of Chinese language

Stars: ✭ 418 (-15.21%)

Mutual labels: chinese

Manim Tutorial Cn

manim中文入门教程

Stars: ✭ 448 (-9.13%)

Mutual labels: chinese

Awesome Remote Sensing Change Detection

List of datasets, codes, and contests related to remote sensing change detection

Stars: ✭ 414 (-16.02%)

Mutual labels: dataset

Wuhan 2019 Ncov

2019-nCoV 新冠状病毒 2019-12-01至今国家、省、市三级每日统计数据（支持接口读取）

Stars: ✭ 414 (-16.02%)

Mutual labels: dataset

Microservices from Design to Deployment 中文版《微服务：从设计到部署》

Stars: ✭ 4,637 (+840.57%)

Mutual labels: chinese

Semantic and Instance Segmentation of LiDAR point clouds for autonomous driving

Stars: ✭ 465 (-5.68%)

Mutual labels: dataset

1-60 of 1173 similar projects