2239 Open source projects that are alternatives of or similar to Nlp_chinese_corpus

Clue
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (-63.57%)
Mutual labels:  chinese, dataset, corpus, language-model, bert
Cnn Question Classification Keras
Chinese Question Classifier (Keras Implementation) on BQuLD
Stars: ✭ 28 (-99.58%)
Lightnlp
A deep learning framework for natural language processing based on PyTorch and torchtext.
Stars: ✭ 739 (-88.9%)
Cluepretrainedmodels
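A collection of high-quality Chinese pre-trained models: state-of-the-art large models, fastest small models, and specialized similarity models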
Stars: ✭ 493 (-92.59%)
Mutual labels:  chinese, dataset, corpus, text-classification
Gossiping Chinese Corpus
Chinese question-answering corpus from the PTT Gossiping board
Stars: ✭ 137 (-97.94%)
backprop
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
Stars: ✭ 229 (-96.56%)
Filipino-Text-Benchmarks
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
Stars: ✭ 22 (-99.67%)
Mutual labels:  text-classification, corpus, bert
Cluedatasetsearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
Stars: ✭ 2,112 (-68.27%)
Mutual labels:  chinese, corpus, text-classification
OpenDialog
An Open-Source Package for Chinese Open-domain Conversational Chatbot (a Chinese chit-chat dialogue system with one-click deployment of a WeChat chit-chat bot)
Stars: ✭ 94 (-98.59%)
Mutual labels:  corpus, chinese, bert
Insuranceqa Corpus Zh
🚁 Insurance industry corpus, chatbot
Stars: ✭ 821 (-87.67%)
Mutual labels:  question-answering, dataset, corpus
Bert language understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Stars: ✭ 933 (-85.98%)
Vaaku2Vec
Language Modeling and Text Classification in Malayalam Language using ULMFiT
Stars: ✭ 68 (-98.98%)
Haystack
🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (-48.78%)
BERT-chinese-text-classification-pytorch
This repo contains a PyTorch implementation of a pretrained BERT model for text classification.
Stars: ✭ 92 (-98.62%)
Mutual labels:  text-classification, chinese, bert
text-classification-cn
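Chinese text classification in practice, based on the Sogou news corpus, using traditional machine learning methods as well as pre-trained models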
Stars: ✭ 81 (-98.78%)
Mutual labels:  text-classification, word2vec, corpus
SQUAD2.Q-Augmented-Dataset
Augmented version of SQUAD 2.0 for Questions
Stars: ✭ 31 (-99.53%)
Mutual labels:  question-answering, bert
WSDM-Cup-2019
[ACM-WSDM] 3rd place solution at WSDM Cup 2019, Fake News Classification on Kaggle.
Stars: ✭ 62 (-99.07%)
Mutual labels:  text-classification, bert
Chatito
🎯🗯 Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Stars: ✭ 678 (-89.81%)
Mutual labels:  dataset, text-classification
LightLM
High-performance small model evaluation. Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task
Stars: ✭ 54 (-99.19%)
Mutual labels:  chinese, bert
Product-Categorization-NLP
Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-99.55%)
Mutual labels:  text-classification, word2vec
textgo
Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
Stars: ✭ 33 (-99.5%)
Mutual labels:  text-classification, bert
NSP-BERT
The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"
Stars: ✭ 166 (-97.51%)
Mutual labels:  text-classification, bert
text2text
Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (-97.18%)
Mutual labels:  question-answering, bert
cdQA-ui
⛔ [NOT MAINTAINED] A web interface for cdQA and other question answering systems.
Stars: ✭ 19 (-99.71%)
Mutual labels:  question-answering, bert
mcQA
🔮 Answering multiple choice questions with Language Models.
Stars: ✭ 23 (-99.65%)
Mutual labels:  question-answering, bert
Species-Names-Corpus
Species names corpus: plant names and animal names.
Stars: ✭ 23 (-99.65%)
Mutual labels:  corpus, dataset
CLUEmotionAnalysis2020
CLUE Emotion Analysis Dataset: a fine-grained sentiment analysis dataset
Stars: ✭ 3 (-99.95%)
Mutual labels:  corpus, chinese
ganbert-pytorch
Enhancing the BERT training with Semi-supervised Generative Adversarial Networks in Pytorch/HuggingFace
Stars: ✭ 60 (-99.1%)
Mutual labels:  text-classification, bert
Ngram2vec
Four word embedding models implemented in Python. Supporting arbitrary context features
Stars: ✭ 703 (-89.44%)
Mutual labels:  chinese, word2vec
feedIO
A Feed Aggregator that Knows What You Want to Read.
Stars: ✭ 26 (-99.61%)
Mutual labels:  news, text-classification
Medi-CoQA
Conversational Question Answering on Clinical Text
Stars: ✭ 22 (-99.67%)
Mutual labels:  question-answering, bert
bert tokenization for java
This is a Java version of the Chinese tokenization described in BERT.
Stars: ✭ 39 (-99.41%)
Mutual labels:  chinese-nlp, bert
TorchBlocks
A PyTorch-based toolkit for natural language processing
Stars: ✭ 85 (-98.72%)
Mutual labels:  text-classification, bert
Pytorch-NLU
Pytorch-NLU, a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese texts, and sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, and word segmentation.
Stars: ✭ 151 (-97.73%)
Mutual labels:  text-classification, bert
text2class
Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT
Stars: ✭ 15 (-99.77%)
Mutual labels:  text-classification, bert
chinese-nlp-ner
A BLSTM-CRF solution for Chinese named entity recognition
Stars: ✭ 14 (-99.79%)
Mutual labels:  chinese, chinese-nlp
Giveme5W
Extraction of the five journalistic W-questions (5W) from news articles
Stars: ✭ 16 (-99.76%)
Mutual labels:  news, question-answering
Text and Audio classification with Bert
Text Classification in Turkish Texts with Bert
Stars: ✭ 34 (-99.49%)
Mutual labels:  text-classification, bert
MobileQA
Offline reading comprehension application: QA for mobile, Android & iPhone
Stars: ✭ 49 (-99.26%)
Mutual labels:  chinese, bert
kwx
BERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (-99.5%)
Mutual labels:  text-classification, bert
wordfish-python
extract relationships from standardized terms from corpus of interest with deep learning 🐟
Stars: ✭ 19 (-99.71%)
Mutual labels:  word2vec, corpus
policy-data-analyzer
Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
Stars: ✭ 22 (-99.67%)
Mutual labels:  text-classification, bert
squad-v1.1-pt
Portuguese translation of the SQuAD dataset
Stars: ✭ 13 (-99.8%)
Mutual labels:  dataset, question-answering
ODSQA
ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET
Stars: ✭ 43 (-99.35%)
Mutual labels:  question-answering, chinese
Medical-Names-Corpus
Medical corpus. Corpus of medical institution names. Drug standard codes.
Stars: ✭ 26 (-99.61%)
Mutual labels:  corpus, dataset
FewCLUE
FewCLUE: few-shot learning evaluation benchmark, Chinese edition
Stars: ✭ 251 (-96.23%)
Mutual labels:  chinese, bert
AskNowNQS
A question answering system for RDF knowledge graphs.
Stars: ✭ 32 (-99.52%)
Mutual labels:  word2vec, question-answering
Fakenewscorpus
A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-96.17%)
Mutual labels:  dataset, corpus
Chinese-Word-Segmentation-in-NLP
State of the art Chinese Word Segmentation with Bi-LSTMs
Stars: ✭ 23 (-99.65%)
Mutual labels:  chinese, language-model
sqlmap-wiki-zhcn
Possibly the most complete Chinese documentation for sqlmap.
Stars: ✭ 51 (-99.23%)
Mutual labels:  wiki, chinese
Bertweet
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
Stars: ✭ 282 (-95.76%)
Cluecorpus2020
Large-scale Pre-training Corpus for Chinese: 100 GB of Chinese pre-training text
Stars: ✭ 278 (-95.82%)
Mutual labels:  chinese, corpus
Nlu sim
All kinds of baseline models for sentence similarity (sentence-pair semantic similarity models)
Stars: ✭ 286 (-95.7%)
Mutual labels:  question-answering, word2vec
Text Cnn
CNN-based Chinese text classification with embedded Word2vec word vectors
Stars: ✭ 298 (-95.52%)
Mutual labels:  text-classification, word2vec
Giveme5w1h
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
Stars: ✭ 316 (-95.25%)
Mutual labels:  news, question-answering
CLUE pytorch
CLUE baseline pytorch: the PyTorch version of the CLUE baselines
Stars: ✭ 72 (-98.92%)
Mutual labels:  chinese, bert
Chinese Text Classification
Chinese-Text-Classification: Chinese text classification implemented with a TensorFlow CNN (convolutional neural network). QQ group: 522785813; WeChat group QR code: http://www.tensorflownews.com/
Stars: ✭ 284 (-95.73%)
Mutual labels:  chinese, text-classification
Albert zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale Chinese pre-trained ALBERT models
Stars: ✭ 3,500 (-47.42%)
Mutual labels:  bert, chinese-corpus
Eda nlp for chinese
An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese corpora. NLP data augmentation. Paper reading notes.
Stars: ✭ 660 (-90.08%)
Mutual labels:  chinese, text-classification
Bert Pytorch
Google AI 2018 BERT pytorch implementation
Stars: ✭ 4,642 (-30.26%)
Mutual labels:  language-model, bert