Pyclue: Python toolkit for the Chinese Language Understanding Evaluation (CLUE) benchmark
Stars: ✭ 91 (-79.22%)
Dataset List: Lists of text corpora and more (mainly Japanese)
Stars: ✭ 84 (-80.82%)
Ja.text8: Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-81.96%)
Russian news corpus: Corpus of stemmed (lemmatized, morphologically normalized) Russian mass media texts
Stars: ✭ 76 (-82.65%)
Blacklab: A corpus retrieval engine based on Apache Lucene
Stars: ✭ 69 (-84.25%)
Coarij: Corpus of Annual Reports in Japan
Stars: ✭ 55 (-87.44%)
Typing Assistant: Autocompletes words and suggests predictions for the next word, making typing faster, smarter, and less effortful.
Stars: ✭ 32 (-92.69%)
Lyrics Corpora: An unofficial Python API that lets users create a corpus of lyrics from their favorite artists and Billboard charts
Stars: ✭ 13 (-97.03%)
Naive Bayes Classifier: A Naive Bayes classification algorithm that uses Bernoulli and multinomial models to classify text documents as ham or spam.
Stars: ✭ 6 (-98.63%)
Seq2seq Chatbot: Chatbot in 200 lines of code using TensorLayer
Stars: ✭ 777 (+77.4%)
Quanteda: An R package for the Quantitative Analysis of Textual Data
Stars: ✭ 647 (+47.72%)
Awesome Persian Nlp Ir: Curated list of Persian natural language processing and information retrieval tools and resources
Stars: ✭ 460 (+5.02%)
Fancy Nlp: NLP for humans. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP.
Stars: ✭ 233 (-46.8%)
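Nlp4han: Chinese natural language processing toolkit (sentence splitting, word segmentation, part-of-speech tagging, chunking, syntactic parsing, semantic analysis, NER, n-gram models, HMM, pronoun resolution, sentiment analysis, spell checking)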
Stars: ✭ 206 (-52.97%)
Lac: Baidu NLP (word segmentation, part-of-speech tagging, named entity recognition, word importance)
Stars: ✭ 2,792 (+537.44%)
Weatherbot: A Rasa-based Chinese chatbot for weather queries, with a web UI
Stars: ✭ 186 (-57.53%)
Thuctc: An Efficient Chinese Text Classifier
Stars: ✭ 179 (-59.13%)
Fastnlp (fastNLP): A modularized and extensible NLP framework. Currently still in incubation.
Stars: ✭ 2,441 (+457.31%)
G2pc (g2pC): A context-aware grapheme-to-phoneme conversion module for Chinese
Stars: ✭ 155 (-64.61%)
Information Extraction Chinese: Chinese named entity recognition with IDCNN/biLSTM+CRF, and relation extraction with biGRU+2ATT
Stars: ✭ 1,888 (+331.05%)
Segmentit: A Chinese word segmentation package usable in any JS environment, forked from leizongmin/node-segment
Stars: ✭ 139 (-68.26%)
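Chinese Chatbot: A Chinese chatbot trained on 100,000 dialogue pairs using an attention mechanism; it generates a meaningful reply to most general questions. The model has been uploaded and can be run directly; the author promises to eat a keyboard on a live stream if it fails to run.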
Stars: ✭ 124 (-71.69%)
Thulac Python: An Efficient Lexical Analyzer for Chinese
Stars: ✭ 1,619 (+269.63%)
Chinese nlu by using rasa nlu: Use RASA NLU to build a Chinese Natural Language Understanding (NLU) system
Stars: ✭ 99 (-77.4%)
Zhopenie: Chinese Open Information Extraction (Tree-based Triple Relation Extraction Module)
Stars: ✭ 98 (-77.63%)
Chinesenlp: Datasets and SOTA results for every field of Chinese NLP
Stars: ✭ 1,206 (+175.34%)
Awesome Chinese Nlp: A curated list of resources for Chinese NLP
Stars: ✭ 6,599 (+1406.62%)
Jcseg: A lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key sentence extraction, and summary extraction based on the TextRank algorithm. Jcseg has a built-in HTTP server and search modules for the latest Lucene, Solr, and Elasticsearch.
Stars: ✭ 754 (+72.15%)
Thulac: An Efficient Lexical Analyzer for Chinese
Stars: ✭ 629 (+43.61%)
Ddparser: Baidu's open-source dependency parsing system
Stars: ✭ 537 (+22.6%)
Easypr: An easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations.
Stars: ✭ 6,046 (+1280.37%)