Rasa nlu chi - Turn Chinese natural language into structured data (Chinese natural language understanding)
Stars: ✭ 1,166 (+362.7%)
Gpy - Go tool for converting Chinese characters to pinyin
Stars: ✭ 136 (-46.03%)
Learn Vim - Vim practical tutorial (Learning Vim).
Stars: ✭ 1,166 (+362.7%)
Zhvoice - Chinese voice corpus with clearer, more natural speech; includes 8 open-source datasets, 3,200 speakers, 900 hours of speech, and 13 million characters.
Stars: ✭ 327 (+29.76%)
Phobert - PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
Stars: ✭ 332 (+31.75%)
Macropodus - Natural language processing toolkit based on an Albert+BiLSTM+CRF deep learning architecture: Chinese word segmentation (CWS), part-of-speech (POS) tagging, named entity recognition (NER), new word discovery, keyword extraction, text summarization, text similarity, scientific calculator, conversion between Chinese numerals and Arabic (or Roman) numerals, Traditional/Simplified Chinese conversion, and pinyin conversion.
Stars: ✭ 309 (+22.62%)
Medcat - Medical Concept Annotation Tool
Stars: ✭ 133 (-47.22%)
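Bert seq2seq - PyTorch implementation of BERT for seq2seq tasks using the UniLM scheme; it now also handles automatic summarization, text classification, sentiment analysis, NER, and part-of-speech tagging, and supports article continuation with GPT2.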
Stars: ✭ 298 (+18.25%)
Farm - 🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Stars: ✭ 1,140 (+352.38%)
Chinese Hershey Font - Convert Chinese Characters to Single-Line Fonts using Computer Vision
Stars: ✭ 70 (-72.22%)
Opencc4j - 🇨🇳 Open Chinese Convert is an open-source project for conversion between Traditional Chinese and Simplified Chinese (Java).
Stars: ✭ 187 (-25.79%)
Borgert Cms - Borgert is an open-source CMS created with Laravel Framework 5.6
Stars: ✭ 298 (+18.25%)
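Typeset - Automatically fixes full-width/half-width characters, spacing, and similar issues in mixed Chinese, English, and code typesetting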
Stars: ✭ 63 (-75%)
Cluecorpus2020 - Large-scale pre-training corpus for Chinese (100 GB)
Stars: ✭ 278 (+10.32%)
Roberta zh - RoBERTa for Chinese: Chinese pre-trained RoBERTa models
Stars: ✭ 1,953 (+675%)
Bertweet - BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
Stars: ✭ 282 (+11.9%)
Awesome Cn - Chinese translations of awesome projects, for faster lookup
Stars: ✭ 62 (-75.4%)
Hscrf Pytorch - ACL 2018: Hybrid semi-Markov CRF for Neural Sequence Labeling (http://aclweb.org/anthology/P18-2038)
Stars: ✭ 284 (+12.7%)
Nonflowers - Procedurally generated paintings of nonexistent flowers.
Stars: ✭ 208 (-17.46%)
Torchcrf - An Implementation of CRF (Conditional Random Fields) in PyTorch 1.0
Stars: ✭ 58 (-76.98%)
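Ehviewer - The previous author, NekoInverter, has resumed updating EhViewer on GitLab, so I no longer maintain this project independently and it is archived for now. Please go to https://gitlab.com/NekoInverter/EhViewer for the latest version.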
Stars: ✭ 127 (-49.6%)
Sequence tagging - Using BiLSTM-CRF, BERT, and other methods for sequence tagging tasks
Stars: ✭ 263 (+4.37%)
Char rnn lm zh - Language model in Chinese, implemented following the official PyTorch documentation
Stars: ✭ 57 (-77.38%)
Persian Ner - Large labeled corpus for Persian named entity recognition
Stars: ✭ 183 (-27.38%)
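rust-course - "The Rust Language Bible (Book & Course)" gives a comprehensive, in-depth explanation of Rust, with vivid examples and exercises that carry you through every hurdle from getting started to practical application. Our goal is to build an excellent open-source Rust tutorial (course): to learn Rust, go to course.rs.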
Stars: ✭ 2,739 (+986.9%)
Phonlp - PhoNLP: A BERT-based multi-task learning toolkit for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
Stars: ✭ 56 (-77.78%)
Bnlp - BNLP is a natural language processing toolkit for the Bengali language.
Stars: ✭ 127 (-49.6%)
FewCLUE - FewCLUE: few-shot learning evaluation benchmark, Chinese edition
Stars: ✭ 251 (-0.4%)
Ner blstm Crf - LSTM-CRF for NER with the CoNLL-2002 dataset
Stars: ✭ 51 (-79.76%)
Gpt2 Newstitle - Chinese news title generation project using GPT2, with extremely detailed comments.
Stars: ✭ 235 (-6.75%)
Pime - Develop input methods for Windows easily with Python and node.js
Stars: ✭ 1,051 (+317.06%)
CLUE pytorch - PyTorch baselines for CLUE
Stars: ✭ 72 (-71.43%)
Ner Evaluation - An implementation of full named-entity evaluation metrics based on SemEval'13 Task 9, computed not at the tag/token level but over all the tokens that are part of the named entity
Stars: ✭ 126 (-50%)
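Helpdesk Guide - 📖 "Helpdesk and Network Administration Practical Handbook": information security operations and maintenance for hosts and program-controlled terminals; a crash course for landing an entry-level IT job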
Stars: ✭ 183 (-27.38%)
NER-Multimodal-pytorch - PyTorch implementation of "Adaptive Co-attention Network for Named Entity Recognition in Tweets" (AAAI 2018)
Stars: ✭ 42 (-83.33%)
fairseq-tagging - A Fairseq fork for sequence tagging/labeling tasks
Stars: ✭ 26 (-89.68%)
Dotnetbook - .NET Platform Architecture book (English, Chinese, Russian)
Stars: ✭ 1,763 (+599.6%)
Jointre - End-to-end neural relation extraction using deep biaffine attention (ECIR 2019)
Stars: ✭ 41 (-83.73%)
Pytorch ner bilstm cnn crf - End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF, implemented in PyTorch
Stars: ✭ 249 (-1.19%)
Malaya - Natural Language Toolkit for Bahasa Malaysia, https://malaya.readthedocs.io/
Stars: ✭ 239 (-5.16%)
Spacy Lookup - Named Entity Recognition based on dictionaries
Stars: ✭ 212 (-15.87%)
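Marktool - A general-purpose, web-based text annotation tool. It supports large-scale entity annotation, relation annotation, event annotation, text classification, automatic annotation based on dictionary matching and regular-expression matching, and standard-name annotation for normalization; it also supports iterative annotation of text and nested annotation of entities. Annotation guidelines are customizable and, within the same task type, can be created once and reused many times. Hierarchical entity sets expand the range of entity types, and a new, efficient annotation method improves user experience and annotation efficiency. In addition, the tool adds a review step that checks and adjusts the consistency of annotations from multiple annotators, improving the accuracy and reliability of the annotated corpus.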
Stars: ✭ 190 (-24.6%)
Ncrfpp - NCRF++, a Neural Sequence Labeling Toolkit. Easy to use for any sequence labeling task (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Stars: ✭ 1,767 (+601.19%)
Nezha chinese pytorch - NEZHA: Neural Contextualized Representation for Chinese Language Understanding
Stars: ✭ 65 (-74.21%)