Nlp Experiments In PytorchPyTorch repository for text categorization and NER experiments in Turkish and English.
Stars: ✭ 35 (-86.11%)
Go Workwxa sensible Work Weixin(企业微信, Wechat Work) SDK for Go
Stars: ✭ 181 (-28.17%)
ipymarkupNER, syntax markup visualizations
Stars: ✭ 108 (-57.14%)
Nlp xiaojiang自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用
Stars: ✭ 954 (+278.57%)
ChineseFontsConvert asian text to web fonts
Stars: ✭ 14 (-94.44%)
LightLM高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task
Stars: ✭ 54 (-78.57%)
DefactonlpDeFactoNLP: An Automated Fact-checking System that uses Named Entity Recognition, TF-IDF vector comparison and Decomposable Attention models.
Stars: ✭ 30 (-88.1%)
kaldi-timit-sre-ivectorDevelop speaker recognition model based on i-vector using TIMIT database
Stars: ✭ 17 (-93.25%)
Bert nerNer with Bert
Stars: ✭ 240 (-4.76%)
GseGo efficient multilingual NLP and text segmentation; support english, chinese, japanese and other. Go 高性能多语言 NLP 和分词
Stars: ✭ 1,695 (+572.62%)
Chinese PoetryThe most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
Stars: ✭ 34,881 (+13741.67%)
wasm-cn[翻译中] WebAssembly 中文文档
Stars: ✭ 22 (-91.27%)
JiaguJiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类
Stars: ✭ 2,368 (+839.68%)
GameWord记录一下游戏常用单词的中英文对照
Stars: ✭ 157 (-37.7%)
Ner命名体识别(NER)综述-论文-模型-代码(BiLSTM-CRF/BERT-CRF)-竞赛资源总结-随时更新
Stars: ✭ 118 (-53.17%)
Tf nerSimple and Efficient Tensorflow implementations of NER models with tf.estimator and tf.data
Stars: ✭ 876 (+247.62%)
Nlp4han中文自然语言处理工具集【断句/分词/词性标注/组块/句法分析/语义分析/NER/N元语法/HMM/代词消解/情感分析/拼写检查】
Stars: ✭ 206 (-18.25%)
kulaLightweight and highly extensible .NET scripting language.
Stars: ✭ 43 (-82.94%)
Knowledge GraphsA collection of research on knowledge graphs
Stars: ✭ 845 (+235.32%)
Chinese rime收集現代漢語方言和古漢語的中州韻輸入法拼音方案 Collection of phonetic spelling schemas for Sinitic languages and dialects
Stars: ✭ 118 (-53.17%)
simple NERsimple rule based named entity recognition
Stars: ✭ 29 (-88.49%)
Anki BackupMy anki cards' backups. Java、大数据、数据结构八股文。
Stars: ✭ 26 (-89.68%)
Sequence taggingNamed Entity Recognition (LSTM + CRF) - Tensorflow
Stars: ✭ 1,889 (+649.6%)
ccalendarChinese Calendar in calendar(1) for BSD, Linux & macOS
Stars: ✭ 17 (-93.25%)
R NotesNotes for using R language to do data mining and machine learning (Chinese)
Stars: ✭ 25 (-90.08%)
CBLUE中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Stars: ✭ 379 (+50.4%)
PyCasiaA python library to work with the CASIA Chinese handwriting database.
Stars: ✭ 38 (-84.92%)
2020CCF-NER2020 CCF大数据与计算智能大赛-非结构化商业文本信息中隐私信息识别-第7名方案
Stars: ✭ 66 (-73.81%)
WebstructNER toolkit for HTML data
Stars: ✭ 230 (-8.73%)
RelationshipChinese kinship system.中国亲戚关系计算器 - 家庭称谓/称呼计算/亲戚关系算法
Stars: ✭ 898 (+256.35%)
lingvo--Ner-ruNamed entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке
Stars: ✭ 38 (-84.92%)
Blog部署在 GitBook 上的个人博客。
Stars: ✭ 112 (-55.56%)
verseagilityRamp up your custom natural language processing (NLP) task, allowing you to bring your own data, use your preferred frameworks and bring models into production.
Stars: ✭ 23 (-90.87%)
Entity Recognition DatasetsA collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (+253.57%)
troveWeakly supervised medical named entity classification
Stars: ✭ 55 (-78.17%)
Nlp pytorch projectEmbedding, NMT, Text_Classification, Text_Generation, NER etc.
Stars: ✭ 153 (-39.29%)
Chatbot cn基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
Stars: ✭ 791 (+213.89%)
Pytorch ner bilstm cnn crfEnd-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF implement in pyotrch
Stars: ✭ 249 (-1.19%)
Malaya Natural Language Toolkit for bahasa Malaysia, https://malaya.readthedocs.io/
Stars: ✭ 239 (-5.16%)
Spacy LookupNamed Entity Recognition based on dictionaries
Stars: ✭ 212 (-15.87%)
Marktool这是一款基于web的通用文本标注工具,支持大规模实体标注、关系标注、事件标注、文本分类、基于字典匹配和正则匹配的自动标注以及用于实现归一化的标准名标注,同时也支持文本的迭代标注和实体的嵌套标注。标注规范可自定义且同类型任务中可“一次创建多次复用”。通过分级实体集合扩大了实体类型的规模,并设计了全新高效的标注方式,提升了用户体验和标注效率。此外,本工具增加了审核环节,可对多人的标注结果进行一致性检验和调整,提高了标注语料的准确率和可靠性。
Stars: ✭ 190 (-24.6%)
NcrfppNCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Stars: ✭ 1,767 (+601.19%)
Nezha chinese pytorchNEZHA: Neural Contextualized Representation for Chinese Language Understanding
Stars: ✭ 65 (-74.21%)
Osfcc一个收集可用于中文字体排印的开源字体集合。
Stars: ✭ 314 (+24.6%)