Jieba Rs: The Jieba Chinese word segmentation library implemented in Rust
Nlp4han: A Chinese NLP toolkit covering sentence segmentation, word segmentation, POS tagging, chunking, syntactic parsing, semantic analysis, NER, n-gram language models, HMM, pronoun resolution, sentiment analysis, and spell checking
Monpa: MONPA is a multi-task model providing Traditional Chinese word segmentation, POS tagging, and named entity recognition
Lac: Baidu NLP toolkit for word segmentation, POS tagging, named entity recognition, and word importance
Pyhanlp: Chinese NLP toolkit for word segmentation, POS tagging, named entity recognition, dependency parsing, new-word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, and pinyin and simplified/traditional conversion
Jiagu: A deep-learning-based Chinese NLP toolkit for knowledge-graph relation extraction, word segmentation, POS tagging, named entity recognition, sentiment analysis, new-word discovery, keyword extraction, text summarization, and text clustering
G2pc: g2pC, a context-aware grapheme-to-phoneme conversion module for Chinese
Symspell: SymSpell, spelling correction and fuzzy search claimed to be up to a million times faster, via the Symmetric Delete spelling correction algorithm
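The Symmetric Delete trick behind SymSpell is to precompute delete-only variants of every dictionary word, so lookup never has to enumerate insertions or substitutions. A minimal Python sketch of that idea follows; it is illustrative only (the real library adds frequency ranking, prefix pruning, and a final edit-distance verification of candidates), and the function names are made up for this example.

```python
# Minimal sketch of the Symmetric Delete idea: index every dictionary
# word under all strings reachable by character deletions, then match
# the query's own delete-variants against that index.

def deletes(word, max_distance=1):
    """All strings reachable from `word` by up to max_distance deletions."""
    results = {word}
    frontier = {word}
    for _ in range(max_distance):
        frontier = {w[:i] + w[i + 1:] for w in frontier for i in range(len(w))}
        results |= frontier
    return results

def build_index(dictionary, max_distance=1):
    """Map each delete-variant to the dictionary words that produce it."""
    index = {}
    for word in dictionary:
        for variant in deletes(word, max_distance):
            index.setdefault(variant, set()).add(word)
    return index

def lookup(query, index, max_distance=1):
    """Candidate corrections: dictionary words sharing a delete-variant."""
    candidates = set()
    for variant in deletes(query, max_distance):
        candidates |= index.get(variant, set())
    return candidates
```

Because both sides only delete, the candidate set can over-generate slightly; SymSpell filters it with a true edit-distance check, which this sketch omits.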
Greedycws: Source code for an ACL 2017 paper on Chinese word segmentation
Chinesenlp: Datasets and state-of-the-art results for every field of Chinese NLP
Jcseg: Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, plus keyword extraction, key-sentence extraction, and summary extraction based on the TEXTRANK algorithm. Jcseg has a built-in HTTP server and search modules for the latest Lucene, Solr, and Elasticsearch
Pkuseg Python: The pkuseg toolkit for multi-domain Chinese word segmentation
Friso: A high-performance Chinese tokenizer with both GBK and UTF-8 charset support, based on the MMSEG algorithm and developed in ANSI C. Fully modular, so it can be easily embedded in other programs such as MySQL, PostgreSQL, and PHP
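Several of these tokenizers (Jcseg, Friso) build on MMSEG, whose core is greedy longest-match segmentation against a dictionary, refined by extra disambiguation rules. A simplified forward-maximum-matching sketch in Python shows just that greedy core; it omits MMSEG's chunk-comparison rules, and the dictionary below is a toy example, not either tool's actual lexicon.

```python
# Simplified forward maximum matching: at each position, take the
# longest dictionary word that matches, falling back to a single
# character. (MMSEG adds disambiguation rules on top of this idea.)

def fmm_segment(text, dictionary, max_word_len=4):
    """Greedily match the longest dictionary word at each position."""
    tokens = []
    i = 0
    while i < len(text):
        match = text[i]  # fall back to a single character
        for length in range(min(max_word_len, len(text) - i), 1, -1):
            candidate = text[i:i + length]
            if candidate in dictionary:
                match = candidate
                break
        tokens.append(match)
        i += len(match)
    return tokens
```

The classic failure case motivating MMSEG's extra rules: for "研究生命起源" ("study the origin of life"), greedy matching picks "研究生" ("graduate student") first and mis-segments the rest, which is exactly the ambiguity the full algorithm's chunk-scoring rules are designed to resolve.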
nlpir-analysis-cn-ictclas: Lucene/Solr analyzer plugin. Supports macOS, Linux x86/64, and Windows x86/64. A Maven project, so you can change the Lucene/Solr version for compatibility
berserker: Berserker, a BERT-based Chinese word tokenizer (BERt chineSE woRd toKenizER)
Cross-Domain-CWS: Code for the IJCAI 2018 paper "Neural Networks Incorporating Unlabeled and Partially-labeled Data for Cross-domain Chinese Word Segmentation"