All Projects → messense → Jieba Rs

messense / Jieba Rs

Licence: mit
The Jieba Chinese Word Segmentation Implemented in Rust

Programming Languages

rust
11053 projects

Projects that are alternatives of or similar to Jieba Rs

Wasmpatch
🧱Yet Another Patch Module for iOS/macOS via WebAssembly
Stars: ✭ 192 (-12.33%)
Mutual labels:  wasm
Wasmersharp
.NET Bindings for the Wasmer Runtime
Stars: ✭ 203 (-7.31%)
Mutual labels:  wasm
Artichoke
💎 Artichoke is a Ruby made with Rust
Stars: ✭ 2,557 (+1067.58%)
Mutual labels:  wasm
Pyhanlp
中文分词 词性标注 命名实体识别 依存句法分析 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁 自然语言处理
Stars: ✭ 2,564 (+1070.78%)
Mutual labels:  chinese-word-segmentation
Lac
百度NLP:分词,词性标注,命名实体识别,词重要性
Stars: ✭ 2,792 (+1174.89%)
Mutual labels:  chinese-word-segmentation
Python
Python学习第三方库案例总结
Stars: ✭ 202 (-7.76%)
Mutual labels:  jieba
Yarte
Yarte stands for Yet Another Rust Template Engine
Stars: ✭ 189 (-13.7%)
Mutual labels:  wasm
Python Wechat Itchat
微信机器人,基于Python itchat接口功能实例展示:01-itchat获取微信好友或者微信群分享文章、02-itchat获取微信公众号文章、03-itchat监听微信公众号发送的文章、04 itchat监听微信群或好友撤回的消息、05 itchat获得微信好友信息以及表图对比、06 python打印出微信被删除好友、07 itchat自动回复好友、08 itchat微信好友个性签名词云图、09 itchat微信好友性别比例、10 微信群或微信好友撤回消息拦截、11 itchat微信群或好友之间转发消息
Stars: ✭ 216 (-1.37%)
Mutual labels:  jieba
Blazorrepl
Write, compile, execute and share Blazor components entirely in the browser
Stars: ✭ 196 (-10.5%)
Mutual labels:  wasm
Sandspiel
Creative cellular automata browser game
Stars: ✭ 2,476 (+1030.59%)
Mutual labels:  wasm
Rustmart Yew Example
Single Page Application (SPA) written using Rust, Wasm and Yew
Stars: ✭ 196 (-10.5%)
Mutual labels:  wasm
Olaf
Olaf: Overly Lightweight Acoustic Fingerprinting is a portable acoustic fingerprinting system.
Stars: ✭ 198 (-9.59%)
Mutual labels:  wasm
Nlp4han
中文自然语言处理工具集【断句/分词/词性标注/组块/句法分析/语义分析/NER/N元语法/HMM/代词消解/情感分析/拼写检查】
Stars: ✭ 206 (-5.94%)
Mutual labels:  chinese-word-segmentation
Ruukh
An experimental next-gen frontend framework for the Web in Rust.
Stars: ✭ 194 (-11.42%)
Mutual labels:  wasm
Vue
The progressive framework for WebAssembly applications.
Stars: ✭ 211 (-3.65%)
Mutual labels:  wasm
Wasm Examples
WebAssembly Examples
Stars: ✭ 191 (-12.79%)
Mutual labels:  wasm
Monpa
MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型
Stars: ✭ 203 (-7.31%)
Mutual labels:  chinese-word-segmentation
Pont
An online board game in Rust and WebAssembly
Stars: ✭ 218 (-0.46%)
Mutual labels:  wasm
Wasm Worker
Move a WebAssembly module into its own thread
Stars: ✭ 215 (-1.83%)
Mutual labels:  wasm
Gossamer
🕸️ Gossamer: A Go implementation of the Polkadot Host (WIP)
Stars: ✭ 207 (-5.48%)
Mutual labels:  wasm

jieba-rs

GitHub Actions codecov Crates.io docs.rs

The Jieba Chinese Word Segmentation Implemented in Rust

Installation

Add it to your Cargo.toml:

[dependencies]
jieba-rs = "0.6"

then you are good to go. If you are using Rust 2015 you have to extern crate jieba_rs to your crate root as well.

Example

use jieba_rs::Jieba;

fn main() {
    let jieba = Jieba::new();
    let words = jieba.cut("我们中出了一个叛徒", false);
    assert_eq!(words, vec!["我们", "中", "出", "了", "一个", "叛徒"]);
}

Enabling Additional Features

  • default-dict feature enables embedded dictionary, this features is enabled by default
  • tfidf feature enables TF-IDF keywords extractor
  • textrank feature enables TextRank keywords extractor
[dependencies]
jieba-rs = { version = "0.6", features = ["tfidf", "textrank"] }

Run benchmark

cargo bench --all-features

Benchmark: Compare with cppjieba

jieba-rs bindings

License

This work is released under the MIT license. A copy of the license is provided in the LICENSE file.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].