howl-anderson / Hanzi_chaizi

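A Chinese character decomposition library. It splits a Chinese character into its components and radicals, which can serve as glyph-shape features of Chinese characters in machine learning.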

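Chinese Character Decomposition

Character decomposition (拆字) means breaking a single character into several characters, using basic building units such as strokes and glyph shapes.

Decomposition gives characters with similar glyph shapes similar decomposition results.

Deep learning models can use this property as one of a character's features: its glyph shape.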
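
To make the "similar glyphs, similar decompositions" idea concrete, here is a minimal sketch that compares characters by the components they share. The table `TOY_COMPONENTS` and the function `component_overlap` are hypothetical names introduced for illustration; the table is hand-written and is not loaded from this library's data.

```python
# Toy decomposition table, hand-written for illustration only.
# It is NOT loaded from hanzi_chaizi or its data files.
TOY_COMPONENTS = {
    "名": ["夕", "口"],
    "吗": ["口", "马"],
    "妈": ["女", "马"],
    "好": ["女", "子"],
}


def component_overlap(a: str, b: str) -> float:
    """Jaccard similarity between the component sets of two characters."""
    set_a = set(TOY_COMPONENTS[a])
    set_b = set(TOY_COMPONENTS[b])
    return len(set_a & set_b) / len(set_a | set_b)


print(component_overlap("吗", "妈"))  # the two characters share 马
print(component_overlap("名", "好"))  # no shared component
```

Jaccard overlap is only one possible way to compare decompositions. A model would more likely embed each component and learn the similarity itself.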

Usage

from hanzi_chaizi import HanziChaizi

# Create the decomposition lookup object
hc = HanziChaizi()
# Look up the components of a single character
result = hc.query('名')

print(result)

Output

['夕', '口']
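
For feature extraction, the same lookup is usually applied to every character of a text. The sketch below shows one way to do that. The helper `text_to_components` is a hypothetical name, and it accepts any lookup callable because this README does not say what `query` returns for characters missing from the dictionary; the fallback to the character itself is an assumption, not documented library behaviour.

```python
from typing import Callable, List, Optional, Sequence


def text_to_components(
    text: str,
    query: Callable[[str], Optional[Sequence[str]]],
) -> List[List[str]]:
    """Return one component list per character of `text`.

    Characters for which `query` returns nothing (None or empty) are kept
    as a single-item list, so the output stays aligned with the input.
    """
    features = []
    for char in text:
        components = query(char)
        features.append(list(components) if components else [char])
    return features


# Stand-in lookup so the sketch runs without the library installed.
# With the library available, `HanziChaizi().query` could be passed instead.
toy_table = {"名": ["夕", "口"]}
print(text_to_components("名a", toy_table.get))
# prints [['夕', '口'], ['a']]
```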

Generating from the Raw Data

Data source

The data comes from 漢語拆字字典 (a Chinese character decomposition dictionary).

Parsing

python ./parse.py
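
The README does not show what `parse.py` does internally. As a rough sketch of the kind of parsing involved, the code below turns raw dictionary lines into a lookup table. The line format (character, then tab-separated decompositions whose components are separated by spaces) and the function name `parse_lines` are assumptions made for illustration, not a description of the actual script or data file.

```python
from typing import Dict, Iterable, List


def parse_lines(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Map each character to its first listed decomposition.

    ASSUMED line format (not confirmed by this README):
    <character> TAB <components separated by spaces> [TAB <alternative> ...]
    """
    table = {}
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 2 or not fields[0]:
            continue  # skip blank or malformed lines
        table[fields[0]] = fields[1].split()
    return table


# Hand-written sample lines in the assumed format.
sample = ["名\t夕 口\n", "\n"]
print(parse_lines(sample))
# prints {'名': ['夕', '口']}
```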

Acknowledgements

The decomposition data comes from 漢語拆字字典.
