All Projects → mayabot → Mynlp

mayabot / Mynlp

Licence: apache-2.0
一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典)

Programming Languages

java
68154 projects - #9 most used programming language

Projects that are alternatives of or similar to Mynlp

Biosentvec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Stars: ✭ 308 (-40.66%)
Mutual labels:  fasttext
Fast sentence embeddings
Compute Sentence Embeddings Fast!
Stars: ✭ 384 (-26.01%)
Mutual labels:  fasttext
Limax
Node.js module to generate URL slugs. Another one? This one cares about i18n and transliterates non-Latin scripts to conform to the RFC3986 standard. Mostly API-compatible with similar modules.
Stars: ✭ 423 (-18.5%)
Mutual labels:  pinyin
Cga.js
CGA 3D 计算几何算法库 | 3D Compute Geometry Algorithm Library webgl three.js babylon.js等任何库都可以使用
Stars: ✭ 313 (-39.69%)
Mutual labels:  segment
Jxpagelistview
高仿闲鱼、转转、京东、中央天气预报等主流APP列表底部分页滚动视图
Stars: ✭ 377 (-27.36%)
Mutual labels:  segment
Sppagemenu
分页菜单,功能非常齐全,满足绝大多数APP,简书地址:
Stars: ✭ 402 (-22.54%)
Mutual labels:  segment
Tinypinyin
适用于Java和Android的快速、低内存占用的汉字转拼音库。
Stars: ✭ 3,348 (+545.09%)
Mutual labels:  pinyin
Pinyin
Pure JavaScript library for converting Hanzi to Pinyin.
Stars: ✭ 504 (-2.89%)
Mutual labels:  pinyin
Text Classification Models Pytorch
Implementation of State-of-the-art Text Classification Models in Pytorch
Stars: ✭ 379 (-26.97%)
Mutual labels:  fasttext
Chineseutil
PHP 中文工具包,支持汉字转拼音、拼音分词、简繁互转、数字、金额大写;QQ群:17916227
Stars: ✭ 413 (-20.42%)
Mutual labels:  pinyin
Kafka Go
Kafka library in Go
Stars: ✭ 4,200 (+709.25%)
Mutual labels:  segment
Pinyin
🇨🇳 基于词库的中文转拼音优质解决方案
Stars: ✭ 3,824 (+636.8%)
Mutual labels:  pinyin
Lmdb Embeddings
Fast word vectors with little memory usage in Python
Stars: ✭ 404 (-22.16%)
Mutual labels:  fasttext
Neural chinese transliterator
Can CNNs transliterate Pinyin into Chinese characters correctly?
Stars: ✭ 310 (-40.27%)
Mutual labels:  pinyin
Easysegmentedbarview
本库主要提供一个简单易用的自定义分段控件,方便快速实现分段效果,支持xml配置、代码配置、分段规则按均分/比例分、数字分段、文本分段、渐变分段、bar条样式正常/圆形/三角形,segment文字样式、进度设置、进度标记类型设置、分段描述设置、其它更多自定义设置等功能。
Stars: ✭ 455 (-12.33%)
Mutual labels:  segment
Python Pinyin
汉字转拼音(pypinyin)
Stars: ✭ 3,618 (+597.11%)
Mutual labels:  pinyin
Pinyin Engine
JavaScript 拼音匹配引擎
Stars: ✭ 386 (-25.63%)
Mutual labels:  pinyin
Redux Segment
Segment.io analytics integration for redux.
Stars: ✭ 517 (-0.39%)
Mutual labels:  segment
Rime Cloverpinyin
🍀️四叶草拼音输入方案,做最好用的基于rime开源的简体拼音输入方案!
Stars: ✭ 478 (-7.9%)
Mutual labels:  pinyin
Wmzpagecontroller
分页控制器,替换UIPageController方案,具备完整的生命周期,多种指示器样式,多种标题样式,可悬浮,支持ios13暗黑模式(仿优酷,爱奇艺,今日头条,简书,京东等多种标题菜单) (Pagination controller with full life cycle, multiple indicator styles, multiple title styles)
Stars: ✭ 405 (-21.97%)
Mutual labels:  segment

= Mynlp: 高性能、可扩展的中文NLP工具包 :version: 3.3.0 :icons: font

image:https://img.shields.io/github/license/mayabot/mynlp.svg[] image:https://maven-badges.herokuapp.com/maven-central/com.mayabot.mynlp/mynlp/badge.svg[link=https://maven-badges.herokuapp.com/maven-central/com.mayabot.mynlp/mynlp] image:https://img.shields.io/github/release/mayabot/mynlp/all.svg[link=https://github.com/mayabot/mynlp/releases/latest] image:https://img.shields.io/github/repo-size/mayabot/mynlp[link=https://github.com/mayabot/mynlp] image:https://img.shields.io/github/issues-raw/mayabot/mynlp.svg[link=https://github.com/mayabot/mynlp/issues]

image::https://cdn.mayabot.com/mynlp/mynlp-banner.png[,500,align=center,link=https://mynlp.mayabot.com]

[NOTE] 访问完整在线文档link:https://mynlp.mayabot.com/[ mynlp.mayabot.com]

== 安装

该章节介绍如何安装和简单使用mynlp的基础功能。

mynlp已经发布在Maven中央仓库中,所以只需要在Maven或者Gradle中引入mynlp.jar依赖即可。

.Gradle [subs="attributes+"]

compile 'com.mayabot.mynlp:mynlp:{version}'

.Maven [source,xml,subs="attributes+"]

com.mayabot.mynlp mynlp {version} ----

因为资源文件较大,所以mynlp.jar包默认不包括资源文件(词典和模型文件)依赖。

懒人方案,通过引用mynlp-all依赖默认提供的资源词典,满足大部分需求。

.依赖 mynlp-all [subs="attributes+"]

compile 'com.mayabot.mynlp:mynlp-all:{version}'

=== 词典和模型资源

.词典&模型资源列表 [cols="6,^1,^1,4"] |=== |Gradle 坐标 | mynlp-all依赖 |文件大小 |说明

|com.mayabot.mynlp.resource:mynlp-resource-coredict:1.0.0 |Y |18.2M |核心词典(20w+词,500w+二元)

|com.mayabot.mynlp.resource:mynlp-resource-pos:1.0.0 |Y |17.5M |词性标注模型(感知机模型)

|com.mayabot.mynlp.resource:mynlp-resource-ner:1.0.0 |Y |13.4M |命名实体识别(人名识别、其他NER)

|com.mayabot.mynlp.resource:mynlp-resource-pinyin:1.1.0 |Y |272K |拼音词典、拼音切分模型

|com.mayabot.mynlp.resource:mynlp-resource-transform:1.0.0 |Y |478K |繁简体词典

|com.mayabot.mynlp.resource:mynlp-resource-cws:1.0.0 |N |62.4M |感知机分词模型

|com.mayabot.mynlp.resource:mynlp-resource-custom:1.0.0 |N |2.19M |自定义扩展词库

|===

根据实际的需要,按需引入资源包。

[source] .一个Gradle引用的例子

compile 'com.mayabot.mynlp:mynlp:3.2.0'

// 核心词典 implementation 'com.mayabot.mynlp.resource:mynlp-resource-coredict:1.0.0'

// 词性标注 implementation 'com.mayabot.mynlp.resource:mynlp-resource-pos:1.0.0'

// 命名实体 implementation 'com.mayabot.mynlp.resource:mynlp-resource-ner:1.0.0'

// 拼音 implementation 'com.mayabot.mynlp.resource:mynlp-resource-pinyin:1.1.0'

// 繁简体转换 implementation 'com.mayabot.mynlp.resource:mynlp-resource-transform:1.0.0'

// 感知机分词模型 // implementation 'com.mayabot.mynlp.resource:mynlp-resource-cws:1.0.0'

// 自定义扩展词库 // implementation 'com.mayabot.mynlp.resource:mynlp-resource-custom:1.0.0'

== 访问完整在线文档

link:https://mynlp.mayabot.com/[mynlp.mayabot.com]

== 致谢以下优秀开源项目

  • HanLP
  • ansj_seg

mynlp实现参考了他们算法实现和部分代码

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].