duydo / Elasticsearch Analysis Vietnamese
Licence: apache-2.0
Vietnamese Analysis Plugin for Elasticsearch
Stars: ✭ 304
Programming Languages
java
68154 projects - #9 most used programming language
Labels
Projects that are alternatives of or similar to Elasticsearch Analysis Vietnamese
Serilog Sinks Elasticsearch
A Serilog sink that writes events to Elasticsearch
Stars: ✭ 291 (-4.28%)
Mutual labels: elasticsearch
Sysmonsearch
Investigate suspicious activity by visualizing Sysmon's event log
Stars: ✭ 302 (-0.66%)
Mutual labels: elasticsearch
Doctor
基于知识图谱的医学诊断系统。Medical Diagnosis System Based on Knowledge Map.
Stars: ✭ 286 (-5.92%)
Mutual labels: elasticsearch
Workarea
Workarea is an enterprise-grade Ruby on Rails commerce platform
Stars: ✭ 290 (-4.61%)
Mutual labels: elasticsearch
Elasticsearchbundle
Symfony bundle for Elasticsearch with steroids
Stars: ✭ 296 (-2.63%)
Mutual labels: elasticsearch
Spring Elasticsearch
Spring factories for elasticsearch
Stars: ✭ 279 (-8.22%)
Mutual labels: elasticsearch
Elasticsearch Py
Official Elasticsearch client library for Python
Stars: ✭ 3,486 (+1046.71%)
Mutual labels: elasticsearch
Hastic Server
Hastic data management server for analyzing patterns and anomalies from Grafana
Stars: ✭ 292 (-3.95%)
Mutual labels: elasticsearch
Sens
基于SpringBoot+MyBatis+Shiro+Redis+ElasticSearch的企业级博客系统
Stars: ✭ 287 (-5.59%)
Mutual labels: elasticsearch
Ansible Elk
📊 Ansible playbook for setting up an ELK/EFK stack and clients.
Stars: ✭ 284 (-6.58%)
Mutual labels: elasticsearch
Fast Elasticsearch Vector Scoring
Score documents using embedding-vectors dot-product or cosine-similarity with ES Lucene engine
Stars: ✭ 304 (+0%)
Mutual labels: elasticsearch
Korio
Korio: Kotlin cORoutines I/O : Virtual File System + Async/Sync Streams + Async TCP Client/Server + WebSockets for Multiplatform Kotlin 1.3
Stars: ✭ 282 (-7.24%)
Mutual labels: elasticsearch
Elk Kubernetes
This repo shows how to configure complete EFK stack on top of Kubernetes
Stars: ✭ 294 (-3.29%)
Mutual labels: elasticsearch
Elasticsearch loader
A tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch
Stars: ✭ 300 (-1.32%)
Mutual labels: elasticsearch
Vietnamese Analysis Plugin for Elasticsearch
Vietnamese Analysis plugin integrates Vietnamese language analysis into Elasticsearch.
The plugin provides the vi_analyzer
analyzer and vi_tokenizer
tokenizer. The vi_analyzer
is composed of the vi_tokenizer
tokenizer, the lowercase
and stop
filter.
Installation on Elasticsearch 5.x
In order to install the plugin, choose a version in releases page then run:
bin/elasticsearch-plugin install link/to/binary/version
Build from Source
Check this post: How to build Elasticsearch Vietnamese Analysis Plugin
Sample Usage
Sample request
GET localhost:9200/analyzer
{
"analyzer": "vi_analyzer",
"text": "một chuỗi tiếng việt"
}
Result
{
"tokens": [
{
"token": "một",
"start_offset": 0,
"end_offset": 3,
"type": "<PHRASE>",
"position": 0
},
{
"token": "chuỗi",
"start_offset": 4,
"end_offset": 9,
"type": "<PHRASE>",
"position": 1
},
{
"token": "tiếng",
"start_offset": 10,
"end_offset": 15,
"type": "<PHRASE>",
"position": 2
},
{
"token": "việt",
"start_offset": 16,
"end_offset": 20,
"type": "<PHRASE>",
"position": 3
}
]
}
If testing failed, consider restarting the elastic service to reload the plugin.
Compatible Versions
Vietnamese Analysis Plugin | Elasticsearch |
---|---|
master | 7.3.1 |
7.5.1 | 7.5.1 |
7.4.1 | 7.4.1 |
7.4.0 | 7.4.0 |
7.3.2 | 7.3.2 |
7.3.1 | 7.3.1 |
5.6.5 | 5.6.5 |
5.4.1 | 5.4.1 |
5.3.1 | 5.3.1 |
5.2.1 | 5.2.1 |
2.4.1 | 2.4.1 |
2.4.0 | 2.4.0 |
2.3.5 | 2.3.5 |
2.3.4 | 2.3.4 |
2.3.3 | 2.3.3 |
2.3.2 | 2.3.2 |
2.3.1 | 2.3.1 |
2.3.0 | 2.3.0 |
0.2.2 | 2.2.0 |
0.2.1.1 | 2.1.1 |
0.2.1 | 2.1.0 |
0.2 | 2.0.0 |
0.1.7 | 1.7+ |
0.1.6 | 1.6+ |
0.1.5 | 1.5+ |
0.1.1 | 1.4+ |
0.1 | 1.3 |
Thanks to
- Lê Hồng Phương for his VnTokenizer library
- JetBrains has provided a free license for their great tool: IntelliJ IDEA
License
This software is licensed under the Apache 2 license, quoted below.
Licensed under the Apache License, Version 2.0 (the "License"); you may not
use this file except in compliance with the License. You may obtain a copy of
the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
License for the specific language governing permissions and limitations under
the License.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].