dveselov / mystem

Licence: MIT License
CGo bindings to Yandex.Mystem

go-mystem

CGo bindings to Yandex.Mystem, a Russian morphological analyzer.

Install

$ wget https://github.com/yandex/tomita-parser/releases/download/v1.0/libmystem_c_binding.so.linux_x64.zip
$ unzip libmystem_c_binding.so.linux_x64.zip
$ sudo cp libmystem_c_binding.so /usr/lib/
$ sudo ln -s /usr/lib/libmystem_c_binding.so /usr/lib/libmystem_c_binding.so.1
$ go get -u github.com/dveselov/mystem

Usage

package main

import (
    "fmt"

    "github.com/dveselov/mystem"
)

func main() {
    word := "маша"

    // Analyze the word and release the result when main returns.
    analyses := mystem.NewAnalyses(word)
    defer analyses.Close()

    fmt.Printf("Analyze of '%s':\n", word)
    for i := 0; i < analyses.Count(); i++ {
        lemma := analyses.GetLemma(i)
        grammemes := lemma.StemGram() // numeric grammeme codes
        fmt.Printf("%d. %s - %v\n", i+1, lemma.Text(), grammemes)
    }
}

The output will look like this:

Analyze of 'маша':
1. маша - [136 155 191 201]
2. махать - [137 196 206]

License

The source code of go-mystem is licensed under the MIT license, but Yandex.Mystem has its own EULA (which allows commercial use) that you must accept.
