207 open source projects that are alternatives to or similar to mystem-scala

libmorph
libmorph rus/ukr - fast & accurate morphological analyzer/analyses for Russian and Ukrainian
Stars: ✭ 16 (-23.81%)
Mutual labels:  lemmatizer, russian-morphology
jargon
Tokenizers and lemmatizers for Go
Stars: ✭ 98 (+366.67%)
Mutual labels:  tokenizer, lemmatizer
mystem
CGo bindings to Yandex.Mystem
Stars: ✭ 28 (+33.33%)
Mutual labels:  russian-specific, mystem
ArabicProcessingCog
A Python package that does stemming, tokenization, sentence breaking, segmentation, normalization, and POS tagging for the Arabic language.
Stars: ✭ 19 (-9.52%)
GrammarEngine
Grammatical Dictionary of the Russian Language (+ English, Japanese, etc.)
Stars: ✭ 68 (+223.81%)
Mutual labels:  lemmatizer, russian-morphology
simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
Stars: ✭ 32 (+52.38%)
Mutual labels:  tokenizer, lemmatizer
RussianNounsJS
Declension of nouns by case. Usually only the nominative form, animacy, and gender are required.
Stars: ✭ 29 (+38.1%)
yandex-checkout-node
Node.js SDK for Yandex.Checkout (unofficial)
Stars: ✭ 64 (+204.76%)
Mutual labels:  yandex
YandexAlgorithms
Lecture notes, Code with comments.
Stars: ✭ 30 (+42.86%)
Mutual labels:  yandex
translate
A module grouping multiple translation APIs
Stars: ✭ 321 (+1428.57%)
Mutual labels:  yandex
hunspell
High-Performance Stemmer, Tokenizer, and Spell Checker for R
Stars: ✭ 101 (+380.95%)
Mutual labels:  tokenizer
frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+233.33%)
Mutual labels:  computational-linguistics
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-38.1%)
Mutual labels:  computational-linguistics
ucto
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …
Stars: ✭ 58 (+176.19%)
Mutual labels:  computational-linguistics
lemma
A Morphological Parser (Analyser) / Lemmatizer written in Elixir.
Stars: ✭ 45 (+114.29%)
Mutual labels:  lemmatizer
YaSeeker
Yandex OSINT tool
Stars: ✭ 104 (+395.24%)
Mutual labels:  yandex
Yandex.Music.Api
Client Yandex.Music.Api for Yandex.Music
Stars: ✭ 53 (+152.38%)
Mutual labels:  yandex
tokenizer
Tokenize CSS according to the CSS Syntax
Stars: ✭ 52 (+147.62%)
Mutual labels:  tokenizer
robots-txt-parser
PHP class for parsing all directives from robots.txt files according to specifications
Stars: ✭ 38 (+80.95%)
Mutual labels:  yandex
wink-lemmatizer
English lemmatizer
Stars: ✭ 53 (+152.38%)
Mutual labels:  lemmatizer
citation-function
Measuring the Evolution of a Scientific Field through Citation Frames
Stars: ✭ 40 (+90.48%)
Mutual labels:  computational-linguistics
lara-hungarian-nlp
NLP class for rapid ChatBot development in Hungarian language
Stars: ✭ 27 (+28.57%)
Mutual labels:  lemmatizer
lxa5
Linguistica 5: Unsupervised Learning of Linguistic Structure
Stars: ✭ 27 (+28.57%)
Mutual labels:  computational-linguistics
nytwit
New York Times Word Innovation Types dataset
Stars: ✭ 21 (+0%)
Mutual labels:  computational-linguistics
gd-tokenizer
A small godot project with a tokenizer written in GDScript.
Stars: ✭ 34 (+61.9%)
Mutual labels:  tokenizer
python-mecab
A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Stars: ✭ 27 (+28.57%)
Mutual labels:  tokenizer
farasapy
A Python implementation of Farasa toolkit
Stars: ✭ 69 (+228.57%)
Mutual labels:  tokenizer
datastories-semeval2017-task6
Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (-4.76%)
Mutual labels:  computational-linguistics
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+395.24%)
Mutual labels:  tokenizer
yandex-dialogs-php-sdk
PHP library that simplifies working with Yandex Dialogs
Stars: ✭ 23 (+9.52%)
Mutual labels:  yandex
yandex-direct-api
PHP library for Yandex.Direct API v5 (abandoned)
Stars: ✭ 12 (-42.86%)
Mutual labels:  yandex
Turkish-Lemmatizer
Lemmatization for Turkish Language
Stars: ✭ 72 (+242.86%)
Mutual labels:  lemmatizer
xontrib-output-search
Get identifiers, paths, URLs and words from the previous command output and use them for the next command in xonsh shell.
Stars: ✭ 26 (+23.81%)
Mutual labels:  tokenizer
appmetrica-logsapi-loader
A tool for automatic data loading from AppMetrica LogsAPI into (local) ClickHouse
Stars: ✭ 18 (-14.29%)
Mutual labels:  yandex
psr2r-sniffer
A PSR-2-R code sniffer and code-style auto-correction-tool - including many useful additions
Stars: ✭ 32 (+52.38%)
Mutual labels:  tokenizer
artefactory-connectors-kit
ACK is an E(T)L tool specialized in API data ingestion. It is accessible through a Command-Line Interface. The application allows you to easily extract, stream and load data (with minimum transformations), from the API source to the destination of your choice.
Stars: ✭ 34 (+61.9%)
Mutual labels:  yandex
xy2xy
A list of technologies similar to inner Yandex technologies
Stars: ✭ 112 (+433.33%)
Mutual labels:  yandex
wink-tokenizer
Multilingual tokenizer that automatically tags each token with its type
Stars: ✭ 51 (+142.86%)
Mutual labels:  tokenizer
lex
Lex is an implementation of lex tool in Ruby.
Stars: ✭ 49 (+133.33%)
Mutual labels:  tokenizer
vscode-blockman
VSCode extension to highlight nested code blocks
Stars: ✭ 233 (+1009.52%)
Mutual labels:  tokenizer
tokenizer
A simple tokenizer in Ruby for NLP tasks.
Stars: ✭ 44 (+109.52%)
Mutual labels:  tokenizer
berserker
Berserker - BERt chineSE woRd toKenizER
Stars: ✭ 17 (-19.05%)
Mutual labels:  tokenizer
lindera
A morphological analysis library.
Stars: ✭ 226 (+976.19%)
Mutual labels:  tokenizer
sentiment-analysis-of-tweets-in-russian
Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.
Stars: ✭ 51 (+142.86%)
Mutual labels:  computational-linguistics
SwiLex
A universal lexer library in Swift.
Stars: ✭ 29 (+38.1%)
Mutual labels:  tokenizer
FA
Repository of practical assignments for the Applied Informatics program of the ИТиАБД faculty at the Financial University under the Government of the Russian Federation
Stars: ✭ 26 (+23.81%)
Mutual labels:  russian-specific
yandex-translate-api
A simple REST client library for Yandex.Translate
Stars: ✭ 29 (+38.1%)
Mutual labels:  yandex
drupal 8 unset html head link
🤖 Module for unsetting any wrong HTML links (like rel="delete-form", rel="edit-form", etc.) from the head on Drupal 8.x websites. This is a reliable way to improve your position in the SERPs of Google, Yandex, etc.
Stars: ✭ 19 (-9.52%)
Mutual labels:  yandex
datalinguist
Stanford CoreNLP in idiomatic Clojure.
Stars: ✭ 93 (+342.86%)
Mutual labels:  computational-linguistics
docker-machine-driver-yandex
Yandex.Cloud driver for Docker Machine
Stars: ✭ 21 (+0%)
Mutual labels:  yandex
word2vec-tsne
Google News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.
Stars: ✭ 59 (+180.95%)
Mutual labels:  computational-linguistics
snapdragon-lexer
Converts a string into an array of tokens, with useful methods for looking ahead and behind, capturing, matching, et cetera.
Stars: ✭ 19 (-9.52%)
Mutual labels:  tokenizer
elasticsearch-plugins
Some native scoring script plugins for elasticsearch
Stars: ✭ 30 (+42.86%)
Mutual labels:  tokenizer
yandex-disk-api
This library is built to use Yandex Disk API with PHP
Stars: ✭ 19 (-9.52%)
Mutual labels:  yandex
liblex
C library for Lexical Analysis
Stars: ✭ 25 (+19.05%)
Mutual labels:  tokenizer
golem
A lemmatizer implemented in Go
Stars: ✭ 54 (+157.14%)
Mutual labels:  lemmatizer
embedding evaluation
Evaluate your word embeddings
Stars: ✭ 32 (+52.38%)
Mutual labels:  computational-linguistics
yametrikapy
Python library for Yandex Metrika API
Stars: ✭ 20 (-4.76%)
Mutual labels:  yandex
alice-renderer
Node.js library for building responses in Yandex Alice skills.
Stars: ✭ 27 (+28.57%)
Mutual labels:  yandex
swfk
“Snake Wrangling for Kids”: the Russian translation.
Stars: ✭ 24 (+14.29%)
Mutual labels:  russian-specific