fredwu / Stemmer
An English (Porter2) stemming implementation in Elixir.
Projects that are alternatives of or similar to Stemmer
lorcaNatural Language Processing for Spanish in Node.js. Stemmer, sentiment analysis, readability, tf-idf with batteries, concordance and more!
Stars: ✭ 95 (-29.1%)
Mutual labels: stemmer
Word formsAccurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.
Stars: ✭ 463 (+245.52%)
Mutual labels: stemmer
Kelime kok ayiriciDerin Öğrenme Tabanlı - seq2seq - Türkçe için kelime kökü bulma web uygulaması - Turkish Stemmer (tr_stemmer)
Stars: ✭ 76 (-43.28%)
Mutual labels: stemmer
lancaster-stemmerLancaster stemming algorithm
Stars: ✭ 22 (-83.58%)
Mutual labels: stemmer
Lunr LanguagesA collection of languages stemmers and stopwords for Lunr Javascript library
Stars: ✭ 296 (+120.9%)
Mutual labels: stemmer
AkarataIndonesian stemmer - Pustaka JavaScript untuk mengambil kata dasar dari kata berimbuhan pada bahasa Indonesia.
Stars: ✭ 26 (-80.6%)
Mutual labels: stemmer
stemmifyRuby module that converts a word to its approximate root form with the Porter stemmer. For example, observing and observation reduce to observ.
Stars: ✭ 54 (-59.7%)
Mutual labels: stemmer
StemmerFast Porter stemmer implementation
Stars: ✭ 86 (-35.82%)
Mutual labels: stemmer
Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+243.28%)
Mutual labels: stemmer
Nlp Js Tools FrenchPOS Tagger, lemmatizer and stemmer for french language in javascript
Stars: ✭ 32 (-76.12%)
Mutual labels: stemmer
CISTEMStemmer for German
Stars: ✭ 33 (-75.37%)
Mutual labels: stemmer
Ruby StemmerExpose libstemmer_c to Ruby
Stars: ✭ 254 (+89.55%)
Mutual labels: stemmer
PtstemStemming Algorithms for the Portuguese Language
Stars: ✭ 13 (-90.3%)
Mutual labels: stemmer
QutufQutuf (قُطُوْف): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.
Stars: ✭ 84 (-37.31%)
Mutual labels: stemmer
perstemPersian stemmer and morphological analyzer
Stars: ✭ 18 (-86.57%)
Mutual labels: stemmer
SnowballSnowball version of the Porter stemmer for the Lithuanian language.
Stars: ✭ 5 (-96.27%)
Mutual labels: stemmer
ArabicstemmerAssem's Arabic Light Stemmer is a snowball-based stemming algorithm for Arabic aimed mainly to improve search.
Stars: ✭ 102 (-23.88%)
Mutual labels: stemmer
Php StemmerNative PHP Stemmer
Stars: ✭ 84 (-37.31%)
Mutual labels: stemmer
Arabic Light StemmerArabic light stemmer. Light stemming for Arabic words removes prefixes and suffixes and normalizes words
Stars: ✭ 14 (-89.55%)
Mutual labels: stemmer
Stemmer
An English (Porter2) stemming implementation in Elixir.
In linguistic morphology and information retrieval, stemming is the process of reducing inflected (or sometimes derived) words to their word stem, base or root form—generally a written word form. The stem need not be identical to the morphological root of the word; it is usually sufficient that related words map to the same stem, even if this stem is not in itself a valid root. - Wikipedia
Usage
The Stemmer.stem/1
function supports stemming a single word (String
), a sentence (String
) or a list of single words (List
of String
s).
Stemmer.stem("capabilities") # => "capabl"
Stemmer.stem("extraordinary capabilities") # => "extraordinari capabl"
Stemmer.stem(["extraordinary", "capabilities"]) # => ["extraordinari", "capabl"]
Compatibility
Stemmer is 100% compatible with the official Porter2 implementation, it is tested against the official diffs.txt
which contains more than 29000 words.
Naive Bayes
Stemmer was built to support the Simple Bayes library. ❤️
License
Licensed under MIT.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at
[email protected].