bredonA modern CSS value compiler in JavaScript
Stars: ✭ 39 (-87.54%)
Query TranslatorQuery Translator is a search query translator with AST representation
Stars: ✭ 165 (-47.28%)
neural tokenizerTokenize English sentences using neural networks.
Stars: ✭ 64 (-79.55%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (-48.88%)
rgpipelesspipe for ripgrep for common new filetypes using few dependencies
Stars: ✭ 21 (-93.29%)
TokenizerFast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (-57.83%)
FugashiA Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Stars: ✭ 125 (-60.06%)
mystem-scalaMorphological analyzer `mystem` (Russian language) wrapper for JVM languages
Stars: ✭ 21 (-93.29%)
SyntokText tokenization and sentence segmentation (segtok v2)
Stars: ✭ 123 (-60.7%)
mxusearch🔍 基于讯搜封装的 Laravel 全文检索服务。
Stars: ✭ 40 (-87.22%)
TokenizerSource code tokenizer
Stars: ✭ 119 (-61.98%)
JumanppJuman++ (a Morphological Analyzer Toolkit)
Stars: ✭ 254 (-18.85%)
Megamark😻 Markdown with easy tokenization, a fast highlighter, and a lean HTML sanitizer
Stars: ✭ 100 (-68.05%)
psr2r-snifferA PSR-2-R code sniffer and code-style auto-correction-tool - including many useful additions
Stars: ✭ 32 (-89.78%)
DjurlSimple yet helpful library for writing Django urls by an easy, short and intuitive way.
Stars: ✭ 85 (-72.84%)
gatsby-plugin-lunrGatsby plugin for full text search implementation based on lunr client-side index. Supports multilanguage search.
Stars: ✭ 69 (-77.96%)
Sentence SplitterText to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-73.8%)
lexLex is an implementation of lex tool in Ruby.
Stars: ✭ 49 (-84.35%)
WirbRuby Object Inspection for IRB
Stars: ✭ 69 (-77.96%)
search-for-kirbyKirby 3 plugin for adding a search index (sqlite or Algolia).
Stars: ✭ 42 (-86.58%)
ThotThot toolkit for statistical machine translation
Stars: ✭ 53 (-83.07%)
hunspellHigh-Performance Stemmer, Tokenizer, and Spell Checker for R
Stars: ✭ 101 (-67.73%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-85.3%)
paperless-ngA supercharged version of paperless: scan, index and archive all your physical documents
Stars: ✭ 4,840 (+1446.33%)
SharpmathA small .NET math library.
Stars: ✭ 36 (-88.5%)
linderaA morphological analysis library.
Stars: ✭ 226 (-27.8%)
Omnicat BayesNaive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-90.42%)
SacremosesPython port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (-6.39%)
lunr-moduleFull-text search with pre-build indexes for Nuxt.js using lunr.js
Stars: ✭ 45 (-85.62%)
ilmultiTooling to play around with multilingual machine translation for Indian Languages.
Stars: ✭ 19 (-93.93%)
NatashaSolves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (+151.76%)
python-mecabA repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Stars: ✭ 27 (-91.37%)
Soynlp한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+95.85%)
Library-SpringThe library web application where you can borrow books. It's Spring MVC and Hibernate project.
Stars: ✭ 73 (-76.68%)
TokenizerA small library for converting tokenized PHP source code into XML (and potentially other formats)
Stars: ✭ 4,770 (+1423.96%)
snapdragon-lexerConverts a string into an array of tokens, with useful methods for looking ahead and behind, capturing, matching, et cetera.
Stars: ✭ 19 (-93.93%)
Smoothnlp专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Stars: ✭ 435 (+38.98%)
nlpir-analysis-cn-ictclasLucene/Solr Analyzer Plugin. Support MacOS,Linux x86/64,Windows x86/64. It's a maven project, which allows you change the lucene/solr version. //Maven工程,修改Lucene/Solr版本,以兼容相应版本。
Stars: ✭ 71 (-77.32%)
MooOptimised tokenizer/lexer generator! 🐄 Uses /y for performance. Moo.
Stars: ✭ 434 (+38.66%)
JflexThe fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+21.41%)
ArabicProcessingCogA Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (-93.93%)
FtserverLightweight Embeddable iBoxDB Full Text Search Server for Java
Stars: ✭ 219 (-30.03%)
suikaSuika 🍉 is a Japanese morphological analyzer written in pure Ruby
Stars: ✭ 31 (-90.1%)
TntsearchA fully featured full text search engine written in PHP
Stars: ✭ 2,693 (+760.38%)
liblexC library for Lexical Analysis
Stars: ✭ 25 (-92.01%)
Everywhere🔧 A tool can really search everywhere for you.
Stars: ✭ 147 (-53.04%)
TokenizerA tokenizer for Icelandic text
Stars: ✭ 27 (-91.37%)
RiddleRuby Client API for Sphinx
Stars: ✭ 139 (-55.59%)
text2textText2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (-39.94%)
lexertkC++ Lexer Toolkit Library (LexerTk) https://www.partow.net/programming/lexertk/index.html
Stars: ✭ 26 (-91.69%)
SentencesA multilingual command line sentence tokenizer in Golang
Stars: ✭ 293 (-6.39%)
MemexBrowser Extension to full-text search your browsing history & bookmarks.
Stars: ✭ 3,344 (+968.37%)
larasearchA driver based solution to searching your Eloquent models supports Laravel 5.2 and Elasticsearch engine.
Stars: ✭ 13 (-95.85%)
lucillaFast, efficient, in-memory Full Text Search for Kotlin
Stars: ✭ 102 (-67.41%)
jargonTokenizers and lemmatizers for Go
Stars: ✭ 98 (-68.69%)
Roy VnTokenizerVietnamese tokenizer (Maximum Matching and CRF)
Stars: ✭ 49 (-84.35%)