All Projects → Friso → Similar Projects or Alternatives

151 Open source projects that are alternatives of or similar to Friso

bredon
A modern CSS value compiler in JavaScript
Stars: ✭ 39 (-87.54%)
Mutual labels:  tokenizer
Query Translator
Query Translator is a search query translator with AST representation
Stars: ✭ 165 (-47.28%)
Mutual labels:  tokenizer
neural tokenizer
Tokenize English sentences using neural networks.
Stars: ✭ 64 (-79.55%)
Mutual labels:  tokenizer
Udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (-48.88%)
Mutual labels:  tokenizer
rgpipe
lesspipe for ripgrep for common new filetypes using few dependencies
Stars: ✭ 21 (-93.29%)
Mutual labels:  full-text-search
Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (-57.83%)
Mutual labels:  tokenizer
pg-search-sequelize
Postgres full-text search in Node.js and Sequelize.
Stars: ✭ 31 (-90.1%)
Mutual labels:  full-text-search
Fugashi
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Stars: ✭ 125 (-60.06%)
Mutual labels:  tokenizer
mystem-scala
Morphological analyzer `mystem` (Russian language) wrapper for JVM languages
Stars: ✭ 21 (-93.29%)
Mutual labels:  tokenizer
Syntok
Text tokenization and sentence segmentation (segtok v2)
Stars: ✭ 123 (-60.7%)
Mutual labels:  tokenizer
mxusearch
🔍 基于讯搜封装的 Laravel 全文检索服务。
Stars: ✭ 40 (-87.22%)
Mutual labels:  full-text-search
Tokenizer
Source code tokenizer
Stars: ✭ 119 (-61.98%)
Mutual labels:  tokenizer
Jumanpp
Juman++ (a Morphological Analyzer Toolkit)
Stars: ✭ 254 (-18.85%)
Mutual labels:  tokenizer
Megamark
😻 Markdown with easy tokenization, a fast highlighter, and a lean HTML sanitizer
Stars: ✭ 100 (-68.05%)
Mutual labels:  tokenizer
psr2r-sniffer
A PSR-2-R code sniffer and code-style auto-correction-tool - including many useful additions
Stars: ✭ 32 (-89.78%)
Mutual labels:  tokenizer
Djurl
Simple yet helpful library for writing Django urls by an easy, short and intuitive way.
Stars: ✭ 85 (-72.84%)
Mutual labels:  tokenizer
gatsby-plugin-lunr
Gatsby plugin for full text search implementation based on lunr client-side index. Supports multilanguage search.
Stars: ✭ 69 (-77.96%)
Mutual labels:  full-text-search
Sentence Splitter
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-73.8%)
Mutual labels:  tokenizer
lex
Lex is an implementation of lex tool in Ruby.
Stars: ✭ 49 (-84.35%)
Mutual labels:  tokenizer
Wirb
Ruby Object Inspection for IRB
Stars: ✭ 69 (-77.96%)
Mutual labels:  tokenizer
search-for-kirby
Kirby 3 plugin for adding a search index (sqlite or Algolia).
Stars: ✭ 42 (-86.58%)
Mutual labels:  full-text-search
Thot
Thot toolkit for statistical machine translation
Stars: ✭ 53 (-83.07%)
Mutual labels:  tokenizer
hunspell
High-Performance Stemmer, Tokenizer, and Spell Checker for R
Stars: ✭ 101 (-67.73%)
Mutual labels:  tokenizer
Py Nltools
A collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-85.3%)
Mutual labels:  tokenizer
paperless-ng
A supercharged version of paperless: scan, index and archive all your physical documents
Stars: ✭ 4,840 (+1446.33%)
Mutual labels:  full-text-search
Sharpmath
A small .NET math library.
Stars: ✭ 36 (-88.5%)
Mutual labels:  tokenizer
lindera
A morphological analysis library.
Stars: ✭ 226 (-27.8%)
Mutual labels:  tokenizer
Omnicat Bayes
Naive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-90.42%)
Mutual labels:  tokenizer
Sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (-6.39%)
Mutual labels:  tokenizer
Laravel Token
Laravel token management
Stars: ✭ 10 (-96.81%)
Mutual labels:  tokenizer
lunr-module
Full-text search with pre-build indexes for Nuxt.js using lunr.js
Stars: ✭ 45 (-85.62%)
Mutual labels:  full-text-search
Lisp Esque Language
💠The Lel programming language
Stars: ✭ 24 (-92.33%)
Mutual labels:  tokenizer
ilmulti
Tooling to play around with multilingual machine translation for Indian Languages.
Stars: ✭ 19 (-93.93%)
Mutual labels:  tokenizer
Natasha
Solves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (+151.76%)
Mutual labels:  tokenizer
python-mecab
A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Stars: ✭ 27 (-91.37%)
Mutual labels:  tokenizer
Soynlp
한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+95.85%)
Mutual labels:  tokenizer
Library-Spring
The library web application where you can borrow books. It's Spring MVC and Hibernate project.
Stars: ✭ 73 (-76.68%)
Mutual labels:  full-text-search
Tokenizer
A small library for converting tokenized PHP source code into XML (and potentially other formats)
Stars: ✭ 4,770 (+1423.96%)
Mutual labels:  tokenizer
snapdragon-lexer
Converts a string into an array of tokens, with useful methods for looking ahead and behind, capturing, matching, et cetera.
Stars: ✭ 19 (-93.93%)
Mutual labels:  tokenizer
Smoothnlp
专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Stars: ✭ 435 (+38.98%)
Mutual labels:  tokenizer
nlpir-analysis-cn-ictclas
Lucene/Solr Analyzer Plugin. Support MacOS,Linux x86/64,Windows x86/64. It's a maven project, which allows you change the lucene/solr version. //Maven工程,修改Lucene/Solr版本,以兼容相应版本。
Stars: ✭ 71 (-77.32%)
Mutual labels:  chinese-word-segmentation
Moo
Optimised tokenizer/lexer generator! 🐄 Uses /y for performance. Moo.
Stars: ✭ 434 (+38.66%)
Mutual labels:  tokenizer
chinese-tokenizer
Tokenizes Chinese texts into words.
Stars: ✭ 72 (-77%)
Mutual labels:  tokenizer
Jflex
The fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+21.41%)
Mutual labels:  tokenizer
ArabicProcessingCog
A Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (-93.93%)
Mutual labels:  tokenizer
Ftserver
Lightweight Embeddable iBoxDB Full Text Search Server for Java
Stars: ✭ 219 (-30.03%)
Mutual labels:  full-text-search
suika
Suika 🍉 is a Japanese morphological analyzer written in pure Ruby
Stars: ✭ 31 (-90.1%)
Mutual labels:  tokenizer
Tntsearch
A fully featured full text search engine written in PHP
Stars: ✭ 2,693 (+760.38%)
Mutual labels:  full-text-search
liblex
C library for Lexical Analysis
Stars: ✭ 25 (-92.01%)
Mutual labels:  tokenizer
Everywhere
🔧 A tool can really search everywhere for you.
Stars: ✭ 147 (-53.04%)
Mutual labels:  full-text-search
Tokenizer
A tokenizer for Icelandic text
Stars: ✭ 27 (-91.37%)
Mutual labels:  tokenizer
Riddle
Ruby Client API for Sphinx
Stars: ✭ 139 (-55.59%)
Mutual labels:  full-text-search
text2text
Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (-39.94%)
Mutual labels:  tokenizer
lexertk
C++ Lexer Toolkit Library (LexerTk) https://www.partow.net/programming/lexertk/index.html
Stars: ✭ 26 (-91.69%)
Mutual labels:  tokenizer
Sentences
A multilingual command line sentence tokenizer in Golang
Stars: ✭ 293 (-6.39%)
Mutual labels:  tokenizer
Memex
Browser Extension to full-text search your browsing history & bookmarks.
Stars: ✭ 3,344 (+968.37%)
Mutual labels:  full-text-search
larasearch
A driver based solution to searching your Eloquent models supports Laravel 5.2 and Elasticsearch engine.
Stars: ✭ 13 (-95.85%)
Mutual labels:  full-text-search
lucilla
Fast, efficient, in-memory Full Text Search for Kotlin
Stars: ✭ 102 (-67.41%)
Mutual labels:  full-text-search
jargon
Tokenizers and lemmatizers for Go
Stars: ✭ 98 (-68.69%)
Mutual labels:  tokenizer
Roy VnTokenizer
Vietnamese tokenizer (Maximum Matching and CRF)
Stars: ✭ 49 (-84.35%)
Mutual labels:  tokenizer
61-120 of 151 similar projects