snapdragon-lexerConverts a string into an array of tokens, with useful methods for looking ahead and behind, capturing, matching, et cetera.
Stars: ✭ 19 (-26.92%)
bredonA modern CSS value compiler in JavaScript
Stars: ✭ 39 (+50%)
Snl CompilerSNL(Small Nested Language) Compiler. Maven jUnit Tokenizer Lexer Syntax Parser. 编译原理 词法分析 语法分析
Stars: ✭ 19 (-26.92%)
LexReplaced by foonathan/lexy
Stars: ✭ 137 (+426.92%)
Works For MeCollection of developer toolkits
Stars: ✭ 131 (+403.85%)
lexLex is an implementation of lex tool in Ruby.
Stars: ✭ 49 (+88.46%)
LexmachineLex machinary for go.
Stars: ✭ 335 (+1188.46%)
SwiLexA universal lexer library in Swift.
Stars: ✭ 29 (+11.54%)
ChevrotainParser Building Toolkit for JavaScript
Stars: ✭ 1,795 (+6803.85%)
MooOptimised tokenizer/lexer generator! 🐄 Uses /y for performance. Moo.
Stars: ✭ 434 (+1569.23%)
Php Parser🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (+1438.46%)
Snapdragonsnapdragon is an extremely pluggable, powerful and easy-to-use parser-renderer factory.
Stars: ✭ 180 (+592.31%)
pascal-interpreterA simple interpreter for a large subset of Pascal language written for educational purposes
Stars: ✭ 21 (-19.23%)
JflexThe fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+1361.54%)
HippoPHP standards checker.
Stars: ✭ 82 (+215.38%)
Js TokensTiny JavaScript tokenizer.
Stars: ✭ 166 (+538.46%)
String CalcPHP calculator library for mathematical terms (expressions) passed as strings
Stars: ✭ 60 (+130.77%)
sinlingA collection of NLP tools for Sinhalese (සිංහල).
Stars: ✭ 38 (+46.15%)
TokenizersFast, Consistent Tokenization of Natural Language Text
Stars: ✭ 161 (+519.23%)
GreynirThe greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (+80.77%)
TalismaneNLP framework: sentence detector, tokeniser, pos-tagger and dependency parser
Stars: ✭ 38 (+46.15%)
SharpmathA small .NET math library.
Stars: ✭ 36 (+38.46%)
Nlp Js Tools FrenchPOS Tagger, lemmatizer and stemmer for french language in javascript
Stars: ✭ 32 (+23.08%)
LfuzzerFuzzing Parsers with Tokens
Stars: ✭ 28 (+7.69%)
DjurlSimple yet helpful library for writing Django urls by an easy, short and intuitive way.
Stars: ✭ 85 (+226.92%)
BitextorBitextor generates translation memories from multilingual websites.
Stars: ✭ 168 (+546.15%)
Sentence SplitterText to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (+215.38%)
T-REXT-REX is a suite of smart contracts implementing the EIP 3643 and developed by Tokeny to manage and transfer financial assets on the Ethereum blockchain
Stars: ✭ 79 (+203.85%)
WirbRuby Object Inspection for IRB
Stars: ✭ 69 (+165.38%)
Query TranslatorQuery Translator is a search query translator with AST representation
Stars: ✭ 165 (+534.62%)
ThotThot toolkit for statistical machine translation
Stars: ✭ 53 (+103.85%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (+76.92%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+515.38%)
token-cliCommand line utility for interacting with OAuth2 infrastructure to generate tokens
Stars: ✭ 19 (-26.92%)
graspEssential NLP & ML, short & fast pure Python code
Stars: ✭ 58 (+123.08%)
KAIKAI is a distributed computing model written in modern C++ and is cross-plaftorm. Using custom language translators and an executor, KAI provides full reflection, persistence and cross-process communications without having to modify existing source code. KAI Comes with an automated, generational tricolor garbage collector, and Console- and Windo…
Stars: ✭ 13 (-50%)
go-jwt-issuerMicroservice generates the pair of JSON web tokens - access-token and refresh-token are signed by user identifier.
Stars: ✭ 30 (+15.38%)
Omnicat BayesNaive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (+15.38%)
TokenizerFast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (+407.69%)
FugashiA Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Stars: ✭ 125 (+380.77%)
Git-SecretGo scripts for finding sensitive data like API key / some keywords in the github repository
Stars: ✭ 156 (+500%)
NatashaSolves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (+2930.77%)
Mustard🌭 Mustard is a Swift library for tokenizing strings when splitting by whitespace doesn't cut it.
Stars: ✭ 689 (+2550%)
Soynlp한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+2257.69%)
SyntokText tokenization and sentence segmentation (segtok v2)
Stars: ✭ 123 (+373.08%)
KagomeSelf-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+2030.77%)
TokenizerA small library for converting tokenized PHP source code into XML (and potentially other formats)
Stars: ✭ 4,770 (+18246.15%)
Roy VnTokenizerVietnamese tokenizer (Maximum Matching and CRF)
Stars: ✭ 49 (+88.46%)
vdfA Lexer and Parser for Valves Data Format (known as vdf) written in Go
Stars: ✭ 30 (+15.38%)
Japanesetokenizersaim to use JapaneseTokenizer as easy as possible
Stars: ✭ 120 (+361.54%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+1584.62%)
Smoothnlp专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Stars: ✭ 435 (+1573.08%)
TokenizerSource code tokenizer
Stars: ✭ 119 (+357.69%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+1565.38%)
go-uciNative Go bindings for OpenWrt's UCI.
Stars: ✭ 69 (+165.38%)