lexertkC++ Lexer Toolkit Library (LexerTk) https://www.partow.net/programming/lexertk/index.html
Stars: ✭ 26 (-81.02%)
snapdragon-lexerConverts a string into an array of tokens, with useful methods for looking ahead and behind, capturing, matching, et cetera.
Stars: ✭ 19 (-86.13%)
LexmachineLex machinary for go.
Stars: ✭ 335 (+144.53%)
Works For MeCollection of developer toolkits
Stars: ✭ 131 (-4.38%)
Php Parser🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (+191.97%)
Snl CompilerSNL(Small Nested Language) Compiler. Maven jUnit Tokenizer Lexer Syntax Parser. 编译原理 词法分析 语法分析
Stars: ✭ 19 (-86.13%)
MooOptimised tokenizer/lexer generator! 🐄 Uses /y for performance. Moo.
Stars: ✭ 434 (+216.79%)
bredonA modern CSS value compiler in JavaScript
Stars: ✭ 39 (-71.53%)
ChevrotainParser Building Toolkit for JavaScript
Stars: ✭ 1,795 (+1210.22%)
JflexThe fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+177.37%)
SwiLexA universal lexer library in Swift.
Stars: ✭ 29 (-78.83%)
lexLex is an implementation of lex tool in Ruby.
Stars: ✭ 49 (-64.23%)
pascal-interpreterA simple interpreter for a large subset of Pascal language written for educational purposes
Stars: ✭ 21 (-84.67%)
SharpmathA small .NET math library.
Stars: ✭ 36 (-73.72%)
DjurlSimple yet helpful library for writing Django urls by an easy, short and intuitive way.
Stars: ✭ 85 (-37.96%)
Omnicat BayesNaive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-78.1%)
PlyaraParse YARA rules and operate over them more easily.
Stars: ✭ 108 (-21.17%)
Sentence SplitterText to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-40.15%)
MicoMico ("Monkey" in catalan). Monkey language implementation done with C++. https://interpreterbook.com/
Stars: ✭ 19 (-86.13%)
Mustard🌭 Mustard is a Swift library for tokenizing strings when splitting by whitespace doesn't cut it.
Stars: ✭ 689 (+402.92%)
KagomeSelf-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+304.38%)
LibfsmDFA regular expression library & friends
Stars: ✭ 512 (+273.72%)
SimplecC/C++ develop tool for android.
Stars: ✭ 105 (-23.36%)
Charly VmFibers, Closures, C-Module System | NaN-boxing, bytecode-VM written in C++
Stars: ✭ 66 (-51.82%)
TokenizerA small library for converting tokenized PHP source code into XML (and potentially other formats)
Stars: ✭ 4,770 (+3381.75%)
TalismaneNLP framework: sentence detector, tokeniser, pos-tagger and dependency parser
Stars: ✭ 38 (-72.26%)
SomajoA tokenizer and sentence splitter for German and English web and social media texts.
Stars: ✭ 85 (-37.96%)
Nlp Js Tools FrenchPOS Tagger, lemmatizer and stemmer for french language in javascript
Stars: ✭ 32 (-76.64%)
TokenizerSource code tokenizer
Stars: ✭ 119 (-13.14%)
LfuzzerFuzzing Parsers with Tokens
Stars: ✭ 28 (-79.56%)
HippoPHP standards checker.
Stars: ✭ 82 (-40.15%)
FugashiA Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Stars: ✭ 125 (-8.76%)
Rs Monkey LangMonkey Programming Language written in Rust.
Stars: ✭ 80 (-41.61%)
NatashaSolves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (+475.18%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-21.17%)
Soynlp한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+347.45%)
WirbRuby Object Inspection for IRB
Stars: ✭ 69 (-49.64%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+219.71%)
Minigominigo🐥is a small Go compiler made from scratch. It can compile itself.
Stars: ✭ 456 (+232.85%)
Wordtokenizers.jlHigh performance tokenizers for natural language processing and other related tasks
Stars: ✭ 63 (-54.01%)
Nodablea node-able bidirectionnal expression editor.
Stars: ✭ 103 (-24.82%)
Smoothnlp专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Stars: ✭ 435 (+217.52%)
CsstreeA tool set for CSS including fast detailed parser, walker, generator and lexer based on W3C specs and browser implementations
Stars: ✭ 1,121 (+718.25%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+216.06%)
String CalcPHP calculator library for mathematical terms (expressions) passed as strings
Stars: ✭ 60 (-56.2%)
Tiny CompilerA tiny compiler for a language featuring LL(2) with Lexer, Parser, ASM-like codegen and VM. Complex enough to give you a flavour of how the "real" thing works whilst not being a mere toy example
Stars: ✭ 425 (+210.22%)
SyntokText tokenization and sentence segmentation (segtok v2)
Stars: ✭ 123 (-10.22%)
Megamark😻 Markdown with easy tokenization, a fast highlighter, and a lean HTML sanitizer
Stars: ✭ 100 (-27.01%)
ThotThot toolkit for statistical machine translation
Stars: ✭ 53 (-61.31%)
VeribleVerible is a suite of SystemVerilog developer tools, including a parser, style-linter, and formatter.
Stars: ✭ 384 (+180.29%)
GreynirThe greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (-65.69%)
Syntax ParserLight and fast 🚀parser! With zero dependents. - Sql Parser Demo added!
Stars: ✭ 317 (+131.39%)
RflexFast lexer code generator for Rust
Stars: ✭ 100 (-27.01%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-66.42%)
FrisoHigh performance Chinese tokenizer with both GBK and UTF-8 charset support based on MMSEG algorithm developed by ANSI C. Completely based on modular implementation and can be easily embedded in other programs, like: MySQL, PostgreSQL, PHP, etc.
Stars: ✭ 313 (+128.47%)
ExprtkC++ Mathematical Expression Parsing And Evaluation Library
Stars: ✭ 301 (+119.71%)