LexReplaced by foonathan/lexy
Stars: ✭ 137 (+621.05%)
lexertkC++ Lexer Toolkit Library (LexerTk) https://www.partow.net/programming/lexertk/index.html
Stars: ✭ 26 (+36.84%)
pascal-interpreterA simple interpreter for a large subset of Pascal language written for educational purposes
Stars: ✭ 21 (+10.53%)
LexmachineLex machinary for go.
Stars: ✭ 335 (+1663.16%)
Works For MeCollection of developer toolkits
Stars: ✭ 131 (+589.47%)
JflexThe fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+1900%)
ChevrotainParser Building Toolkit for JavaScript
Stars: ✭ 1,795 (+9347.37%)
SwiLexA universal lexer library in Swift.
Stars: ✭ 29 (+52.63%)
bredonA modern CSS value compiler in JavaScript
Stars: ✭ 39 (+105.26%)
MooOptimised tokenizer/lexer generator! 🐄 Uses /y for performance. Moo.
Stars: ✭ 434 (+2184.21%)
Php Parser🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (+2005.26%)
snapdragon-lexerConverts a string into an array of tokens, with useful methods for looking ahead and behind, capturing, matching, et cetera.
Stars: ✭ 19 (+0%)
lexLex is an implementation of lex tool in Ruby.
Stars: ✭ 49 (+157.89%)
llvm-kaleidoscopeLLVM Tutorial: Kaleidoscope (Implementing a Language with LLVM)
Stars: ✭ 124 (+552.63%)
Hebrew-TokenizerA very simple python tokenizer for Hebrew text.
Stars: ✭ 16 (-15.79%)
cang-jieChinese tokenizer for tantivy, based on jieba-rs
Stars: ✭ 48 (+152.63%)
PaddleTokenizer使用 PaddlePaddle 实现基于深度神经网络的中文分词引擎 | A DNN Chinese Tokenizer by Using PaddlePaddle
Stars: ✭ 14 (-26.32%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+2205.26%)
FrisoHigh performance Chinese tokenizer with both GBK and UTF-8 charset support based on MMSEG algorithm developed by ANSI C. Completely based on modular implementation and can be easily embedded in other programs, like: MySQL, PostgreSQL, PHP, etc.
Stars: ✭ 313 (+1547.37%)
ariaExpressive, noiseless, interpreted, toy programming language
Stars: ✭ 40 (+110.53%)
LixyA Kotlin lexer framework with an easy-to-use DSL
Stars: ✭ 38 (+100%)
SentencesA multilingual command line sentence tokenizer in Golang
Stars: ✭ 293 (+1442.11%)
MonkeyLang.jl"Writing an Interpreter in GO" and "Writing a Compiler in GO" in Julia.
Stars: ✭ 30 (+57.89%)
KagomeSelf-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+2815.79%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+2178.95%)
Re FlexThe regex-centric, fast lexical analyzer generator for C++ with full Unicode support. Faster than Flex. Accepts Flex specifications. Generates reusable source code that is easy to understand. Introduces indent/dedent anchors, lazy quantifiers, functions for lex/syntax error reporting, and more. Seamlessly integrates with Bison and other parsers.
Stars: ✭ 274 (+1342.11%)
malluscriptA simple,gentle,humble scripting language for mallus, based on malayalam memes.
Stars: ✭ 112 (+489.47%)
Minigominigo🐥is a small Go compiler made from scratch. It can compile itself.
Stars: ✭ 456 (+2300%)
alexaA Lexical Analyzer Generator
Stars: ✭ 54 (+184.21%)
Syntax ParserLight and fast 🚀parser! With zero dependents. - Sql Parser Demo added!
Stars: ✭ 317 (+1568.42%)
text2textText2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+889.47%)
Soynlp한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+3126.32%)
parleParser and lexer for PHP
Stars: ✭ 68 (+257.89%)
ExprtkC++ Mathematical Expression Parsing And Evaluation Library
Stars: ✭ 301 (+1484.21%)
simplemmaSimple multilingual lemmatizer for Python, especially useful for speed and efficiency
Stars: ✭ 32 (+68.42%)
Smoothnlp专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Stars: ✭ 435 (+2189.47%)
soccSimple C Compiler in OCaml
Stars: ✭ 41 (+115.79%)
SacremosesPython port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (+1442.11%)
mystem-scalaMorphological analyzer `mystem` (Russian language) wrapper for JVM languages
Stars: ✭ 21 (+10.53%)
NatashaSolves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (+4047.37%)
oceanProgramming language that compiles into a x86 ELF executable.
Stars: ✭ 164 (+763.16%)
EdgeNode.js templating engine with fresh air
Stars: ✭ 270 (+1321.05%)
tokenizerTokenize CSS according to the CSS Syntax
Stars: ✭ 52 (+173.68%)
SwiftpascalinterpreterSimple Swift interpreter for the Pascal language inspired by the Let’s Build A Simple Interpreter article series.
Stars: ✭ 270 (+1321.05%)
ilmultiTooling to play around with multilingual machine translation for Indian Languages.
Stars: ✭ 19 (+0%)
JumanppJuman++ (a Morphological Analyzer Toolkit)
Stars: ✭ 254 (+1236.84%)
vscode-blockmanVSCode extension to highlight nested code blocks
Stars: ✭ 233 (+1126.32%)
fayrant-langSimple, interpreted, dynamically-typed programming language
Stars: ✭ 30 (+57.89%)
LibfsmDFA regular expression library & friends
Stars: ✭ 512 (+2594.74%)
Tiny CompilerA tiny compiler for a language featuring LL(2) with Lexer, Parser, ASM-like codegen and VM. Complex enough to give you a flavour of how the "real" thing works whilst not being a mere toy example
Stars: ✭ 425 (+2136.84%)
graphql-metaLexing, parsing, pretty-printing, and metaprogramming facilities for dealing with GraphQL schemas and queries
Stars: ✭ 16 (-15.79%)
bshiftCompiler for a language called bshift
Stars: ✭ 15 (-21.05%)
liblexC library for Lexical Analysis
Stars: ✭ 25 (+31.58%)
wink-tokenizerMultilingual tokenizer that automatically tags each token with its type
Stars: ✭ 51 (+168.42%)
berserkerBerserker - BERt chineSE woRd toKenizER
Stars: ✭ 17 (-10.53%)
ArabicProcessingCogA Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (+0%)
compilerImplementing a complete Compiler for a simple C-like language using the C-tools Flex and Bison
Stars: ✭ 106 (+457.89%)