SomajoA tokenizer and sentence splitter for German and English web and social media texts.
Stars: ✭ 85 (-28.57%)
Componette Site➿ Addons, plugins, components and extensions (@componette ❤️ @nette)
Stars: ✭ 56 (-52.94%)
GuitarA Cross-Platform String and Regular Expression Library written in Swift.
Stars: ✭ 641 (+438.66%)
GreynirThe greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (-60.5%)
Cyberchef RecipesA list of cyber-chef recipes and curated links
Stars: ✭ 619 (+420.17%)
DebugviewppDebugView++, collects, views, filters your application logs, and highlights information that is important to you!
Stars: ✭ 592 (+397.48%)
RegexrFor composing regular expressions without the need for double-escaping inside strings.
Stars: ✭ 53 (-55.46%)
DjurlSimple yet helpful library for writing Django urls by an easy, short and intuitive way.
Stars: ✭ 85 (-28.57%)
Go RestructureMatch regular expressions into struct fields
Stars: ✭ 570 (+378.99%)
Eval Sql.netSQL Eval Function | Dynamically Evaluate Expression in SQL Server using C# Syntax
Stars: ✭ 84 (-29.41%)
TalismaneNLP framework: sentence detector, tokeniser, pos-tagger and dependency parser
Stars: ✭ 38 (-68.07%)
KagomeSelf-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+365.55%)
Soynlp한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+415.13%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-61.34%)
Automa.jlA julia code generator for regular expressions
Stars: ✭ 111 (-6.72%)
SharpmathA small .NET math library.
Stars: ✭ 36 (-69.75%)
OnigmoOnigmo is a regular expressions library forked from Oniguruma.
Stars: ✭ 536 (+350.42%)
TokenizerA small library for converting tokenized PHP source code into XML (and potentially other formats)
Stars: ✭ 4,770 (+3908.4%)
HippoPHP standards checker.
Stars: ✭ 82 (-31.09%)
Rexrex🦖 Composable JavaScript regular expressions
Stars: ✭ 34 (-71.43%)
Regulex🚧 Regular Expression Excited!
Stars: ✭ 4,877 (+3998.32%)
ChinamobilephonenumberregexRegular expressions that match the mobile phone number in mainland China. / 一组匹配中国大陆手机号码的正则表达式。
Stars: ✭ 4,440 (+3631.09%)
Nlp Js Tools FrenchPOS Tagger, lemmatizer and stemmer for french language in javascript
Stars: ✭ 32 (-73.11%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+268.07%)
Smoothnlp专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Stars: ✭ 435 (+265.55%)
Sentence SplitterText to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-31.09%)
TempreitesOne-file semantic DSL-free templates direto da roça for the browser and server.
Stars: ✭ 31 (-73.95%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+263.87%)
MooOptimised tokenizer/lexer generator! 🐄 Uses /y for performance. Moo.
Stars: ✭ 434 (+264.71%)
Omnicat BayesNaive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-74.79%)
Regexplain🔍 An RStudio addin slash regex utility belt
Stars: ✭ 413 (+247.06%)
RegexA Regular Expression game for Android
Stars: ✭ 80 (-32.77%)
Php Parser🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (+236.13%)
HaeHaE - BurpSuite Highlighter and Extractor
Stars: ✭ 397 (+233.61%)
StringrA fresh approach to string manipulation in R
Stars: ✭ 397 (+233.61%)
Place2liveAnalysis of the characteristics of different countries
Stars: ✭ 30 (-74.79%)
PicomatchBlazing fast and accurate glob matcher written JavaScript, with no dependencies and full support for standard and extended Bash glob features, including braces, extglobs, POSIX brackets, and regular expressions.
Stars: ✭ 393 (+230.25%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-9.24%)
Regexp2A full-featured regex engine in pure Go based on the .NET engine
Stars: ✭ 389 (+226.89%)
RegenTool to generate random strings from Go/RE2 regular expressions (Migrated to https://git.sr.ht/~nilium/regen)
Stars: ✭ 79 (-33.61%)
JflexThe fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+219.33%)
RegexanalyzerRegular Expression Analyzer and Composer for Node.js / XPCOM / Browser Javascript, PHP, Python
Stars: ✭ 29 (-75.63%)
SubconverterUtility to convert between various subscription format
Stars: ✭ 4,912 (+4027.73%)
Minta✳️ Electron app for generating regular expressions
Stars: ✭ 353 (+196.64%)
NanomatchFast, minimal glob matcher for node.js. Similar to micromatch, minimatch and multimatch, but without support for extended globs (extglobs), posix brackets or braces, and with complete Bash 4.3 wildcard support: ("*", "**", and "?").
Stars: ✭ 79 (-33.61%)
LfuzzerFuzzing Parsers with Tokens
Stars: ✭ 28 (-76.47%)
Commit WatcherFind interesting and potentially hazardous commits in git projects
Stars: ✭ 345 (+189.92%)
RegexA sane interface for php's built in preg_* functions
Stars: ✭ 909 (+663.87%)
GenerexA Java library for generating String from a regular expression.
Stars: ✭ 316 (+165.55%)
FrisoHigh performance Chinese tokenizer with both GBK and UTF-8 charset support based on MMSEG algorithm developed by ANSI C. Completely based on modular implementation and can be easily embedded in other programs, like: MySQL, PostgreSQL, PHP, etc.
Stars: ✭ 313 (+163.03%)
To Regex RangePass two numbers, get a regex-compatible source string for matching ranges. Fast compiler, optimized regex, and validated against more than 2.78 million test assertions. Useful for creating regular expressions to validate numbers, ranges, years, etc.
Stars: ✭ 97 (-18.49%)
Code Checker✅ A simple tool to check source code against a set of Nette coding standards.
Stars: ✭ 76 (-36.13%)
Anabelle👸 API documentation generator (JSON-RPC / REST)
Stars: ✭ 20 (-83.19%)
RegexRegular expressions for swift
Stars: ✭ 306 (+157.14%)
SentencesA multilingual command line sentence tokenizer in Golang
Stars: ✭ 293 (+146.22%)
SacremosesPython port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (+146.22%)
Forms Multiplier🔁 Form multiplier & replicator for Nette Framework
Stars: ✭ 11 (-90.76%)
Anymatch‼️ Matches strings against configurable strings, globs, regular expressions, and/or functions
Stars: ✭ 289 (+142.86%)