stringxDrop-in replacements for base R string functions powered by stringi
Stars: ✭ 14 (-93.14%)
TokenizerFast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (-35.29%)
Emoji RegexA regular expression to match all Emoji-only symbols as per the Unicode Standard.
Stars: ✭ 1,134 (+455.88%)
r4stringsHandling Strings in R
Stars: ✭ 39 (-80.88%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (-27.45%)
RegexpuA source code transpiler that enables the use of ES2015 Unicode regular expressions in ES5.
Stars: ✭ 201 (-1.47%)
TextAn efficient packed, immutable Unicode text type for Haskell, with a powerful loop fusion optimization framework.
Stars: ✭ 248 (+21.57%)
Regex AutomataA low level regular expression library that uses deterministic finite automata.
Stars: ✭ 203 (-0.49%)
Chr🔤 Lightweight R package for manipulating [string] characters
Stars: ✭ 18 (-91.18%)
regXwild⏱ Superfast ^Advanced wildcards++? | Unique algorithms that was implemented on native unmanaged C++ but easily accessible in .NET via Conari (with caching of 0x29 opcodes +optimizations) etc.
Stars: ✭ 20 (-90.2%)
substSearch and des... argh... replace in many files at once. Use regexp and power of Python to replace what you want.
Stars: ✭ 20 (-90.2%)
SherlockNatural-language event parser for Javascript
Stars: ✭ 393 (+92.65%)
PicomatchBlazing fast and accurate glob matcher written JavaScript, with no dependencies and full support for standard and extended Bash glob features, including braces, extglobs, POSIX brackets, and regular expressions.
Stars: ✭ 393 (+92.65%)
Portable Utf8🉑 Portable UTF-8 library - performance optimized (unicode) string functions for php.
Stars: ✭ 405 (+98.53%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+114.71%)
Any Rule🦕 常用正则大全, 支持web / vscode / idea / Alfred Workflow多平台
Stars: ✭ 5,708 (+2698.04%)
Regulex🚧 Regular Expression Excited!
Stars: ✭ 4,877 (+2290.69%)
LibfsmDFA regular expression library & friends
Stars: ✭ 512 (+150.98%)
Commonregex🍫 A collection of common regular expressions for Go
Stars: ✭ 733 (+259.31%)
Language ModellingGenerating Text using Deep Learning in Python - LSTM, RNN, Keras
Stars: ✭ 38 (-81.37%)
Common Regex🎃 常用正则表达式 - 收集一些在平时项目开发中经常用到的正则表达式。
Stars: ✭ 2,488 (+1119.61%)
RegexrFor composing regular expressions without the need for double-escaping inside strings.
Stars: ✭ 53 (-74.02%)
OnigmoOnigmo is a regular expressions library forked from Oniguruma.
Stars: ✭ 536 (+162.75%)
IcuThe new home of the ICU project source code.
Stars: ✭ 1,011 (+395.59%)
Is GlobIf you use globs, this will make your code faster. Returns `true` if the given string looks like a glob pattern or an extglob pattern. This makes it easy to create code that only uses external modules like node-glob when necessary, resulting in much faster code execution and initialization time, and a better user experience. 55+ million downloads.
Stars: ✭ 63 (-69.12%)
RegexparamA tiny (308B) utility that converts route patterns into RegExp. Limited alternative to `path-to-regexp` 🙇♂️
Stars: ✭ 390 (+91.18%)
Regexp2A full-featured regex engine in pure Go based on the .NET engine
Stars: ✭ 389 (+90.69%)
Nlp[UNMANTEINED] Extract values from strings and fill your structs with nlp.
Stars: ✭ 367 (+79.9%)
Rust UnicUNIC: Unicode and Internationalization Crates for Rust
Stars: ✭ 189 (-7.35%)
PynlplPyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+108.82%)
Matchzoo PyFacilitating the design, comparison and sharing of deep text matching models.
Stars: ✭ 362 (+77.45%)
Nlp RecipesNatural Language Processing Best Practices & Examples
Stars: ✭ 5,783 (+2734.8%)
GuitarA Cross-Platform String and Regular Expression Library written in Swift.
Stars: ✭ 641 (+214.22%)
Ugrep🔍NEW ugrep v3.1: ultra fast grep with interactive query UI and fuzzy search: search file systems, source code, text, binary files, archives (cpio/tar/pax/zip), compressed files (gz/Z/bz2/lzma/xz/lz4), documents and more. A faster, user-friendly and compatible grep replacement.
Stars: ✭ 626 (+206.86%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-43.63%)
Nlp Pretrained ModelA collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-40.2%)
Lingua FrancaMycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (-75%)
PhobosThe standard library of the D programming language
Stars: ✭ 1,038 (+408.82%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+70.59%)
Command Line Text Processing⚡ From finding text to search and replace, from sorting to beautifying text and more 🎨
Stars: ✭ 9,771 (+4689.71%)
OrchestraOne language to be RegExp's Successor. Visually readable and rich, technically safe and extended, naturally scalable, advanced, and optimized
Stars: ✭ 103 (-49.51%)
Konoha🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-36.27%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (-30.39%)
Regex Dos👮 👊 RegEx Denial of Service (ReDos) Scanner
Stars: ✭ 143 (-29.9%)
PrenlpPreprocessing Library for Natural Language Processing
Stars: ✭ 130 (-36.27%)
Youtube RegexBest YouTube Video ID regex. Online: https://regex101.com/r/rN1qR5/2 and http://regexr.com/3anm9
Stars: ✭ 87 (-57.35%)
GrexA command-line tool and library for generating regular expressions from user-provided test cases
Stars: ✭ 4,847 (+2275.98%)
TextwrapAn efficient and powerful Rust library for word wrapping text.
Stars: ✭ 164 (-19.61%)
Voca rsVoca_rs is the ultimate Rust string library inspired by Voca.js, string.py and Inflector, implemented as independent functions and on Foreign Types (String and str).
Stars: ✭ 167 (-18.14%)
Stringz💯 Super fast unicode-aware string manipulation Javascript library
Stars: ✭ 181 (-11.27%)
NlprePython library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (-22.55%)
TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-18.14%)
Regex BenchmarkIt's just a simple regex benchmark of different programming languages.
Stars: ✭ 171 (-16.18%)
FastnlpfastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+1096.57%)
Tiny Utf8Unicode (UTF-8) capable std::string
Stars: ✭ 322 (+57.84%)