All Projects → Stringi → Similar Projects or Alternatives

1651 Open source projects that are alternatives of or similar to Stringi

stringx
Drop-in replacements for base R string functions powered by stringi
Stars: ✭ 14 (-93.14%)
Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (-35.29%)
Emoji Regex
A regular expression to match all Emoji-only symbols as per the Unicode Standard.
Stars: ✭ 1,134 (+455.88%)
Mutual labels:  regex, unicode, regexp
r4strings
Handling Strings in R
Stars: ✭ 39 (-80.88%)
Proposal Regexp Unicode Property Escapes
Proposal to add Unicode property escapes `\p{…}` and `\P{…}` to regular expressions in ECMAScript.
Stars: ✭ 112 (-45.1%)
Mutual labels:  regex, unicode, regexp
Xioc
Extract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (-27.45%)
Mutual labels:  regex, text-processing, regexp
Regexpu
A source code transpiler that enables the use of ES2015 Unicode regular expressions in ES5.
Stars: ✭ 201 (-1.47%)
Mutual labels:  regex, unicode, regexp
Text
An efficient packed, immutable Unicode text type for Haskell, with a powerful loop fusion optimization framework.
Stars: ✭ 248 (+21.57%)
Mutual labels:  text, unicode, string-manipulation
Regex Automata
A low level regular expression library that uses deterministic finite automata.
Stars: ✭ 203 (-0.49%)
Mutual labels:  regex, text-processing, regexp
Chr
🔤 Lightweight R package for manipulating [string] characters
Stars: ✭ 18 (-91.18%)
regXwild
⏱ Superfast ^Advanced wildcards++? | Unique algorithms that was implemented on native unmanaged C++ but easily accessible in .NET via Conari (with caching of 0x29 opcodes +optimizations) etc.
Stars: ✭ 20 (-90.2%)
Mutual labels:  text, regex, regexp
subst
Search and des... argh... replace in many files at once. Use regexp and power of Python to replace what you want.
Stars: ✭ 20 (-90.2%)
Mutual labels:  text, regex, regexp
Sherlock
Natural-language event parser for Javascript
Stars: ✭ 393 (+92.65%)
Picomatch
Blazing fast and accurate glob matcher written JavaScript, with no dependencies and full support for standard and extended Bash glob features, including braces, extglobs, POSIX brackets, and regular expressions.
Stars: ✭ 393 (+92.65%)
Mutual labels:  regex, regexp
Portable Utf8
🉑 Portable UTF-8 library - performance optimized (unicode) string functions for php.
Stars: ✭ 405 (+98.53%)
Mutual labels:  unicode, string-manipulation
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+114.71%)
Any Rule
🦕 常用正则大全, 支持web / vscode / idea / Alfred Workflow多平台
Stars: ✭ 5,708 (+2698.04%)
Mutual labels:  regex, regexp
Regulex
🚧 Regular Expression Excited!
Stars: ✭ 4,877 (+2290.69%)
Mutual labels:  regex, regexp
Libfsm
DFA regular expression library & friends
Stars: ✭ 512 (+150.98%)
Mutual labels:  regex, regexp
Commonregex
🍫 A collection of common regular expressions for Go
Stars: ✭ 733 (+259.31%)
Mutual labels:  regex, regexp
Language Modelling
Generating Text using Deep Learning in Python - LSTM, RNN, Keras
Stars: ✭ 38 (-81.37%)
Common Regex
🎃 常用正则表达式 - 收集一些在平时项目开发中经常用到的正则表达式。
Stars: ✭ 2,488 (+1119.61%)
Mutual labels:  regex, regexp
Regexr
For composing regular expressions without the need for double-escaping inside strings.
Stars: ✭ 53 (-74.02%)
Mutual labels:  regex, regexp
Onigmo
Onigmo is a regular expressions library forked from Oniguruma.
Stars: ✭ 536 (+162.75%)
Mutual labels:  regex, regexp
Icu
The new home of the ICU project source code.
Stars: ✭ 1,011 (+395.59%)
Mutual labels:  unicode, icu
Applied Text Mining In Python
Repo for Applied Text Mining in Python (coursera) by University of Michigan
Stars: ✭ 59 (-71.08%)
Mutual labels:  regex, text-processing
Is Glob
If you use globs, this will make your code faster. Returns `true` if the given string looks like a glob pattern or an extglob pattern. This makes it easy to create code that only uses external modules like node-glob when necessary, resulting in much faster code execution and initialization time, and a better user experience. 55+ million downloads.
Stars: ✭ 63 (-69.12%)
Mutual labels:  regex, regexp
Regexparam
A tiny (308B) utility that converts route patterns into RegExp. Limited alternative to `path-to-regexp` 🙇‍♂️
Stars: ✭ 390 (+91.18%)
Mutual labels:  regex, regexp
Regexp2
A full-featured regex engine in pure Go based on the .NET engine
Stars: ✭ 389 (+90.69%)
Mutual labels:  regex, regexp
Nlp
[UNMANTEINED] Extract values from strings and fill your structs with nlp.
Stars: ✭ 367 (+79.9%)
Rust Unic
UNIC: Unicode and Internationalization Crates for Rust
Stars: ✭ 189 (-7.35%)
Mutual labels:  unicode, text-processing
Pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+108.82%)
Matchzoo Py
Facilitating the design, comparison and sharing of deep text matching models.
Stars: ✭ 362 (+77.45%)
Learn Regex Zh
🇨🇳 翻译: 学习正则表达式的简单方法
Stars: ✭ 1,772 (+768.63%)
Mutual labels:  regex, regexp
Nlp Recipes
Natural Language Processing Best Practices & Examples
Stars: ✭ 5,783 (+2734.8%)
Guitar
A Cross-Platform String and Regular Expression Library written in Swift.
Stars: ✭ 641 (+214.22%)
Mutual labels:  regex, string-manipulation
Ugrep
🔍NEW ugrep v3.1: ultra fast grep with interactive query UI and fuzzy search: search file systems, source code, text, binary files, archives (cpio/tar/pax/zip), compressed files (gz/Z/bz2/lzma/xz/lz4), documents and more. A faster, user-friendly and compatible grep replacement.
Stars: ✭ 626 (+206.86%)
Mutual labels:  regex, unicode
Cogcomp Nlpy
CogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-43.63%)
Nlp Pretrained Model
A collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-40.2%)
Lingua Franca
Mycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (-75%)
Phobos
The standard library of the D programming language
Stars: ✭ 1,038 (+408.82%)
Mutual labels:  regex, unicode
Artificial Adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+70.59%)
Mutual labels:  text, text-processing
Guide To Swift Strings Sample Code
Xcode Playground Sample Code for the Flight School Guide to Swift Strings
Stars: ✭ 136 (-33.33%)
Mutual labels:  regex, unicode
Command Line Text Processing
⚡ From finding text to search and replace, from sorting to beautifying text and more 🎨
Stars: ✭ 9,771 (+4689.71%)
Mutual labels:  regex, text-processing
Js Codepage
💱 Codepages for JS
Stars: ✭ 119 (-41.67%)
Mutual labels:  text, unicode
Orchestra
One language to be RegExp's Successor. Visually readable and rich, technically safe and extended, naturally scalable, advanced, and optimized
Stars: ✭ 103 (-49.51%)
Mutual labels:  regex, regexp
Konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-36.27%)
Stanza Old
Stanford NLP group's shared Python tools.
Stars: ✭ 142 (-30.39%)
Regex Dos
👮 👊 RegEx Denial of Service (ReDos) Scanner
Stars: ✭ 143 (-29.9%)
Mutual labels:  regex, regexp
Prenlp
Preprocessing Library for Natural Language Processing
Stars: ✭ 130 (-36.27%)
Youtube Regex
Best YouTube Video ID regex. Online: https://regex101.com/r/rN1qR5/2 and http://regexr.com/3anm9
Stars: ✭ 87 (-57.35%)
Mutual labels:  regex, regexp
Grex
A command-line tool and library for generating regular expressions from user-provided test cases
Stars: ✭ 4,847 (+2275.98%)
Mutual labels:  regex, regexp
Textwrap
An efficient and powerful Rust library for word wrapping text.
Stars: ✭ 164 (-19.61%)
Mutual labels:  text, unicode
Voca rs
Voca_rs is the ultimate Rust string library inspired by Voca.js, string.py and Inflector, implemented as independent functions and on Foreign Types (String and str).
Stars: ✭ 167 (-18.14%)
Mutual labels:  unicode, string-manipulation
Stringz
💯 Super fast unicode-aware string manipulation Javascript library
Stars: ✭ 181 (-11.27%)
Mutual labels:  unicode, string-manipulation
Nlpre
Python library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (-22.55%)
Textvec
Text vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-18.14%)
Regex Benchmark
It's just a simple regex benchmark of different programming languages.
Stars: ✭ 171 (-16.18%)
Mutual labels:  regex, regexp
Fastnlp
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+1096.57%)
Tiny Utf8
Unicode (UTF-8) capable std::string
Stars: ✭ 322 (+57.84%)
Mutual labels:  unicode, string-manipulation
1-60 of 1651 similar projects