gagolews / stringx

Licence: other
Drop-in replacements for base R string functions powered by stringi

Programming Languages

r
HTML
CSS
Makefile
python
shell

Projects that are alternatives of or similar to stringx

Stringi
THE String Processing Package for R (with ICU)
Stars: ✭ 204 (+1357.14%)
Mutual labels:  unicode, text, icu, regex, regexp, string-manipulation, text-processing
Chr
🔀 Lightweight R package for manipulating [string] characters
Stars: ✭ 18 (+28.57%)
Mutual labels:  regex, string-manipulation, text-processing
Xioc
Extract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+957.14%)
Mutual labels:  regex, regexp, text-processing
subst
Search and des... argh... replace in many files at once. Use regexp and power of Python to replace what you want.
Stars: ✭ 20 (+42.86%)
Mutual labels:  text, regex, regexp
regXwild
⏱ Superfast ^Advanced wildcards++? | Unique algorithms that was implemented on native unmanaged C++ but easily accessible in .NET via Conari (with caching of 0x29 opcodes +optimizations) etc.
Stars: ✭ 20 (+42.86%)
Mutual labels:  text, regex, regexp
Regexpu
A source code transpiler that enables the use of ES2015 Unicode regular expressions in ES5.
Stars: ✭ 201 (+1335.71%)
Mutual labels:  unicode, regex, regexp
Regex Automata
A low level regular expression library that uses deterministic finite automata.
Stars: ✭ 203 (+1350%)
Mutual labels:  regex, regexp, text-processing
icu-dotnet
C# wrapper for ICU4C
Stars: ✭ 48 (+242.86%)
Mutual labels:  unicode, icu, icu4c
Proposal Regexp Unicode Property Escapes
Proposal to add Unicode property escapes `\p{…}` and `\P{…}` to regular expressions in ECMAScript.
Stars: ✭ 112 (+700%)
Mutual labels:  unicode, regex, regexp
Emoji Regex
A regular expression to match all Emoji-only symbols as per the Unicode Standard.
Stars: ✭ 1,134 (+8000%)
Mutual labels:  unicode, regex, regexp
Text
An efficient packed, immutable Unicode text type for Haskell, with a powerful loop fusion optimization framework.
Stars: ✭ 248 (+1671.43%)
Mutual labels:  unicode, text, string-manipulation
r4strings
Handling Strings in R
Stars: ✭ 39 (+178.57%)
Mutual labels:  regex, string-manipulation, text-processing
Stringy
🉑 Stringy - A PHP string manipulation library with multibyte support, performance optimized
Stars: ✭ 135 (+864.29%)
Mutual labels:  unicode, string-manipulation
html-comment-regex
Regular expression for matching HTML comments
Stars: ✭ 15 (+7.14%)
Mutual labels:  text, regexp
RegExp-Learning
Learning regular expressions
Stars: ✭ 30 (+114.29%)
Mutual labels:  regex, regexp
cregex
A small implementation of regular expression matching engine in C
Stars: ✭ 72 (+414.29%)
Mutual labels:  regex, regexp
regexp-expand
Show the ELisp regular expression at point in rx form.
Stars: ✭ 18 (+28.57%)
Mutual labels:  regex, regexp
IronRure
.NET Bindings to the Rust Regex Crate
Stars: ✭ 16 (+14.29%)
Mutual labels:  regex, regexp
icu-swift
Swift APIs for ICU
Stars: ✭ 23 (+64.29%)
Mutual labels:  unicode, icu
is-regex
Is this value a JS regex?
Stars: ✭ 22 (+57.14%)
Mutual labels:  regex, regexp

stringx

English is the native language of only 5% of the world's population, and only 17% of us can understand this text. Moreover, the Latin alphabet is the main script for merely 36% of the total. The early computer era, now a very long time ago, was dominated by the US. With the proliferation of the internet, smartphones, social media, and other technologies, this is no longer the case. This package replaces base R string functions with ones that fully support the Unicode standards related to natural language and date-time processing. Thanks to ICU (International Components for Unicode) and stringi, they are fast, reliable, and portable across different platforms.

stringx replaces base R functions like paste, gregexpr, tolower, and strptime with ones that:

  • work in the same way on every platform,
  • support a wide range of languages and scripts,
  • fix some long-standing inconsistencies in base R (which are pointed out in the package manual).
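
As a rough illustration of the "drop-in" idea, the sketch below calls two of the replacements with the same arguments one would pass to their base R counterparts. It assumes stringx is installed from CRAN and uses explicit `stringx::` prefixes so that nothing in base R is masked; the sample strings and variable names are made up for the example, and the package manual remains the authority on exact behaviour.

```r
# Minimal sketch, assuming the stringx package is installed.
# The sample data and variable names below are illustrative only.
if (requireNamespace("stringx", quietly = TRUE)) {
    x <- c("stra\u00dfe", "GRO\u00df")  # German words containing a sharp s

    # Same call shape as base::toupper(); case mapping is delegated to
    # stringi/ICU, so the result should not depend on the platform.
    upper <- stringx::toupper(x)

    # Same call shape as base::paste().
    joined <- stringx::paste("a", 1:3, sep = "-")

    print(upper)
    print(joined)
}
```

Attaching the package with `library(stringx)` instead would let existing code pick up the replacements without the prefix, at the cost of masking the base functions for the rest of the session.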

stringx is a set of wrappers around stringi, a mature R package for fast, consistent, convenient, and portable string/text/natural language processing in any locale. stringi, in turn, relies on ICU (International Components for Unicode).


Package Maintainer and Author: Marek Gagolewski

Homepage: https://stringx.gagolewski.com

CRAN Entry: https://cran.r-project.org/package=stringx

License: stringx is distributed under the terms of the GNU General Public License, either Version 2 or Version 3, see LICENSE.

Changelog: see NEWS.
