All Projects → janlelis → Uniscribe

janlelis / Uniscribe

Licence: mit
Know your Unicode ✀

Programming Languages

ruby
36898 projects - #4 most used programming language

Projects that are alternatives of or similar to Uniscribe

Unibits
Visualize different Unicode encodings in the terminal
Stars: ✭ 125 (-53.01%)
Mutual labels:  cli-command, debugging-tool, unicode
Unicopy
Unicode command-line codepoint dumper
Stars: ✭ 16 (-93.98%)
Mutual labels:  cli-command, unicode
ripdb
Remotely accessible IPython-enabled debugger
Stars: ✭ 21 (-92.11%)
Mutual labels:  debugging-tool
Objectdiff.js
Compares JavaScript objects
Stars: ✭ 253 (-4.89%)
Mutual labels:  debugging-tool
torisetsu
Torisetsu - Add README.md Template.
Stars: ✭ 20 (-92.48%)
Mutual labels:  cli-command
text-rendering-tests
Unicode’s test suite for text rendering engines
Stars: ✭ 135 (-49.25%)
Mutual labels:  unicode
core
The XP Framework is an all-purpose, object oriented PHP framework.
Stars: ✭ 13 (-95.11%)
Mutual labels:  unicode
unicode-c
A C library for handling Unicode, UTF-8, surrogate pairs, etc.
Stars: ✭ 32 (-87.97%)
Mutual labels:  unicode
Runtimeunityeditor
In-game inspector and debugging tools for applications made with Unity3D game engine
Stars: ✭ 254 (-4.51%)
Mutual labels:  debugging-tool
quran-data
Unicode-encoded Quran data
Stars: ✭ 67 (-74.81%)
Mutual labels:  unicode
no-facebook-emoji
Get rid of those ugly emojis now! [stopped working 😢]
Stars: ✭ 15 (-94.36%)
Mutual labels:  unicode
jquery-manager
Manage jQuery and jQuery Migrate on a WordPress website, activate a specific jQuery and/or jQuery Migrate version. The ultimate jQuery debugging tool for WordPress
Stars: ✭ 27 (-89.85%)
Mutual labels:  debugging-tool
guide-charles-proxy
Charles - Web Debugging Proxy Application. I want to share my experiences when I worked with Charles. It is such an amazing application for debugging and testing the presentation of UI when trying different set of data. Hope you guys will master Charles after reading this section. Let’s find out! 🖍
Stars: ✭ 22 (-91.73%)
Mutual labels:  debugging-tool
simdutf
Unicode routines (UTF8, UTF16): billions of characters per second.
Stars: ✭ 108 (-59.4%)
Mutual labels:  unicode
libWinTF8
The library handling things related to UTF-8 and Unicode when you want to port your program to Windows
Stars: ✭ 18 (-93.23%)
Mutual labels:  unicode
Go Edlib
Golang string comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...
Stars: ✭ 253 (-4.89%)
Mutual labels:  unicode
widestring-rs
A wide string Rust library for converting to and from wide Unicode strings.
Stars: ✭ 48 (-81.95%)
Mutual labels:  unicode
b2a
btoa and atob support for node.js or old browsers, with the Unicode Problems fixed
Stars: ✭ 21 (-92.11%)
Mutual labels:  unicode
phpdebugbar
PSR-15 middleware for PHP Debug bar
Stars: ✭ 64 (-75.94%)
Mutual labels:  debugging-tool
Watchpoints
watchpoints is an easy-to-use, intuitive variable/object monitor tool for python that behaves similar to watchpoints in gdb.
Stars: ✭ 261 (-1.88%)
Mutual labels:  debugging-tool

uniscribe | Describe the Unicode [version] [ci]

Describes Unicode characters with their name and shows compositions.

  • Helps you understand how glyphs and codepoints are structured within the data
  • Gives you the names of glyphs and codepoints, which can be used for further research
  • Highlights invalid/special/blank codepoints

Uses a similar color coding like its lower-level companion tool unibits.

Setup

Make sure you have Ruby installed and installing gems works properly. Then do:

$ gem install uniscribe

Usage

Pass the string to debug to uniscribe:

From CLI

$ uniscribe "test strı̈ng"

From Ruby

require "uniscribe/kernel_method"
uniscribe "test strı̈ng"

Output


0074 ├─ t		├─ LATIN SMALL LETTER T
0065 ├─ e		├─ LATIN SMALL LETTER E
0073 ├─ s		├─ LATIN SMALL LETTER S
0074 ├─ t		├─ LATIN SMALL LETTER T
0020 ├─ ] [		├─ SPACE
0073 ├─ s		├─ LATIN SMALL LETTER S
0074 ├─ t		├─ LATIN SMALL LETTER T
0072 ├─ r		├─ LATIN SMALL LETTER R
---- ├┬ ı̈		├┬ Composition
0131 │├─ ı		│├─ LATIN SMALL LETTER DOTLESS I
0308 │└─ ◌̈		│└─ COMBINING DIAERESIS
006E ├─ n		├─ LATIN SMALL LETTER N
0067 ├─ g		├─ LATIN SMALL LETTER G

Examples

Tamil

>> uniscribe "நகரத்தில்"

Screenshot Tamil

Thai

>> uniscribe "ม้าลายหกตัว"

Screenshot Thai

Ideographic Variations

>> uniscribe "辻󠄀㚑󠄁"

Screenshot Ideographic Variations

(the variation is not visible in the screenshot, because my system does not render it correctly)

Emoji Sequences

>> uniscribe "3️⃣🤸‍♀"

Screenshot Emoji

Lots of Combining Marks

>> uniscribe "̶̧̨̱̹̭̯ͧ̾ͬC̷̙̲̝͖ͭ̏ͥͮ͟Oͮ͏̮̪̝͍"

Screenshot Marks

Random Sequences of some Special Unicode Codepoints

>> uniscribe "\0A\u{E01D7}\x7F\r\n\u{D0000}\u{81}\u{FFF9}B\u{FFFB}🏴\u{E0061}\u{E007F}\u{10FFFF}"

Screenshot Strange

Some Blanks

>> uniscribe "­ᅠ 𝅸"

Screenshot Blanks

Notes

Although the gem is generally up to date with Unicode 12.1, the proper detection of compositions / graphemes / combined characters depends on your Ruby version:

You can run uniscribe -v to check for the Unicode level of your uniscribe version.

Also see

Copyright (C) 2017-2020 Jan Lelis https://janlelis.com. Released under the MIT license.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].