vhf / Confusable_homoglyphs

Licence: mit
ϲοnfuѕаblе_һοmоɡlyphs

Projects that are alternatives of or similar to Confusable homoglyphs

Plotille
Plot in the terminal using braille dots.
Stars: ✭ 99 (-23.85%)
Mutual labels:  unicode
Uniseg
Unicode Text Segmentation for Go (or: How to Count Characters in a String)
Stars: ✭ 113 (-13.08%)
Mutual labels:  unicode
Js Codepage
💱 Codepages for JS
Stars: ✭ 119 (-8.46%)
Mutual labels:  unicode
Nonce Disrespect
Nonce-Disrespecting Adversaries: Practical Forgery Attacks on GCM in TLS
Stars: ✭ 103 (-20.77%)
Mutual labels:  attack
Misp Maltego
Set of Maltego transforms to interface with a MISP Threat Sharing instance, and also to explore the whole MITRE ATT&CK dataset.
Stars: ✭ 112 (-13.85%)
Mutual labels:  attack
Jlsca
Side-channel toolkit in Julia
Stars: ✭ 114 (-12.31%)
Mutual labels:  attack
Pythonimproved
The best Python language definition for Sublime Text - ever. Includes full support for Unicode, as well as both Python 2 and Python 3 syntax. Check out the Neon Color Scheme for highlighting.
Stars: ✭ 95 (-26.92%)
Mutual labels:  unicode
Free adv train
Official TensorFlow Implementation of Adversarial Training for Free! which trains robust models at no extra cost compared to natural training.
Stars: ✭ 127 (-2.31%)
Mutual labels:  attack
Tendo
Official repository of python tendo library, always welcoming new contributions.
Stars: ✭ 113 (-13.08%)
Mutual labels:  unicode
Box Cli Maker
Make Highly Customized Boxes for your CLI
Stars: ✭ 115 (-11.54%)
Mutual labels:  unicode
Active Directory Exploitation Cheat Sheet
A cheat sheet that contains common enumeration and attack methods for Windows Active Directory.
Stars: ✭ 1,392 (+970.77%)
Mutual labels:  attack
Proposal Regexp Unicode Property Escapes
Proposal to add Unicode property escapes `\p{…}` and `\P{…}` to regular expressions in ECMAScript.
Stars: ✭ 112 (-13.85%)
Mutual labels:  unicode
Chaos
Proof of concept, general purpose pastejacker for GNU/Linux
Stars: ✭ 115 (-11.54%)
Mutual labels:  unicode
Hybrid Fonts
Monospaced fonts patched with Chinese characters and extra glyphs from Nerd Fonts
Stars: ✭ 102 (-21.54%)
Mutual labels:  unicode
Ratel
RAT-el is an open source penetration test tool that allows you to take control of a windows machine. It works on the client-server model, the server sends commands and the client executes the commands and sends the result back to the server. The client is completely undetectable by anti-virus software.
Stars: ✭ 121 (-6.92%)
Mutual labels:  unicode
Unicode Display width
Monospace Unicode character width in Ruby
Stars: ✭ 98 (-24.62%)
Mutual labels:  unicode
Nailgun
Nailgun attack on ARM devices.
Stars: ✭ 114 (-12.31%)
Mutual labels:  attack
Ansiweather
Weather in terminal, with ANSI colors and Unicode symbols
Stars: ✭ 1,663 (+1179.23%)
Mutual labels:  unicode
Unibits
Visualize different Unicode encodings in the terminal
Stars: ✭ 125 (-3.85%)
Mutual labels:  unicode
Cheatsheet
Pretty cheat sheets, or ``reference cards'', obtainable from Org files.
Stars: ✭ 116 (-10.77%)
Mutual labels:  unicode

confusable_homoglyphs `[doc] <http://confusable-homoglyphs.readthedocs.io/en/latest/>`__

.. image:: https://img.shields.io/travis/vhf/confusable_homoglyphs.svg
   :target: https://travis-ci.org/vhf/confusable_homoglyphs

.. image:: https://img.shields.io/pypi/v/confusable_homoglyphs.svg
   :target: https://pypi.python.org/pypi/confusable_homoglyphs

.. image:: https://readthedocs.org/projects/confusable_homoglyphs/badge/?version=latest
   :target: http://confusable-homoglyphs.readthedocs.io/en/latest/
   :alt: Documentation Status

A homoglyph is "one of two or more graphemes, characters, or glyphs with shapes that appear identical or very similar" (`wikipedia:Homoglyph <https://en.wikipedia.org/wiki/Homoglyph>`__).

Unicode homoglyphs can be a nuisance on the web. Your most popular client, AlaskaJazz, might be upset to be impersonated by a trickster who deliberately chose the username ΑlaskaJazz.

  • AlaskaJazz is single-script: it contains only Latin characters.
  • ΑlaskaJazz is mixed-script: its first character is a Greek letter.
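The distinction above can be approximated with the standard library alone. The sketch below is not this library's implementation: it assumes that the first word of a character's Unicode name (``LATIN``, ``GREEK``, ``CYRILLIC``, ...) is a good enough stand-in for its script alias, which holds for common letters but not for every code point. The helper names ``script_aliases`` and ``is_mixed_script`` are chosen for illustration only.

```python
import unicodedata


def script_aliases(text):
    """Collect a script-like label for every letter in ``text``.

    Assumption: the first word of the Unicode character name
    approximates the script alias. This is a heuristic, not the
    Scripts.txt lookup the library itself relies on.
    """
    aliases = set()
    for ch in text:
        if not ch.isalpha():
            continue  # skip digits, punctuation, whitespace
        name = unicodedata.name(ch, "")
        if name:
            aliases.add(name.split()[0])
    return aliases


def is_mixed_script(text):
    """True when the letters of ``text`` come from more than one script."""
    return len(script_aliases(text)) > 1


latin_name = "AlaskaJazz"
spoofed_name = "\u0391laskaJazz"  # first letter is GREEK CAPITAL LETTER ALPHA

print(script_aliases(latin_name), is_mixed_script(latin_name))
print(script_aliases(spoofed_name), is_mixed_script(spoofed_name))
```

Writing the spoofed name with an explicit ``\u0391`` escape keeps the example readable in editors whose fonts render both letters identically.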

You might also want to prevent people from being tricked into entering their password on www.microsоft.com or www.faϲebook.com instead of www.microsoft.com or www.facebook.com. `Here is a utility <http://unicode.org/cldr/utility/confusables.jsp>`__ to play with these confusable homoglyphs.

Not all mixed-script strings have to be ruled out, though. You could exclude only those mixed-script strings that contain characters which might be confused with a character from the Unicode blocks of your choosing.

  • Allo and ρττ are fine: each is single-script.
  • AlloΓ is fine when our preferred script alias is 'latin': it is mixed-script, but Γ is not confusable.
  • Alloρ is dangerous: it is mixed-script, and ρ could be confused with p.
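The rule illustrated by these three cases can be sketched as follows. This is a simplified model, not the library's code: ``CONFUSABLE_WITH`` is a tiny hand-written table used only for illustration (the real data comes from the Unicode Consortium's confusables file), the script alias is again approximated by the first word of the Unicode character name, and ``looks_dangerous`` is a hypothetical helper name.

```python
import unicodedata

# Hypothetical, hand-picked excerpt for illustration only.
# The complete mapping lives in the Unicode confusables data.
CONFUSABLE_WITH = {
    "\u03c1": ["p"],  # GREEK SMALL LETTER RHO resembles LATIN SMALL LETTER P
    "\u0391": ["A"],  # GREEK CAPITAL LETTER ALPHA resembles LATIN CAPITAL LETTER A
    "\u043e": ["o"],  # CYRILLIC SMALL LETTER O resembles LATIN SMALL LETTER O
}


def alias(ch):
    """Approximate a script alias by the first word of the Unicode name."""
    name = unicodedata.name(ch, "")
    return name.split()[0] if name else ""


def looks_dangerous(text, preferred_alias="LATIN"):
    """Flag mixed-script text containing a look-alike of a preferred-script letter."""
    letters = [ch for ch in text if ch.isalpha()]
    if len({alias(ch) for ch in letters}) <= 1:
        return False  # single-script strings are accepted as-is
    for ch in letters:
        if alias(ch) == preferred_alias:
            continue  # letters already in the preferred script are fine
        for lookalike in CONFUSABLE_WITH.get(ch, []):
            if alias(lookalike) == preferred_alias:
                return True
    return False


for candidate in ["Allo", "\u03c1\u03c4\u03c4", "Allo\u0393", "Allo\u03c1"]:
    print(repr(candidate), looks_dangerous(candidate))
```

The two-step structure mirrors the prose: a string is only examined for look-alikes once it is known to be mixed-script, so legitimate single-script names in any language pass untouched.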

This library is compatible with Python 2 and Python 3.

`API documentation <http://confusable-homoglyphs.readthedocs.io/en/latest/apidocumentation.html>`__

Is the data up to date?

Yep.

The Unicode block aliases and names for each character are extracted from `this file <http://www.unicode.org/Public/UNIDATA/Scripts.txt>`__ provided by the Unicode Consortium.

The matrix of which character can be confused with which other characters is built using `this file <http://www.unicode.org/Public/security/latest/confusables.txt>`__ provided by the Unicode Consortium.

This data is stored in two JSON files: ``categories.json`` and ``confusables.json``. If you delete them, both are recreated by downloading and parsing the two files mentioned above, and the results are stored as JSON again.
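To give an idea of what the parsing step involves, here is a minimal sketch that turns lines in the ``confusables.txt`` layout (semicolon-separated fields, ``#`` comments) into a JSON-serialisable mapping. It is not the library's own parser: the ``SAMPLE`` lines are a short excerpt written out by hand so the example runs offline, ``parse_confusables`` is a hypothetical helper, and the resulting dictionary layout is an assumption that does not necessarily match the structure of the library's ``confusables.json``.

```python
import json

# Sample lines in the confusables.txt layout: source ; target ; type # comment
SAMPLE = (
    "# hand-written excerpt, for illustration only\n"
    "03C1 ;\t0070 ;\tMA\t# GREEK SMALL LETTER RHO -> LATIN SMALL LETTER P\n"
    "0430 ;\t0061 ;\tMA\t# CYRILLIC SMALL LETTER A -> LATIN SMALL LETTER A\n"
)


def decode(field):
    """Turn a space-separated list of hex code points into a string."""
    return "".join(chr(int(cp, 16)) for cp in field.split())


def parse_confusables(lines):
    """Build a symmetric {character: [look-alikes]} table from data lines."""
    table = {}
    for line in lines:
        data = line.split("#", 1)[0].strip()  # drop trailing comment
        fields = [f.strip() for f in data.split(";")]
        if len(fields) < 2 or not fields[0] or not fields[1]:
            continue  # blank line, comment-only line, or header
        source, target = decode(fields[0]), decode(fields[1])
        table.setdefault(source, []).append(target)
        table.setdefault(target, []).append(source)
    return table


table = parse_confusables(SAMPLE.splitlines())
serialised = json.dumps(table, sort_keys=True)  # what a cache file would hold
print(serialised)
```

Storing the parsed result as JSON means the comparatively slow download-and-parse step only has to run when the cached file is missing.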
