All Projects → DissectMalware → Pysamesame

DissectMalware / Pysamesame

Licence: apache-2.0
This is a python version of samesame repo to generate homograph strings

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Pysamesame

Ecoji
Encodes (and decodes) data as emojis
Stars: ✭ 671 (+3255%)
Mutual labels:  unicode
Unicopy
Unicode command-line codepoint dumper
Stars: ✭ 16 (-20%)
Mutual labels:  unicode
Idutf8lib
Idiot's UTF-8 Library
Stars: ✭ 12 (-40%)
Mutual labels:  unicode
String
Provides an object-oriented API to strings and deals with bytes, UTF-8 code points and grapheme clusters in a unified way.
Stars: ✭ 709 (+3445%)
Mutual labels:  unicode
Unicode Types
Basic Unicode Types of a Ruby String
Stars: ✭ 5 (-75%)
Mutual labels:  unicode
Nim Unicodedb
Unicode Character Database (UCD, tr44) for Nim
Stars: ✭ 19 (-5%)
Mutual labels:  unicode
Urlify
A fast PHP slug generator and transliteration library that converts non-ascii characters for use in URLs.
Stars: ✭ 633 (+3065%)
Mutual labels:  unicode
Utf8.h
📚 single header utf8 string functions for C and C++
Stars: ✭ 875 (+4275%)
Mutual labels:  unicode
Pragmatapro
PragmataPro font is designed to help pros to work better
Stars: ✭ 887 (+4335%)
Mutual labels:  unicode
Git Praise
A nicer git blame.
Stars: ✭ 24 (+20%)
Mutual labels:  unicode
Unicodeplots.jl
Unicode-based scientific plotting for working in the terminal
Stars: ✭ 724 (+3520%)
Mutual labels:  unicode
Zws
Shorten URLs using invisible spaces.
Stars: ✭ 780 (+3800%)
Mutual labels:  unicode
Lexical Sort
Sort Unicode strings lexicographically
Stars: ✭ 23 (+15%)
Mutual labels:  unicode
Awesome Unicode
😂 👌 A curated list of delightful Unicode tidbits, packages and resources.
Stars: ✭ 693 (+3365%)
Mutual labels:  unicode
Crx Jtrans
jTransliter - the roman to unicode transliter as Google chrome extension
Stars: ✭ 13 (-35%)
Mutual labels:  unicode
Uni
Query the Unicode database from the commandline, with good support for emojis
Stars: ✭ 633 (+3065%)
Mutual labels:  unicode
Nepali Romanized Pro
Nepali Romanized Keyboard Layout with installer for macOS
Stars: ✭ 18 (-10%)
Mutual labels:  unicode
Unicode 9.0.0
JavaScript-compatible Unicode data. Arrays of code points, arrays of symbols, and regular expressions for Unicode v9.0.0’s categories, scripts, blocks, bidi, and other properties.
Stars: ✭ 15 (-25%)
Mutual labels:  unicode
Myanmar Unicode Fonts
Fonts preview for list of Myanmar Unicode fonts
Stars: ✭ 14 (-30%)
Mutual labels:  unicode
Alfred Unicode
Preview Unicode characters and emoji in Alfred
Stars: ✭ 23 (+15%)
Mutual labels:  unicode

Introduction to PySameSame

This is a python port of samesame utility developed by @TheTarquin. It also offers a few more features that deemed to be useful.

By using PySameSame, you can:

  1. replace ASCII characters with homograph (look-alike) characters in a given string
homoglyph = english_confusables.get_homograph("this is for test")

  1. obtain all of the potential ASCII representations of a given Unicode homoglyph.
ascii_reprs = english_confusables.convert_to_ascii("𝐈𝟎𝐈𝔅١𝔑S",ignore_case=True)

  1. obtain an HTML table representing the internal mappings that is used by PySameSame
mappings = english_confusables.generate_table()
  1. manually select a Unicode homoglyph for each English Alphabet and use the resulted mapping to transform a text using a web interface.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].