All Projects → nalgeon → iuliia

nalgeon / iuliia

Licence: MIT license
Transliterate Cyrillic → Latin in every possible way

Projects that are alternatives of or similar to iuliia

Hanbaobao
Mandarin Chinese text segmentation and mobile dictionary Android app (中文分词)
Stars: ✭ 17 (-72.58%)
Mutual labels:  transliteration
Cyrillic To Translit Js
Ultra-lightweight JavaScript library for converting Cyrillic symbols to Translit and vice versa
Stars: ✭ 91 (+46.77%)
Mutual labels:  transliteration
Slugify
Slugify a string
Stars: ✭ 2,110 (+3303.23%)
Mutual labels:  transliteration
Cyr2lat
Converts Cyrillic characters in post, page and term slugs to Latin characters. Useful for creating human-readable URLs.
Stars: ✭ 25 (-59.68%)
Mutual labels:  transliteration
Speakingurl
Generate a slug – transliteration with a lot of options
Stars: ✭ 1,056 (+1603.23%)
Mutual labels:  transliteration
Inflected
A port of ActiveSupport's inflector to Node.js and the browser.
Stars: ✭ 144 (+132.26%)
Mutual labels:  transliteration
Urlify
A fast PHP slug generator and transliteration library that converts non-ascii characters for use in URLs.
Stars: ✭ 633 (+920.97%)
Mutual labels:  transliteration
carakanjs
Convert/transliterate Latin script into Javanese script, also known as Aksara Jawa or Carakan.
Stars: ✭ 23 (-62.9%)
Mutual labels:  transliteration
Lipika Ime
Input Method Engine (IME) for Mac OS X with built-in support for all Indic Languages
Stars: ✭ 76 (+22.58%)
Mutual labels:  transliteration
Neural japanese transliterator
Can neural networks transliterate Romaji into Japanese correctly?
Stars: ✭ 170 (+174.19%)
Mutual labels:  transliteration
Crx Jtrans
jTransliter - the roman to unicode transliter as Google chrome extension
Stars: ✭ 13 (-79.03%)
Mutual labels:  transliteration
Slugify Cli
Slugify a string
Stars: ✭ 49 (-20.97%)
Mutual labels:  transliteration
Transliterate
Convert Unicode characters to Latin characters using transliteration
Stars: ✭ 152 (+145.16%)
Mutual labels:  transliteration
Lexical Sort
Sort Unicode strings lexicographically
Stars: ✭ 23 (-62.9%)
Mutual labels:  transliteration
Transliterate
Bi-directional transliterator for Python. Transliterates (unicode) strings according to the rules specified in the language packs.
Stars: ✭ 193 (+211.29%)
Mutual labels:  transliteration
Slug Generator
Slug Generator Library for PHP, based on Unicode’s CLDR data
Stars: ✭ 740 (+1093.55%)
Mutual labels:  transliteration
Mrz
Machine Readable Zone generator and checker for official travel documents sizes 1, 2, 3, MRVA and MRVB (Passports, Visas, national id cards and other travel documents)
Stars: ✭ 119 (+91.94%)
Mutual labels:  transliteration
iuliia-go
Transliterate Cyrillic → Latin in every possible way
Stars: ✭ 36 (-41.94%)
Mutual labels:  transliteration
transliteration-php
🇺🇦 🇬🇧 🔡 🐘 PHP library for transliteration.
Stars: ✭ 34 (-45.16%)
Mutual labels:  transliteration
Avspeechsynthesizer Example
A companion project to the NSHipster article about AVSpeechSynthesizer
Stars: ✭ 157 (+153.23%)
Mutual labels:  transliteration

Iuliia

Transliterate Cyrillic → Latin in every possible way

Transliteration means representing Cyrillic data (mainly names and geographic locations) with Latin letters. It is used for international passports, visas, green cards, driving licenses, mail and goods delivery etc.

Iuliia makes transliteration easy as calling iuliia.translate() in your favorite programming language.

Why use Iuliia:

  • 20 transliteration schemas (rule sets), including all main international and Russian standards.
  • Correctly implements not only the base mapping, but all the special rules for letter combinations and word endings (AFAIK, Iuliia is the only library which does so).
  • Simple API and zero third-party dependencies.

Supported schemas

Actual schemas:

And deprecated ones:

For schema details and other information, see iuliia.ru (in Russian).

Supported languages

This repo contains schema descriptions. It is not intended to be used directly. If you want to add another schema — send a PR! See schema.jsd for JSON Schema definition and wikipedia.json as a reference.

Issues and limitations

In general:

  • Only Russian subset of Cyrillic is supported.
  • Does not support composite Unicode characters (e.g., Ё, but not Ё).

Schema-specific:

  • BS 2979:1958. This schema defines two alternative translations for Ы: ЫȲ (used by the Oxford University Press) and ЫUI (used by the British Library). iuliia uses ЫȲ.
  • GOST R 7.0.34-2014. This schema defines alternatives for many letters, but does not specify when to use which. Therefore, iuliia uses the first of suggested translations for each such letter.
  • MVD-310. This schema defines "С between two vowels → SS" rule. There is no such rule in other schemas, and MVD-310 itself is deprecated, so I decided to ignore this specific rule for the sake of code simplicity.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].