All Projects → scarletcho → Kog2p

scarletcho / Kog2p

Licence: gpl-3.0
Korean grapheme-to-phone conversion in Python

Programming Languages

python
139335 projects - #7 most used programming language

Labels

Projects that are alternatives of or similar to Kog2p

Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+606.45%)
Mutual labels:  korean
Atom Korean Spell Daum
Daum 맞춤법 검사기로 텍스트를 보내 손쉽게 교정 가능한 Atom package
Stars: ✭ 24 (-61.29%)
Mutual labels:  korean
Gobyexample
🎁 Go By Example 한국어 버전
Stars: ✭ 50 (-19.35%)
Mutual labels:  korean
Cppcoreguidelines
C++ 핵심 가이드라인 한글화 프로젝트 (C++ Core Guidelines)
Stars: ✭ 572 (+822.58%)
Mutual labels:  korean
Ko.javascript.info
Modern JavaScript Tutorial in Korean (모던 JavaScript 튜토리얼)
Stars: ✭ 822 (+1225.81%)
Mutual labels:  korean
Ununiga
[은는이가] 한글 조사(助詞) 대응 I18n engine extension
Stars: ✭ 34 (-45.16%)
Mutual labels:  korean
Southkorea Maps
South Korea administrative divisions in ESRI Shapefile, GeoJSON and TopoJSON formats.
Stars: ✭ 367 (+491.94%)
Mutual labels:  korean
Konlpy
Python package for Korean natural language processing.
Stars: ✭ 1,098 (+1670.97%)
Mutual labels:  korean
Eulyoo1945
을유문화사에서 만든 을유1945 폰트를 웹에서 손쉽게 사용할 수 있는 환경을 제공합니다.
Stars: ✭ 24 (-61.29%)
Mutual labels:  korean
Graphql Kr.github.io
🇰🇷 GraphQL Document in Korean
Stars: ✭ 47 (-24.19%)
Mutual labels:  korean
Awesome Swift Korean Lecture
훌륭한 Swift 세션 동영상(강좌), 한글 자막있는 혹은 한국어 강의 정보 링크 모음 (Awesome Swift Korean lecture information)
Stars: ✭ 649 (+946.77%)
Mutual labels:  korean
Awesome Sushi
🍣 국내 스시 오마카세 맛집 리스트
Stars: ✭ 700 (+1029.03%)
Mutual labels:  korean
Haskellstudy
HaskellStudy
Stars: ✭ 44 (-29.03%)
Mutual labels:  korean
Kagome
Self-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+793.55%)
Mutual labels:  korean
Yesterday I Learned
Brainfarts are caused by the rupturing of the cerebral sphincter.
Stars: ✭ 50 (-19.35%)
Mutual labels:  korean
Open Apis Korea
🇰🇷 한국어 사용자를 위한 서비스에 사용하기 위한 오픈 API 모음
Stars: ✭ 391 (+530.65%)
Mutual labels:  korean
Hangul Asm
Hangul disasm/asm
Stars: ✭ 10 (-83.87%)
Mutual labels:  korean
Typescript Handbook Ko
TypeScript Handbook in Korean.
Stars: ✭ 60 (-3.23%)
Mutual labels:  korean
Char Rnn Tensorflow
Multi-layer Recurrent Neural Networks for character-level language models implements by TensorFlow
Stars: ✭ 58 (-6.45%)
Mutual labels:  korean
Owasp Masvs
The Mobile Application Security Verification Standard (MASVS) is a standard for mobile app security.
Stars: ✭ 1,030 (+1561.29%)
Mutual labels:  korean

KoG2P

Given an input of a series of Korean graphemes/letters (i.e. Hangul), KoG2P outputs the corresponding pronunciations.

한국어의 문자열로부터 발음열을 생성하는 파이썬 기반 G2P 패키지입니다.
터미널에서 원하는 문자열을 함께 입력해 사용할 수 있습니다.

How to use?

On terminal, you simply can type in your input within quotations:

$ python g2p.py '박물관'

Then you'll get /방물관/ symbolized as follows:

p0 aa ng mm uu ll k0 wa nf

NB. Your input does not necessarily need to be a lemma or a legitimate sequence of Korean; the system will provide an output based on the phonological rules of Korean for any sequences in Hangul.

Requirement

  • Python 2.7 or 3.x

Symbol table

Please check out the symbol table below for the mapping.

C/V Position Symbols in Hangul Symbols in KoG2P
consonant onset p0
consonant onset ph
consonant onset pp
consonant onset t0
consonant onset th
consonant onset tt
consonant onset k0
consonant onset kh
consonant onset kk
consonant onset s0
consonant onset ss
consonant onset h0
consonant onset c0
consonant onset ch
consonant onset cc
consonant onset mm
consonant onset nn
consonant onset rr
consonant coda pf
consonant coda ph
consonant coda tf
consonant coda th
consonant coda kf
consonant coda kh
consonant coda kk
consonant coda s0
consonant coda ss
consonant coda h0
consonant coda c0
consonant coda ch
consonant coda mf
consonant coda nf
consonant coda ng
consonant coda ll
consonant coda ㄱㅅ ks
consonant coda ㄴㅈ nc
consonant coda ㄴㅎ nh
consonant coda ㄹㄱ lk
consonant coda ㄹㅁ lm
consonant coda ㄹㅂ lb
consonant coda ㄹㅅ ls
consonant coda ㄹㅌ lt
consonant coda ㄹㅍ lp
consonant coda ㄹㅎ lh
consonant coda ㅂㅅ ps
vowel monophthong ii
vowel monophthong ee
vowel monophthong qq
vowel monophthong aa
vowel monophthong xx
vowel monophthong vv
vowel monophthong uu
vowel monophthong oo
vowel diphthong ye
vowel diphthong yq
vowel diphthong ya
vowel diphthong yv
vowel diphthong yu
vowel diphthong yo
vowel diphthong wi
vowel diphthong wo
vowel diphthong wq
vowel diphthong we
vowel diphthong wa
vowel diphthong wv
vowel diphthong xi

NB. IPA symbols for Korean phones can be found in the following page: IPA for Korean.

Reference

Please cite the following if using this code:

@misc{cho2017kog2p,
  title = {Korean Grapheme-to-Phoneme Analyzer (KoG2P)},
  author = {Yejin Cho},
  year = {2017},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/scarletcho/KoG2P}}
}

Thank you for your citations!

  • Yoon Seok Hong, Kyung Seo Ki, and Gahgene Gweon. 2018. Automatic Miscue Detection Using RNN Based Models with Data Augmentation. In Proc. Interspeech 2018. 1646-1650. [pdf]

  • Younggun Lee and Taesu Kim. 2018. Learning pronunciation from a foreign language in speech synthesis network. arXiv preprint. arXiv:1811.09364. [pdf]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].