All Projects → anna-hope → Phonemes

anna-hope / Phonemes

Licence: mit
Jason Riggle's chart of phonological features in JSON format + extras

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Phonemes

linguisticsdown
Easy Linguistics Document Writing with R Markdown
Stars: ✭ 24 (-27.27%)
Mutual labels:  linguistics
wikipron
Massively multilingual pronunciation mining
Stars: ✭ 167 (+406.06%)
Mutual labels:  linguistics
Pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+1190.91%)
Mutual labels:  linguistics
TextDatasetCleaner
🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (-18.18%)
Mutual labels:  linguistics
mystem
CGo bindings to Yandex.Mystem
Stars: ✭ 28 (-15.15%)
Mutual labels:  linguistics
concepticon-data
The curation repository for the data behind Concepticon.
Stars: ✭ 25 (-24.24%)
Mutual labels:  linguistics
lingvo--Ner-ru
Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке
Stars: ✭ 38 (+15.15%)
Mutual labels:  linguistics
Nltk data
NLTK Data
Stars: ✭ 675 (+1945.45%)
Mutual labels:  linguistics
OpenGNT
Open Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources
Stars: ✭ 55 (+66.67%)
Mutual labels:  linguistics
rsyntaxtree
Syntax tree generator made with Ruby and RMagic
Stars: ✭ 62 (+87.88%)
Mutual labels:  linguistics
clinical nlp elastic
Clinical NLP Analysis with Elasticsearch and Kibana
Stars: ✭ 32 (-3.03%)
Mutual labels:  linguistics
duree
Durée: the longest book ever written.
Stars: ✭ 67 (+103.03%)
Mutual labels:  linguistics
treebender
A HDPSG-inspired symbolic natural language parser written in Rust
Stars: ✭ 24 (-27.27%)
Mutual labels:  linguistics
neural-net-linguistics
Papers about NN and linguistics
Stars: ✭ 14 (-57.58%)
Mutual labels:  linguistics
Lexpredict Lexnlp
LexNLP by LexPredict
Stars: ✭ 439 (+1230.3%)
Mutual labels:  linguistics
lameta
The Metadata Editor for Transparent Archiving of language document materials
Stars: ✭ 18 (-45.45%)
Mutual labels:  linguistics
TextGridTools
Read, write, and manipulate Praat TextGrid files with Python
Stars: ✭ 84 (+154.55%)
Mutual labels:  linguistics
Awesome Sentiment Analysis
😀😄😂😭 A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤
Stars: ✭ 816 (+2372.73%)
Mutual labels:  linguistics
Weixin public corpus
微信公众号语料库
Stars: ✭ 465 (+1309.09%)
Mutual labels:  linguistics
spanish-corpora
Unannotated Spanish 3 Billion Words Corpora
Stars: ✭ 61 (+84.85%)
Mutual labels:  linguistics

This repository hosts a representation of Jason Riggle's chart of phonological features version 12.12 in a machine-readable JSON format.

The keys in the JSON file are the phonemes' IPA symbols. The values are their English-language name and the binary features from the chart linked above (see the JSON file for an example).

Additionally, this repository provides a script (phonemeviewer.py) which lets you view the phoneme features from the provided JSON file and see what positive (+ or ±) features each phoneme has. Additionally, when given a list of phonemes, the script calculates the 'similarity' between these phonemes and lists the features that every phoneme in the list shares (if any).

Example usage:

> ð
voiced dental fricative
+cons
-son
-syl
-labial
+coronal
+ant
-dist
+dorsal
-pharyngeal
+voice
-SG
-CG
+cont
-strident
-lateral
-del_rel
-nasal

> ð ʃ
0.782608695652174
-del_rel
+coronal
-son
-syl
-pharyngeal
-CG
-labial
-SG
-lateral
+cont
-nasal
+cons
['coronal', 'cont', 'cons']

The script was written for Python 3.4+, but will probably run on Python 3.3 if the backported enum package from 3.4 is installed. You may use http://ipa.typeit.org/full/ to type the IPA symbols into the script.

Please report any inconsistencies you may find between the JSON file and Jason Riggle's chart, or, better yet, please fix them and create pull requests. Please do not report any errors you may find in Jason Riggle's chart here. Instead, send your comments directly to the author; there is a non-zero chance of him responding to them. Additionally, please let me know if this repository references an outdated version of Jason Riggle's chart.

Big thanks to Rafael Abramovitz for helping me with the creation of a machine-readable version of the chart.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].