All Projects → erayyildiz → Neural-Morphological-Disambiguation-for-Turkish-DEPRECATED

erayyildiz / Neural-Morphological-Disambiguation-for-Turkish-DEPRECATED

Licence: other
Neural morphological disambiguation for Turkish. Implemented in DyNet

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Neural-Morphological-Disambiguation-for-Turkish-DEPRECATED

yap
Yet Another (natural language) Parser
Stars: ✭ 40 (+263.64%)
Mutual labels:  morphological-analysis, morphological-disambiguator
udar
UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.
Stars: ✭ 15 (+36.36%)
Mutual labels:  morphological-analysis, morphological-disambiguator
zeyrek
Python morphological analyzer for Turkish language. Partial port of ZemberekNLP.
Stars: ✭ 36 (+227.27%)
Mutual labels:  turkish, morphological-analysis
fsharp kitap
Kitap Önizleme Versiyonu
Stars: ✭ 25 (+127.27%)
Mutual labels:  turkish
startup-sozlugu
Startup dünyasında sık kullan kelimeler ve terimler
Stars: ✭ 21 (+90.91%)
Mutual labels:  turkish
ruby-tr
Ruby Türkiye Topluluk Sitesi
Stars: ✭ 50 (+354.55%)
Mutual labels:  turkish
openssl-certificate-authority-guide
Bu kılavuz, OpenSSL komut satırı araçlarını kullanarak kendi sertifika yetkilinizi (CA) kurup nasıl kullanacağınızı gösterir.
Stars: ✭ 17 (+54.55%)
Mutual labels:  turkish
tdk
python library for turkish dictionary. 📕🇹🇷
Stars: ✭ 17 (+54.55%)
Mutual labels:  turkish
TurkishWordNet
Turkish WordNet KeNet
Stars: ✭ 32 (+190.91%)
Mutual labels:  turkish
bilkent-turkish-writings-dataset
Turkish writings dataset that promotes creativity, content, composition, grammar, spelling and punctuation.
Stars: ✭ 30 (+172.73%)
Mutual labels:  turkish
turkish banks
All Turkish Banks and Their Branches
Stars: ✭ 28 (+154.55%)
Mutual labels:  turkish
phpwaf-phanalyzer
AliGuard PHP WAF
Stars: ✭ 12 (+9.09%)
Mutual labels:  turkish
Divan.hs
Ottoman Divan poetry vezin checker in Haskell!
Stars: ✭ 37 (+236.36%)
Mutual labels:  turkish
almanca
Almanca dilbilgisi ve gramer notlari / Lesson notes I have taken to learn the German language beginning from A1.
Stars: ✭ 15 (+36.36%)
Mutual labels:  turkish
clean-code-javascript-tr
🛁 JavaScript için uyarlanmış Temiz Kod kavramları
Stars: ✭ 86 (+681.82%)
Mutual labels:  turkish
Turkish-Lemmatizer
Lemmatization for Turkish Language
Stars: ✭ 72 (+554.55%)
Mutual labels:  turkish
turkce-kufur-karaliste
Türkçe için bir kara liste (blacklist)
Stars: ✭ 117 (+963.64%)
Mutual labels:  turkish
backend-best-practices
Backend uygulamaları geliştirirken dikkate alınabilecek örnek yöntemlerin derlendiği güncellenen bir kaynak.
Stars: ✭ 80 (+627.27%)
Mutual labels:  turkish
language-detector
Detect the language of text
Stars: ✭ 28 (+154.55%)
Mutual labels:  turkish
Turkcekaynaklar Com
Özenle seçilmiş Türkçe kaynaklar listesi En: Curated list of Turkish resources
Stars: ✭ 1,810 (+16354.55%)
Mutual labels:  turkish

Deprecated: See the current version which is based on PyTorch.

Neural-Morphological-Disambiguation-for-Turkish

This project is dynet implementation of morphological disambiguation method proposed in [1]

Full context model is one of the methods proposed in the study and perform best for Turkish morphological disambiguation among four proposed methods. All the methods are based on character level bidirectional LSTM networks and Full Context Model represent the context applying LSTMs on surface words around the target word needs to be disambiguated. Here, we provide an implementation of this method written in dynet. We also provide train and test sets for Turkish morphological disambiguation. The disambiguation model can be trained using this dynet script. Model save and load functions are also provided so that users are able to use this tool as a Turkish morphological disambiguator.

This script is developed in a project about joint dependecy parsing and morphological disambiguation and willbe further developed. A better organized morphological disambiguation tool with much more functionality will be provided soon.

Just run them Neural_morph_disambiguator.py to start training. Values of default parameters are same as proposed values in the paper.

For the details about the method, please see the Shen's study.

accuracy on test set: 0.966707477868 ambiguous accuracy on test: 0.93011416992

[1] Shen,et. al. 2017. The Role of Context in Neural Morphological Disambiguation COLING 2016

Installation

To install the progrm you need a machine with a Python 3 or higher. To install python3, run the following program if your machine has linux based OS.

sudo apt update
sudo apt install software-properties-common
sudo add-apt-repository ppa:deadsnakes/ppa
sudo apt install python3.7

Then you should install python package manager pip:

sudo apt install python3-pip

After pip is installed you should install python packages which the project requires:

pip install -f REQUIREMENTS.txt

Then create a folder named models in project root directory so that the trained model can be saved into this directory.

mkdir models

Now installation is completed, you can run the program.

How to run

For training you can run the following python script:

from neural_morph_disambiguation_dynet import MorphologicalDisambiguator
disambiguator = MorphologicalDisambiguator(train_from_scratch=True, char_representation_len=100, word_lstm_rep_len=200, 
                                           train_data_path="data/data.train.txt",
                                           test_data_paths=["data/data.test.txt","data/test.merge"],
                                           model_file_name='my_model')

These codes will start training on given datasets and you can track the training status from console. When the training is completed, the model will be saved into models path with name 'my_model'

To load and use a trained model use the codes below:

from neural_morph_disambiguation_dynet import MorphologicalDisambiguator
disambiguator = MorphologicalDisambiguator.create_from_existed_model("model-22.10.2017")
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].