All Projects → onurgu → ner-tagger-dynet

onurgu / ner-tagger-dynet

Licence: MIT license
See http://github.com/onurgu/joint-ner-and-md-tagger This repository is basically a Bi-LSTM based sequence tagger in both Tensorflow and Dynet which can utilize several sources of information about each word unit like word embeddings, character based embeddings and morphological tags from an FST to obtain the representation for that specific wor…

Programming Languages

sed
78 projects
python
139335 projects - #7 most used programming language
shell
77523 projects
perl
6916 projects
Jupyter Notebook
11667 projects
awk
318 projects

Projects that are alternatives of or similar to ner-tagger-dynet

Sequence tagging
Named Entity Recognition (LSTM + CRF) - Tensorflow
Stars: ✭ 1,889 (+8113.04%)
Mutual labels:  named-entity-recognition, bi-lstm
ckipnlp
CKIP CoreNLP Toolkits
Stars: ✭ 92 (+300%)
Mutual labels:  named-entity-recognition
Neural-Morphological-Disambiguation-for-Turkish-DEPRECATED
Neural morphological disambiguation for Turkish. Implemented in DyNet
Stars: ✭ 11 (-52.17%)
Mutual labels:  dynet
NER-and-Linking-of-Ancient-and-Historic-Places
An NER tool for ancient place names based on Pleiades and Spacy.
Stars: ✭ 26 (+13.04%)
Mutual labels:  named-entity-recognition
PhoNER COVID19
COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
Stars: ✭ 55 (+139.13%)
Mutual labels:  named-entity-recognition
TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Stars: ✭ 84 (+265.22%)
Mutual labels:  named-entity-recognition
OpenUE
OpenUE是一个轻量级知识图谱抽取工具 (An Open Toolkit for Universal Extraction from Text published at EMNLP2020: https://aclanthology.org/2020.emnlp-demos.1.pdf)
Stars: ✭ 274 (+1091.3%)
Mutual labels:  named-entity-recognition
PersianNER
Named-Entity Recognition in Persian Language
Stars: ✭ 48 (+108.7%)
Mutual labels:  named-entity-recognition
scikitcrf NER
Python library for custom entity recognition using Sklearn CRF
Stars: ✭ 17 (-26.09%)
Mutual labels:  named-entity-recognition
BiLSTM-and-CNN-for-Link-Prediction
Keras implementation of path-based link prediction model for knowledge graph completion
Stars: ✭ 17 (-26.09%)
Mutual labels:  bi-lstm
BioMedical-NLP-corpus
Biomedical NLP Corpus or Datasets.
Stars: ✭ 44 (+91.3%)
Mutual labels:  named-entity-recognition
neural name tagging
Code for "Reliability-aware Dynamic Feature Composition for Name Tagging" (ACL2019)
Stars: ✭ 39 (+69.57%)
Mutual labels:  named-entity-recognition
SynLSTM-for-NER
Code and models for the paper titled "Better Feature Integration for Named Entity Recognition", NAACL 2021.
Stars: ✭ 26 (+13.04%)
Mutual labels:  named-entity-recognition
spert
PyTorch code for SpERT: Span-based Entity and Relation Transformer
Stars: ✭ 572 (+2386.96%)
Mutual labels:  named-entity-recognition
packard-belle
Windows 98 React Component Library
Stars: ✭ 127 (+452.17%)
Mutual labels:  reimplementation
lima
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Stars: ✭ 75 (+226.09%)
Mutual labels:  named-entity-recognition
DreeRally
Death Rally engine reimplementation
Stars: ✭ 68 (+195.65%)
Mutual labels:  reimplementation
Wisty.js
🧚‍♀️ Chatbot library turning conversations into actions, locally, in the browser.
Stars: ✭ 24 (+4.35%)
Mutual labels:  named-entity-recognition
namaco
Character Based Named Entity Recognition.
Stars: ✭ 41 (+78.26%)
Mutual labels:  named-entity-recognition
Pytorch-PCGrad
Pytorch reimplementation for "Gradient Surgery for Multi-Task Learning"
Stars: ✭ 179 (+678.26%)
Mutual labels:  reimplementation

See updated version at http://github.com/onurgu/joint-ner-and-md-tagger

Neural Tagger for MD and NER

This repo contains the software that was used to conduct the experiments reported in our article titled "Improving Named Entity Recognition by Jointly Learning to Disambiguate Morphological Tags" [1] to be presented at COLING 2018.

Training and testing

We recommend using the helper scripts for conducting experiments. The scripts named helper-script-* run the experiments in the paper with given hyper parameters.

bash ./scripts/helper-script-to-run-the-experiment-set-small-sizes.sh campaing_name | parallel -j6

For the reporting part to work, you should set up a working sacred environment, which is very easy if you choose a filesystem based storage. You can find an example of this in the helper script found in ./scripts/TRUBA folder.

Tag sentences

This project do not have a designated tagger script for now but you can obtain the output in eval_dir. You should provide the text in tokenized form in CoNLL format. The script will tag both the development and testing files and produce files in ./evaluation/temp/eval_logs/. If you need this and want to contribute by coding and sharing it with the project, you are welcome.

Replication of the experiments

To reproduce the experiments reported with our model, you can use Docker and build a replica of our experimentation environment.

To build:

docker build -t yourimagename:yourversion .

To run:

docker run -ti -v `pwd`/dataset:/opt/ner-tagger-dynet/dataset -v `pwd`/models:/opt/ner-tagger-dynet/models yourimagename:yourversion python train.py --train dataset/gungor.ner.train.small --dev dataset/gungor.ner.dev.small --test dataset/gungor.ner.test.small --word_dim 300 --word_lstm_dim 200 --word_bidirect 1 --cap_dim 100 --crf 1 --lr_method=adam --maximum-epochs 50 --char_dim 200 --char_lstm_dim 200 --char_bidirect 1 --overwrite-mappings 1 --batch-size 1

You should create or set permissions accordingly for `pwd`/dataset and `pwd`/models.

References

[1] Gungor, O., Uskudarli, S., Gungor, T., Improving Named Entity Recognition by Jointly Learning to Disambiguate Morphological Tags, 2018, COLING 2018, 19-25 August, (to appear).

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].