All Projects → iesl → Dilated Cnn Ner

iesl / Dilated Cnn Ner

Dilated CNNs for NER in TensorFlow

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Dilated Cnn Ner

Spacy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+9800%)
Mutual labels:  natural-language-processing, neural-networks, named-entity-recognition
Ncrfpp
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Stars: ✭ 1,767 (+695.95%)
Mutual labels:  natural-language-processing, neural-networks, named-entity-recognition
Spacy Course
👩‍🏫 Advanced NLP with spaCy: A free online course
Stars: ✭ 1,920 (+764.86%)
Mutual labels:  natural-language-processing, named-entity-recognition
Crf Layer On The Top Of Bilstm
The CRF Layer was implemented by using Chainer 2.0. Please see more details here: https://createmomo.github.io/2017/09/12/CRF_Layer_on_the_Top_of_BiLSTM_1/
Stars: ✭ 148 (-33.33%)
Mutual labels:  natural-language-processing, named-entity-recognition
Fixy
Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
Stars: ✭ 165 (-25.68%)
Mutual labels:  natural-language-processing, neural-networks
Clicr
Machine reading comprehension on clinical case reports
Stars: ✭ 123 (-44.59%)
Mutual labels:  natural-language-processing, neural-networks
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (-45.5%)
Mutual labels:  natural-language-processing, named-entity-recognition
Spacy Lookup
Named Entity Recognition based on dictionaries
Stars: ✭ 212 (-4.5%)
Mutual labels:  natural-language-processing, named-entity-recognition
Flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (+4884.23%)
Mutual labels:  natural-language-processing, named-entity-recognition
Knockknock
🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code
Stars: ✭ 2,304 (+937.84%)
Mutual labels:  natural-language-processing, neural-networks
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+1034.23%)
Mutual labels:  natural-language-processing, named-entity-recognition
Deep Math Machine Learning.ai
A blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.
Stars: ✭ 173 (-22.07%)
Mutual labels:  natural-language-processing, neural-networks
Nlp Pretrained Model
A collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-45.05%)
Mutual labels:  natural-language-processing, neural-networks
Neuroner
Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
Stars: ✭ 1,579 (+611.26%)
Mutual labels:  neural-networks, named-entity-recognition
Pymetamap
Python wraper for MetaMap
Stars: ✭ 119 (-46.4%)
Mutual labels:  natural-language-processing, named-entity-recognition
Deeplearning nlp
基于深度学习的自然语言处理库
Stars: ✭ 154 (-30.63%)
Mutual labels:  natural-language-processing, named-entity-recognition
Pyhanlp
中文分词 词性标注 命名实体识别 依存句法分析 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁 自然语言处理
Stars: ✭ 2,564 (+1054.95%)
Mutual labels:  natural-language-processing, named-entity-recognition
Chatbot
Русскоязычный чатбот
Stars: ✭ 106 (-52.25%)
Mutual labels:  natural-language-processing, neural-networks
Danlp
DaNLP is a repository for Natural Language Processing resources for the Danish Language.
Stars: ✭ 111 (-50%)
Mutual labels:  natural-language-processing, named-entity-recognition
Vntk
Vietnamese NLP Toolkit for Node
Stars: ✭ 170 (-23.42%)
Mutual labels:  natural-language-processing, named-entity-recognition

dilated-cnn-ner

This code implements the models described in the paper "Fast and Accurate Entity Recognition with Iterated Dilated Convolutions" by Emma Strubell, Patrick Verga, David Belanger and Andrew McCallum.

Requirements

This code uses TensorFlow v[1.0, 1.4) and Python 2.7.

It will probably train on a CPU, but honestly we haven't tried, and highly recommend training on a GPU.

Setup

  1. Set up environment variables. For example, from the root directory of this project:
export DILATED_CNN_NER_ROOT=`pwd`
export DATA_DIR=/path/to/conll-2003
  1. Get some pretrained word embeddings, e.g. SENNA embeddings or Glove embeddings. The code expects a space-separated file with one word and its embedding per line, e.g.:

    word 0.45 0.67 0.99 ...
    

    Make a directory for the embeddings:

    mkdir -p data/embeddings
    

    and place the file there.

  2. Perform all data preprocessing for a given configuration. For example:

./bin/preprocess.sh conf/conll/dilated-cnn.conf

This calls preprocess.py, which loads the data from text files, maps the tokens, labels and any other features to integers, and writes to TensorFlow tfrecords.

Training

Once the data preprocessing is completed, you can train a tagger:

./bin/train-cnn.sh conf/conll/dilated-cnn.conf

Evaluation

By default, the trainer will write the model which achieved the best dev F1. To evaluate a saved model on the dev set:

./bin/eval-cnn.sh conf/conll/dilated-cnn.conf --load_model path/to/model

To evaluate a saved model on the test set:

./bin/eval-cnn.sh conf/conll/dilated-cnn.conf test --load_model path/to/model

Configs

Configuration files (conf/*) specify all the data, parameters, etc. for an experiment.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].