yahshibu / nested-ner-tacl2020-flair

License: GPL-3.0
Implementation of Nested Named Entity Recognition using Flair

Implementation of Nested Named Entity Recognition

Some files are part of NeuroNLP2.

Requirements

We tested this library with the following libraries:

Running experiments

Testing this library with sample data

  1. Put the embedding file PubMed-shuffle-win-2.bin into the "./embeddings/" directory
  2. Run gen_data.py to generate the processed data files for training; they will be placed in the "./data/" directory
    python gen_data.py
  3. Run train.py to start training
    python train.py
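
The steps above can also be driven from a small Python script. The following is only a sketch, not part of this repository: the helper names missing_files and run_sample_experiment are hypothetical, and it assumes the script is run from the repository root with the embedding file already in place.

```python
import subprocess
import sys
from pathlib import Path

# Paths and script names are taken from the steps above.
EMBEDDING_FILE = Path("embeddings") / "PubMed-shuffle-win-2.bin"
STEPS = ["gen_data.py", "train.py"]


def missing_files(root, relative_paths):
    """Return the paths (relative to root) that do not exist as files."""
    return [p for p in relative_paths if not (Path(root) / p).is_file()]


def run_sample_experiment(root="."):
    """Run the sample-data steps in order, stopping at the first failure."""
    missing = missing_files(root, [EMBEDDING_FILE, *map(Path, STEPS)])
    if missing:
        raise FileNotFoundError("missing: " + ", ".join(map(str, missing)))
    for script in STEPS:
        # check=True raises CalledProcessError if a script exits non-zero,
        # so train.py is never started after a failed gen_data.py.
        subprocess.run([sys.executable, script], cwd=root, check=True)
```

Calling run_sample_experiment() from the repository root is then equivalent to typing the two commands by hand, with an early error if the embedding file has not been copied yet.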

Reproducing our experiment on the ACE-2004 dataset

  1. Put the corpus ACE-2004 into the "../ACE2004/" directory
  2. Put this .tgz file into the "../" directory and extract it
  3. Run parse_ace2004.py to extract sentences for training; they will be placed in the "./data/ace2004/" directory
    python parse_ace2004.py
  4. Put the embedding file GoogleNews-vectors-negative300.bin.gz into the "./embeddings/" directory
  5. Decompress the embedding file GoogleNews-vectors-negative300.bin.gz
    gzip -d embeddings/GoogleNews-vectors-negative300.bin.gz
  6. Run gen_data_for_ace2004.py to prepare the processed data files for training; they will be placed in the "./data/" directory
    python gen_data_for_ace2004.py
  7. Run train.py to start training
    python train.py
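
On systems without the gzip command, the decompression step can be done with the Python standard library instead. This is a minimal sketch; the function name gunzip and its keep_original parameter are illustrative and not part of this repository.

```python
import gzip
import shutil
from pathlib import Path


def gunzip(src, keep_original=False):
    """Decompress src (ending in .gz) next to itself and return the new path."""
    src = Path(src)
    if src.suffix != ".gz":
        raise ValueError(f"expected a .gz file, got {src.name}")
    dst = src.with_suffix("")  # drops only the trailing ".gz"
    with gzip.open(src, "rb") as fin, open(dst, "wb") as fout:
        # Stream in chunks so the large embedding file is never held in memory.
        shutil.copyfileobj(fin, fout)
    if not keep_original:
        src.unlink()  # `gzip -d` also removes the compressed file by default
    return dst
```

For example, gunzip("embeddings/GoogleNews-vectors-negative300.bin.gz") should leave embeddings/GoogleNews-vectors-negative300.bin in place, as the gzip -d command in the decompression step does.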

Reproducing our experiment on the ACE-2005 dataset

  1. Put the corpus ACE-2005 into the "../ACE2005/" directory
  2. Put this .tgz file into the "../" directory and extract it
  3. Run parse_ace2005.py to extract sentences for training; they will be placed in the "./data/ace2005/" directory
    python parse_ace2005.py
  4. Put the embedding file GoogleNews-vectors-negative300.bin.gz into the "./embeddings/" directory
  5. Decompress the embedding file GoogleNews-vectors-negative300.bin.gz
    gzip -d embeddings/GoogleNews-vectors-negative300.bin.gz
  6. Run gen_data_for_ace2005.py to prepare the processed data files for training; they will be placed in the "./data/" directory
    python gen_data_for_ace2005.py
  7. Run train.py to start training
    python train.py
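
Before starting a long run, it can be worth checking that the decompressed embedding file is intact. The sketch below assumes the file is in the word2vec binary format, whose first line is an ASCII header of the form "<vocab_size> <vector_dim>"; the function name read_word2vec_header is hypothetical and not part of this repository.

```python
def read_word2vec_header(path):
    """Return (vocab_size, vector_dim) from the first line of a word2vec .bin file."""
    with open(path, "rb") as f:
        # The header is a short ASCII line; the binary vectors follow it.
        # Reading at most 64 bytes avoids scanning a large non-word2vec file.
        header = f.readline(64)
    # int() raises ValueError if the file does not start with two integers.
    vocab_size, vector_dim = (int(field) for field in header.split())
    return vocab_size, vector_dim
```

For GoogleNews-vectors-negative300.bin the reported dimension should presumably be 300, as the file name suggests; a ValueError here would point to a truncated download or a file that is still compressed.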

Reproducing our experiment on the GENIA dataset

  1. Put the corpus GENIA into the "../GENIA/" directory
  2. Run parse_genia.py to extract sentences for training; they will be placed in the "./data/genia/" directory
    python parse_genia.py
  3. Put the embedding file PubMed-shuffle-win-2.bin into the "./embeddings/" directory
  4. Run gen_data_for_genia.py to prepare the processed data files for training; they will be placed in the "./data/" directory
    python gen_data_for_genia.py
  5. Run train.py to start training
    python train.py
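
The three recipes above share the same shape: a corpus directory, an embedding file, and a parse, generate, train sequence. The table below summarizes them in Python as a quick reference. The Recipe class and shell_commands helper are illustrative only and do not exist in this repository; the paths and script names are copied from the steps above. The extra .tgz extraction step for ACE-2004 and ACE-2005 is not modeled.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class Recipe:
    corpus_dir: Path   # where the raw corpus is expected
    embedding: Path    # decompressed embedding file
    scripts: tuple     # scripts to run, in order


GOOGLE_NEWS = Path("embeddings/GoogleNews-vectors-negative300.bin")
PUBMED = Path("embeddings/PubMed-shuffle-win-2.bin")

RECIPES = {
    "ace2004": Recipe(Path("../ACE2004"), GOOGLE_NEWS,
                      ("parse_ace2004.py", "gen_data_for_ace2004.py", "train.py")),
    "ace2005": Recipe(Path("../ACE2005"), GOOGLE_NEWS,
                      ("parse_ace2005.py", "gen_data_for_ace2005.py", "train.py")),
    "genia": Recipe(Path("../GENIA"), PUBMED,
                    ("parse_genia.py", "gen_data_for_genia.py", "train.py")),
}


def shell_commands(dataset):
    """Return the commands for one dataset in the order they should be run."""
    return [f"python {script}" for script in RECIPES[dataset].scripts]
```

For instance, shell_commands("genia") lists the three commands from the GENIA steps in order.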

Configuration

The model and training settings are defined in config.py

Citation

Please cite our paper:

@article{shibuya-hovy-2020-nested,
  title = "Nested Named Entity Recognition via Second-best Sequence Learning and Decoding",
  author = "Shibuya, Takashi and Hovy, Eduard",
  journal = "Transactions of the Association for Computational Linguistics",
  volume = "8",
  year = "2020",
  doi = "10.1162/tacl_a_00334",
  pages = "605--620",
}