All Projects → yahshibu → Nested Ner Tacl2020 Transformers

yahshibu / Nested Ner Tacl2020 Transformers

Licence: gpl-3.0
Implementation of Nested Named Entity Recognition using BERT

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Nested Ner Tacl2020 Transformers

Understanding Financial Reports Using Natural Language Processing
Investigate how mutual funds leverage credit derivatives by studying their routine filings to the SEC using NLP techniques 📈🤑
Stars: ✭ 36 (-52.63%)
Mutual labels:  natural-language-processing, named-entity-recognition, information-extraction
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (+59.21%)
Mutual labels:  natural-language-processing, named-entity-recognition, information-extraction
Awesome Persian Nlp Ir
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+505.26%)
Mutual labels:  natural-language-processing, named-entity-recognition
Seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
Stars: ✭ 508 (+568.42%)
Mutual labels:  natural-language-processing, named-entity-recognition
Stanza
Official Stanford NLP Python Library for Many Human Languages
Stars: ✭ 5,887 (+7646.05%)
Mutual labels:  natural-language-processing, named-entity-recognition
Transformers Tutorials
Github repo with tutorials to fine tune transformers for diff NLP tasks
Stars: ✭ 384 (+405.26%)
Mutual labels:  natural-language-processing, named-entity-recognition
Neuronlp2
Deep neural models for core NLP tasks (Pytorch version)
Stars: ✭ 397 (+422.37%)
Mutual labels:  natural-language-processing, named-entity-recognition
Hanlp
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Stars: ✭ 24,626 (+32302.63%)
Mutual labels:  natural-language-processing, named-entity-recognition
Vncorenlp
A Vietnamese natural language processing toolkit (NAACL 2018)
Stars: ✭ 354 (+365.79%)
Mutual labels:  natural-language-processing, named-entity-recognition
Named Entity Recognition
name entity recognition with recurrent neural network(RNN) in tensorflow
Stars: ✭ 20 (-73.68%)
Mutual labels:  natural-language-processing, named-entity-recognition
Deepnlp
基于深度学习的自然语言处理库
Stars: ✭ 34 (-55.26%)
Mutual labels:  natural-language-processing, named-entity-recognition
Nlp Progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Stars: ✭ 19,518 (+25581.58%)
Mutual labels:  natural-language-processing, named-entity-recognition
Usc Ds Relationextraction
Distantly Supervised Relation Extraction
Stars: ✭ 378 (+397.37%)
Mutual labels:  natural-language-processing, information-extraction
Spacy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+28818.42%)
Mutual labels:  natural-language-processing, named-entity-recognition
Spacy Streamlit
👑 spaCy building blocks and visualizers for Streamlit apps
Stars: ✭ 360 (+373.68%)
Mutual labels:  natural-language-processing, named-entity-recognition
Ner Lstm
Named Entity Recognition using multilayered bidirectional LSTM
Stars: ✭ 532 (+600%)
Mutual labels:  natural-language-processing, named-entity-recognition
Corenlp
Stanford CoreNLP: A Java suite of core NLP tools.
Stars: ✭ 8,248 (+10752.63%)
Mutual labels:  natural-language-processing, named-entity-recognition
Gcn Over Pruned Trees
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)
Stars: ✭ 312 (+310.53%)
Mutual labels:  natural-language-processing, information-extraction
Snips Nlu
Snips Python library to extract meaning from text
Stars: ✭ 3,583 (+4614.47%)
Mutual labels:  named-entity-recognition, information-extraction
Entity Recognition Datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (+1072.37%)
Mutual labels:  natural-language-processing, named-entity-recognition

Implementation of Nested Named Entity Recognition

Some files are part of NeuroNLP2.

Requirements

We tested this library with the following libraries:

Running experiments

Testing this library with a sample data

  1. Run the gen_data.py to generate the processed data files for training, and they will be placed at the "./data/" directory
    python gen_data.py
    
  2. Run the train.py to start training
    python train.py
    

Reproducing our experiment on the ACE-2004 dataset

  1. Put the corpus ACE-2004 into the "../ACE2004/" directory
  2. Put this .tgz file into the "../" and extract it
  3. Run the parse_ace2004.py to extract sentences for training, and they will be placed at the "./data/ace2004/"
    python parse_ace2004.py
    
  4. Run the gen_data_for_ace2004.py to prepare the processed data files for training, and they will be placed at the "./data/" directory
    python gen_data_for_ace2004.py
    
  5. Run the train.py to start training
    python train.py
    

Reproducing our experiment on the ACE-2005 dataset

  1. Put the corpus ACE-2005 into the "../ACE2005/" directory
  2. Put this .tgz file into the "../" and extract it
  3. Run the parse_ace2005.py to extract sentences for training, and they will be placed at the "./data/ace2005/"
    python parse_ace2005.py
    
  4. Run the gen_data_for_ace2005.py to prepare the processed data files for training, and they will be placed at the "./data/" directory
    python gen_data_for_ace2005.py
    
  5. Run the train.py to start training
    python train.py
    

Reproducing our experiment on the GENIA dataset

  1. Put the corpus GENIA into the "../GENIA/" directory
  2. Run the parse_genia.py to extract sentences for training, and they will be placed at the "./data/genia/"
    python parse_genia.py
    
  3. Run the gen_data_for_genia.py to prepare the processed data files for training, and they will be placed at the "./data/" directory
    python gen_data_for_genia.py
    
  4. Run the train.py to start training
    python train.py
    

Configuration

Configurations of the model and training are in config.py

Citation

Please cite our paper:

@article{shibuya-hovy-2020-nested,
  title = "Nested Named Entity Recognition via Second-best Sequence Learning and Decoding",
  author = "Shibuya, Takashi and Hovy, Eduard",
  journal = "Transactions of the Association for Computational Linguistics",
  volume = "8",
  year = "2020",
  doi = "10.1162/tacl_a_00334",
  pages = "605--620",
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].