All Projects → baoy-nlp → FAParser

baoy-nlp / FAParser

Licence: GPL-3.0 license
A Fast(er) and Accurate Syntactic Parsing by Exacter Searching.

Programming Languages

python
139335 projects - #7 most used programming language
c
50402 projects - #5 most used programming language

Projects that are alternatives of or similar to FAParser

stanford-corenlp-docker
build/run the most current Stanford CoreNLP server in a docker container
Stars: ✭ 38 (+123.53%)
Mutual labels:  dependency-parser, constituency-parser
datalinguist
Stanford CoreNLP in idiomatic Clojure.
Stars: ✭ 93 (+447.06%)
Mutual labels:  dependency-parser
dpar
Neural network transition-based dependency parser (in Rust)
Stars: ✭ 41 (+141.18%)
Mutual labels:  dependency-parser
frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+311.76%)
Mutual labels:  dependency-parser
TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Stars: ✭ 84 (+394.12%)
Mutual labels:  dependency-parser
syntaxnet
Syntaxnet Parsey McParseface wrapper for POS tagging and dependency parsing
Stars: ✭ 77 (+352.94%)
Mutual labels:  dependency-parser
yap
Yet Another (natural language) Parser
Stars: ✭ 40 (+135.29%)
Mutual labels:  dependency-parser
vietnamese-roberta
A Robustly Optimized BERT Pretraining Approach for Vietnamese
Stars: ✭ 22 (+29.41%)
Mutual labels:  fairseq
Pyhanlp
中文分词 词性标注 命名实体识别 依存句法分析 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁 自然语言处理
Stars: ✭ 2,564 (+14982.35%)
Mutual labels:  dependency-parser
Hanlp
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Stars: ✭ 24,626 (+144758.82%)
Mutual labels:  dependency-parser
dependency parsing tf
Tensorflow implementation of "A Fast and Accurate Dependency Parser using Neural Networks"
Stars: ✭ 77 (+352.94%)
Mutual labels:  dependency-parser
udar
UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.
Stars: ✭ 15 (-11.76%)
Mutual labels:  dependency-parser
ipymarkup
NER, syntax markup visualizations
Stars: ✭ 108 (+535.29%)
Mutual labels:  dependency-parser
fairseq-tagging
a Fairseq fork for sequence tagging/labeling tasks
Stars: ✭ 26 (+52.94%)
Mutual labels:  fairseq

SOTA

Performance: Highest F1 Score/ UAS(LAS)

Speed: Fastest

This repo's target

Faster and Accurate Syntactic Parsing both on Constituency and Dependency.

Naive Motivation

Conversion: Dependency Tree could be converted from constituency by utilizing head rule.
Span: where the head rule is used.

Implementation

Self-Attentive Con Parser, start from scratch with:

  • Phrase Structure Tree Load/Conversion/Expr, Oracle Design
  • Pretrain Model Loaded: ELMO
  • Multi-Head Self-Attentive Encoder
  • Max-Span Tree Inference
  • Training

Biaffine Dep Parser, start from scratch with:

  • Dependency Relation Tree Load/Conversion/Expr, Oracle Design
  • Pretrain Word Embedding Loaded
  • Bi-LSTM Encoder
  • MST Inference with Attention Module
  • Training

FAParser's new features:

  • Enhanced Pretrain Module of Word Representation
  • Enhanced MST Inference with Multi-Head Attention Module
  • Conditional modeling on interaction between information flows of phrase structure and dependency relation

Similar to the design of fairseq, we organize our FAParser as:

FAParser
│   README.md
│   train.py
│   inference.py
│   preprocess.py
│
└───evaluation: for validation or testing
│   │   F1
│   │   Accuracy
│   │       │ UAS
│   │       └ LAS
│   └  ...
│
└───data: 
│   │   tree loaded or structure utils
│   │   
│   └ ...
│   
└───criterion: 
│   │   cross entropy. etc
│   │   
│   └ ...
│
└───models: 
│   │   three parser
│   │   
│   └ ...
│
└───modules: 
│   │   series of module used in models
│   │   
│   └ ...
│
└───optim: for optimizer
│   │   lr_shedule
│   │   adam...
│   └ ...
│
└───tasks: for loss computing
│   │   Constituency Parser
│   │   Dependenecy Parser
│   │   FAParser
│   └ ...
│
└───utlis:
│   │   command/preprocess/meters...
│   │   
│   └ ...

criterion,modules,optim and data could be initialized with the fairseq's sub-modules. we just need build some task-specific sub-packages.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].