Rostlab / nalaf

License: Apache-2.0
NLP framework in Python for entity recognition and relationship extraction

☝️ We moved

This library is no longer maintained and only occasionally receives bug fixes.

We moved the functionality to train NER & Relation models to the text annotation tool, tagtog:

tagtog, The Text Annotation Tool to Train AI




nalaf - (Na)tural (La)nguage (F)ramework

nalaf is an NLP framework written in Python. The goal is a general-purpose, module-based, easy-to-use framework for common text mining tasks. Two tasks are currently covered: named-entity recognition (NER) and relationship extraction. Both modules support training and annotating. They come with helper components such as cross-validation training and reading and converting different corpus formats. NER is currently implemented with Conditional Random Fields (CRFs), and relationship extraction with Support Vector Machines (SVMs) using either linear or tree kernels.
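
To make the CRF-based NER setup more concrete, here is a minimal sketch of the kind of per-token features and BIO labels that a CRF tagger is typically trained on. It is plain Python and does not use the nalaf API. The function names, the feature set, and the "mutation" entity type are assumptions chosen for illustration, not nalaf's actual implementation.

```python
import re


def tokenize(text):
    """Split text into (token, start_offset) pairs on runs of non-whitespace."""
    return [(m.group(), m.start()) for m in re.finditer(r"\S+", text)]


def token_features(tokens, i):
    """Build a small feature dict for token i (illustrative feature set only)."""
    word = tokens[i][0]
    return {
        "lower": word.lower(),
        "has_digit": any(ch.isdigit() for ch in word),
        "has_upper": any(ch.isupper() for ch in word),
        "prev": tokens[i - 1][0].lower() if i > 0 else "<BOS>",
        "next": tokens[i + 1][0].lower() if i < len(tokens) - 1 else "<EOS>",
    }


def bio_labels(tokens, entities):
    """Assign B-/I-/O labels given entities as (start, end, type) character spans."""
    labels = []
    for word, start in tokens:
        end = start + len(word)
        label = "O"
        for ent_start, ent_end, ent_type in entities:
            if start >= ent_start and end <= ent_end:
                # The first token of a span gets B-, later tokens get I-.
                label = ("B-" if start == ent_start else "I-") + ent_type
                break
        labels.append(label)
    return labels


text = "This is c.A1003G an example"
tokens = tokenize(text)
# Hypothetical gold annotation: characters 8..16 cover "c.A1003G".
entities = [(8, 16, "mutation")]
features = [token_features(tokens, i) for i in range(len(tokens))]
labels = bio_labels(tokens, entities)
```

A CRF library would take one `features` list and one `labels` list per sentence as training input; the relationship-extraction step would then build its own features over pairs of recognized entities.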

Historically, the framework grew out of two joint theses at Rostlab at Technische Universität München with a focus on bioinformatics / BioNLP. The first goal was to extract natural-language (NL) mutation mentions. Soon after, another master's thesis used and generalized the framework to extract relationships between transcription factors (TFs) and the genes or gene products they interact with. The nalaf framework is planned to be used in other BioNLP tasks at Rostlab.

Because of the original BioNLP focus, some parts of the code are tailored to the biomedical domain. Efforts to generalize all parts are almost complete. Development is not active, however, and code maintenance is not guaranteed.

Current maintainer: Juan Miguel Cejuela (@juanmirocks).

Pipeline diagram (an editable version is available on Lucidchart; requires login)

Install

Requires Python 3.6 or later (^3.6)

From PyPI

pip3 install nalaf
python3 -m nalaf.download_data

From source

git clone https://github.com/Rostlab/nalaf.git
cd nalaf
poetry shell
poetry install  # or run `poetry update` if you really want to update the dependencies' versions
python3 -m nalaf.download_data

Developing

See wiki

Test

nosetests

Run Examples

Run example_annotate.py for a simple example of annotation with a pre-trained NER model for protein name extraction:

  • python3 example_annotate.py -p 15878741 12625412
  • python3 example_annotate.py -s "This is c.A1003G an example" # see issue #159
  • python3 example_annotate.py -d resources/example.txt # see issue #159