
Liebeck / spacy-iwnlp

Licence: MIT license
German lemmatization with IWNLP as extension for spaCy

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to spacy-iwnlp

spacy conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
Stars: ✭ 60 (+172.73%)
Mutual labels:  spacy, spacy-pipeline, spacy-extension
spacymoji
💙 Emoji handling and meta data for spaCy with custom extension attributes
Stars: ✭ 174 (+690.91%)
Mutual labels:  spacy, spacy-pipeline, spacy-extension
Neuralcoref
✨Fast Coreference Resolution in spaCy with Neural Networks
Stars: ✭ 2,453 (+11050%)
Mutual labels:  spacy, spacy-pipeline, spacy-extension
extractacy
Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)
Stars: ✭ 47 (+113.64%)
Mutual labels:  spacy, spacy-pipeline, spacy-extension
spaczz
Fuzzy matching and more functionality for spaCy.
Stars: ✭ 215 (+877.27%)
Mutual labels:  spacy, spacy-extension
spacy hunspell
✏️ Hunspell extension for spaCy 2.0.
Stars: ✭ 94 (+327.27%)
Mutual labels:  spacy, spacy-extension
contextualSpellCheck
✔️Contextual word checker for better suggestions
Stars: ✭ 274 (+1145.45%)
Mutual labels:  spacy, spacy-extension
augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.
Stars: ✭ 101 (+359.09%)
Mutual labels:  spacy, spacy-extension
spacy-langdetect
A fully customisable language detection pipeline for spaCy
Stars: ✭ 86 (+290.91%)
Mutual labels:  spacy, spacy-extension
amrlib
A python library that makes AMR parsing, generation and visualization simple.
Stars: ✭ 107 (+386.36%)
Mutual labels:  spacy, spacy-extension
hmrb
Python Rule Processing Engine 🏺
Stars: ✭ 65 (+195.45%)
Mutual labels:  spacy, spacy-extension
Pytextrank
Python implementation of TextRank for phrase extraction and summarization of text documents
Stars: ✭ 1,675 (+7513.64%)
Mutual labels:  spacy, spacy-extension
Semantic-Textual-Similarity
Natural Language Processing using NLTK and Spacy
Stars: ✭ 30 (+36.36%)
Mutual labels:  spacy
prodigy-scratch
Prodigy thing(z)
Stars: ✭ 13 (-40.91%)
Mutual labels:  spacy
NLP Quickbook
NLP in Python with Deep Learning
Stars: ✭ 516 (+2245.45%)
Mutual labels:  spacy
DrFAQ
DrFAQ is a plug-and-play question answering NLP chatbot that can be generally applied to any organisation's text corpora.
Stars: ✭ 29 (+31.82%)
Mutual labels:  spacy
spacy-sentence-bert
Sentence transformers models for SpaCy
Stars: ✭ 88 (+300%)
Mutual labels:  spacy
Holmes Extractor
Information extraction from English and German texts based on predicate logic
Stars: ✭ 233 (+959.09%)
Mutual labels:  spacy
Quora QuestionPairs DL
Kaggle Competition: Using deep learning to solve quora's question pairs problem
Stars: ✭ 54 (+145.45%)
Mutual labels:  spacy
topic modelling financial news
Topic modelling on financial news with Natural Language Processing
Stars: ✭ 51 (+131.82%)
Mutual labels:  spacy

spacy-iwnlp


This package uses the spaCy 3.0 extension mechanism to add IWNLP-py as a German lemmatizer directly to your spaCy pipeline.

Please report bugs in spacy-iwnlp as an issue in the IWNLP-py repository.

Usage

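import spacy
# Importing spacy_iwnlp makes the 'iwnlp' pipeline component available to spaCy.
from spacy_iwnlp import spaCyIWNLP

# Load a German pipeline and append the IWNLP lemmatizer to it.
nlp = spacy.load('de_core_news_sm')
# lemmatizer_path points to the unzipped IWNLP dump (see Installation).
nlp.add_pipe('iwnlp', config={'lemmatizer_path': 'data/IWNLP.Lemmatizer_20181001.json'})
doc = nlp('Wir mögen Fußballspiele mit ausgedehnten Verlängerungen.')
# Print each token's part-of-speech tag next to its IWNLP lemma candidates.
for token in doc:
    print('POS: {}\tIWNLP:{}'.format(token.pos_, token._.iwnlp_lemmas))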
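
The loop above prints the raw value of token._.iwnlp_lemmas. A minimal sketch of how that value could be reduced to one lemma per token follows. It assumes the attribute holds either a list of candidate strings or None when IWNLP has no entry. The helper names pick_lemma and lemmatize_doc are hypothetical and not part of spacy-iwnlp.

```python
from typing import List, Optional


def pick_lemma(iwnlp_lemmas: Optional[List[str]], fallback: str) -> str:
    """Return the first IWNLP candidate, or `fallback` if there is none.

    Assumption: IWNLP yields a list of candidate lemmas or None.
    """
    if iwnlp_lemmas:
        return iwnlp_lemmas[0]
    return fallback


def lemmatize_doc(doc) -> List[str]:
    """One lemma per token, preferring IWNLP over spaCy's own lemma_."""
    return [pick_lemma(token._.iwnlp_lemmas, token.lemma_) for token in doc]
```

Taking the first candidate is only one possible policy. If several candidates are returned, it may be better to keep them all and disambiguate later.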

Installation

  1. Use pip to install spacy-iwnlp:
pip install spacy-iwnlp
  2. Download the latest processed IWNLP dump from http://lager.cs.uni-duesseldorf.de/NLP/IWNLP/IWNLP.Lemmatizer_20181001.zip and unzip it. Pass the path of the extracted JSON file as lemmatizer_path (see Usage).
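
The unzip step can also be scripted. The sketch below uses only the Python standard library and assumes the archive has already been downloaded. The function name unpack_dump and the default target directory data are illustrative choices, and the internal layout of the archive is not verified here.

```python
import zipfile
from pathlib import Path
from typing import List


def unpack_dump(zip_path: str, target_dir: str = "data") -> List[Path]:
    """Extract a downloaded IWNLP dump and return the JSON files it contained."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
        names = archive.namelist()
    # Keep only JSON members; one of these is the file to use as lemmatizer_path.
    return [target / name for name in names if name.endswith(".json")]
```

For example, unpack_dump('IWNLP.Lemmatizer_20181001.zip') would extract into data/ and return the paths of any JSON files found in the archive.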

Local development

Use develop.py to extend the functionality.

Update PyPI package

python setup.py sdist bdist_wheel 
python -m twine upload dist/PACKAGENAME-VERSION.tar.gz

Citation

Please include the following BibTeX if you use IWNLP in your work:

@InProceedings{liebeck-conrad:2015:ACL-IJCNLP,
  author    = {Liebeck, Matthias  and  Conrad, Stefan},
  title     = {{IWNLP: Inverse Wiktionary for Natural Language Processing}},
  booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
  year      = {2015},
  publisher = {Association for Computational Linguistics},
  pages     = {414--418},
  url       = {http://www.aclweb.org/anthology/P15-2068}
}