All Projects → proycon → foliapy

proycon / foliapy

Licence: GPL-3.0 License
An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to foliapy

folia
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (+330.77%)
Mutual labels:  xml, computational-linguistics, folia
ucto
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …
Stars: ✭ 58 (+346.15%)
Mutual labels:  computational-linguistics, folia
frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+438.46%)
Mutual labels:  computational-linguistics, folia
granitic
Web/micro-services and IoC framework for Golang developers
Stars: ✭ 32 (+146.15%)
Mutual labels:  xml
onixcheck
ONIX validation library and commandline tool
Stars: ✭ 20 (+53.85%)
Mutual labels:  xml
xrechnung-schematron
Schematron rules for the German CIUS (XRechnung) of EN16931:2017
Stars: ✭ 19 (+46.15%)
Mutual labels:  xml
CISTEM
Stemmer for German
Stars: ✭ 33 (+153.85%)
Mutual labels:  computational-linguistics
spring-mvc3-javaconfig
A Java Spring MVC 3 app configured without XML. Also uses Servlet 3 API to bypass web.xml
Stars: ✭ 23 (+76.92%)
Mutual labels:  xml
xspec
XSpec is a unit test and behaviour-driven development (BDD) framework for XSLT, XQuery, and Schematron.
Stars: ✭ 91 (+600%)
Mutual labels:  xml
MultiStateToggleButton
Android's ToggleButton offers only two states, MultiStateToggleButton fixes this by offering as many states depending on the number of drawable resources passed in.
Stars: ✭ 20 (+53.85%)
Mutual labels:  xml
SAF-T-AO
Official XSD from the Government of Angola for use in SAF-T AO
Stars: ✭ 42 (+223.08%)
Mutual labels:  xml
sembei
🍘 単語分割を経由しない単語埋め込み 🍘
Stars: ✭ 14 (+7.69%)
Mutual labels:  computational-linguistics
Server-Help
💻 This VSTO Add-In allows the user to ping a list of servers and creates a file for Microsoft Remote Desktop Manager an Excel table. This is used for quickly determining which servers are offline in a list. It is written in 3 different versions as a VSTO Add-In in C# and VB.NET as well as a VBA Add-In.
Stars: ✭ 21 (+61.54%)
Mutual labels:  xml
fiet
Fiết is a RSS feed parser in Elixir, which focuses on extensibility, speed, and standard compliance
Stars: ✭ 23 (+76.92%)
Mutual labels:  xml
ph-commons
Java 1.8+ Library with tons of utility classes required in all projects
Stars: ✭ 23 (+76.92%)
Mutual labels:  xml
wikipron
Massively multilingual pronunciation mining
Stars: ✭ 167 (+1184.62%)
Mutual labels:  computational-linguistics
GDX-HTML
using HTML + CSS + JS to build libGDX UI!
Stars: ✭ 21 (+61.54%)
Mutual labels:  xml
medialytics
A basic, free tool that shows information about Plex Media Server content
Stars: ✭ 31 (+138.46%)
Mutual labels:  xml
asl
A C++ cross-platform library including JSON, XML, HTTP, Sockets, WebSockets, threads, processes, logs, file system, CSV, INI files, etc.
Stars: ✭ 44 (+238.46%)
Mutual labels:  xml
xml2json
Xml To Json
Stars: ✭ 32 (+146.15%)
Mutual labels:  xml

FoLiA Library for Python

https://github.com/proycon/foliapy/actions/workflows/foliapy.yml/badge.svg?branch=master Documentation Status http://applejack.science.ru.nl/lamabadge.php/foliapy Project Status: Active – The project has reached a stable, usable state and is being actively developed. Latest release in the Python Package Index

This Python module provides an extensive library for parsing, creating and otherwise processing documents in the Format for Linguistic Annotation, aka FoLiA. It has been in active development since 2010 and used by numerous Natural Language Processing (NLP) tools.

This library used to be part of PyNLPL (pynlpl.formats.folia), but has been migrated to this standalone library in spring 2019.

Acknowledgement

FoLiA development is funded in the scope of the larger CLARIN-NL project and its successor CLARIAH.

The following modules are available:

  • folia.main - The main library
  • folia.setdefinition - A module for FoLiA Set Definitions
  • folia.fql - Support for the FoLiA Query Language (FQL)

Installation

Download and install the latest stable version directly from the Python Package Index with pip install folia (or pip3 for Python 3 on most systems). For global installations prepend sudo.

Alternatively, clone this repository and run python setup.py install (or python3 setup.py install for Python 3 on most system. Prepend sudo for global installations.

The Python FoLiA library is also included in our LaMachine distribution.

Documentation

API Documentation and tutorials can be found here .

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].