proycon / Pynlpl

Licence: gpl-3.0

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
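To make the "n-grams and frequency lists" task concrete, here is a minimal sketch in plain Python. It deliberately uses only the standard library and does not reproduce PyNLPl's own API; the helper name ``ngrams`` and the sample sentence are assumptions made for illustration.

```python
from collections import Counter


def ngrams(tokens, n):
    """Yield every contiguous n-gram in `tokens` as a tuple (hypothetical helper)."""
    for i in range(len(tokens) - n + 1):
        yield tuple(tokens[i:i + n])


# Whitespace splitting stands in for a real tokeniser here.
tokens = "to be or not to be".split()

# A frequency list is simply a mapping from each n-gram to its count.
bigram_counts = Counter(ngrams(tokens, 2))

for gram, count in bigram_counts.most_common():
    print(" ".join(gram), count)
```

A ``Counter`` is used because it already behaves like a frequency list: it counts hashable items and can return them sorted by frequency.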

Labels

machine-learning library nlp natural-language-processing text-processing nlp-library linguistics

PyNLPl - Python Natural Language Processing Library

.. image:: https://travis-ci.org/proycon/pynlpl.svg?branch=master
    :target: https://travis-ci.org/proycon/pynlpl

.. image:: http://readthedocs.org/projects/pynlpl/badge/?version=latest
    :target: http://pynlpl.readthedocs.io/en/latest/?badge=latest
    :alt: Documentation Status

.. image:: http://applejack.science.ru.nl/lamabadge.php/pynlpl
    :target: http://applejack.science.ru.nl/languagemachines/

.. image:: https://zenodo.org/badge/759484.svg
    :target: https://zenodo.org/badge/latestdoi/759484

The library is divided into several packages and modules. It works on Python 2.7 as well as Python 3.

The following modules are available:

* ``pynlpl.datatypes`` - Extra datatypes (priority queues, patterns, tries)
* ``pynlpl.evaluation`` - Evaluation and experiment classes (parameter search, wrapped progressive sampling, class evaluation (precision/recall/F-score/AUC), sampler, confusion matrix, multithreaded experiment pool)
* ``pynlpl.formats.cgn`` - Module for parsing CGN (Corpus Gesproken Nederlands) part-of-speech tags
* ``pynlpl.formats.folia`` - Extensive library for reading and manipulating documents in `FoLiA <http://proycon.github.io/folia>`_ format (Format for Linguistic Annotation).
* ``pynlpl.formats.fql`` - Extensive library for the FoLiA Query Language (FQL), built on top of ``pynlpl.formats.folia``. FQL is currently documented `here <https://github.com/proycon/foliadocserve>`__.
* ``pynlpl.formats.cql`` - Parser for the Corpus Query Language (CQL), as also used by Corpus Workbench and Sketch Engine. Contains a converter to FQL.
* ``pynlpl.formats.giza`` - Module for reading GIZA++ word alignment data
* ``pynlpl.formats.moses`` - Module for reading Moses phrase-translation tables.
* ``pynlpl.formats.sonar`` - Largely obsolete module for pre-releases of the SoNaR corpus; use ``pynlpl.formats.folia`` instead.
* ``pynlpl.formats.timbl`` - Module for reading Timbl output (consider using `python-timbl <https://github.com/proycon/python-timbl>`_ instead though)
* ``pynlpl.lm.lm`` - Module for simple language models, plus a reader for ARPA language model data (the format used by SRILM).
* ``pynlpl.search`` - Various search algorithms (breadth-first, depth-first, beam search, hill climbing, A*, and variants of each)
* ``pynlpl.statistics`` - Frequency lists, Levenshtein distance, common statistics and information theory functions
* ``pynlpl.textprocessors`` - Simple tokeniser, n-gram extraction
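
As an illustration of one item in the list above, the following is a minimal standard-library sketch of Levenshtein (edit) distance. It is not PyNLPl's implementation and its signature may differ from the one in ``pynlpl.statistics``; the function name ``levenshtein`` is an assumption chosen for this example.

```python
def levenshtein(a, b):
    """Return the minimum number of insertions, deletions and substitutions
    needed to turn sequence `a` into sequence `b` (illustrative sketch)."""
    # previous[j] holds the distance between the processed prefix of `a`
    # and the first j items of `b`.
    previous = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty sequence
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


print(levenshtein("kitten", "sitting"))
```

Because the function only indexes and compares items, it works on lists of tokens as well as on strings, which is often what is wanted when comparing tokenised sentences.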

Installation

Download and install the latest stable version directly from the Python Package Index with ``pip install pynlpl`` (or ``pip3`` for Python 3 on most systems). For global installations, prepend ``sudo``.

Alternatively, clone this repository and run ``python setup.py install`` (or ``python3 setup.py install`` for Python 3 on most systems). Prepend ``sudo`` for global installations.

This software may also be found in certain Linux distributions, such as recent versions of Debian/Ubuntu, as ``python-pynlpl`` and ``python3-pynlpl``. PyNLPl is also included in our `LaMachine <http://proycon.github.io/LaMachine>`_ distribution.
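
Whichever installation route is used, a quick way to confirm that the package is importable is to ask Python's import machinery for it. This is a small sketch using only the standard library; the helper name ``is_installed`` is hypothetical and not part of PyNLPl.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module can be found on the import path."""
    return importlib.util.find_spec(module_name) is not None


if is_installed("pynlpl"):
    print("pynlpl is available")
else:
    print("pynlpl is not installed; try: pip install pynlpl")
```

The check only locates the package and does not import it, so it is safe to run in any environment.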

Documentation

API documentation can be found `here <http://pynlpl.readthedocs.io/en/latest/>`__.
