PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).

Stars: ✭ 426 (+1190.91%)

Mutual labels: linguistics

Rime Cantonese

Rime Cantonese input schema | 粵語拼音輸入方案

Stars: ✭ 173 (+424.24%)

Mutual labels: linguistics

mlconjug3

A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.

Stars: ✭ 47 (+42.42%)

Mutual labels: linguistics

Pycantonese

Cantonese Linguistics and NLP in Python

Stars: ✭ 147 (+345.45%)

Mutual labels: linguistics

mystem

CGo bindings to Yandex.Mystem

Stars: ✭ 28 (-15.15%)

Mutual labels: linguistics

Colibri Core

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.

Stars: ✭ 112 (+239.39%)

Mutual labels: linguistics

langua

A suite of language tools

Stars: ✭ 29 (-12.12%)

Mutual labels: linguistics

dev

PHOIBLE data and development.

Stars: ✭ 90 (+172.73%)

Mutual labels: linguistics

Wikipron

Massively multilingual pronunciation mining

Stars: ✭ 99 (+200%)

Mutual labels: linguistics

clinical nlp elastic

Clinical NLP Analysis with Elasticsearch and Kibana

Stars: ✭ 32 (-3.03%)

Mutual labels: linguistics

corpusexplorer2.0

Korpuslinguistik war noch nie so einfach...

Stars: ✭ 16 (-51.52%)

Mutual labels: linguistics

treebender

A HDPSG-inspired symbolic natural language parser written in Rust

Stars: ✭ 24 (-27.27%)

Mutual labels: linguistics

lambda-notebook

Lambda Notebook: Formal Semantics in Jupyter

Stars: ✭ 16 (-51.52%)

Mutual labels: linguistics

neural-net-linguistics

Papers about NN and linguistics

Stars: ✭ 14 (-57.58%)

Mutual labels: linguistics

lingtypology

R package for linguistic cartography and typological databases search

Stars: ✭ 47 (+42.42%)

Mutual labels: linguistics

Lexpredict Lexnlp

LexNLP by LexPredict

Stars: ✭ 439 (+1230.3%)

Mutual labels: linguistics

nyt-first-said

Tweets when words are published for the first time in the NYT

Stars: ✭ 222 (+572.73%)

Mutual labels: linguistics

lameta

The Metadata Editor for Transparent Archiving of language document materials

Stars: ✭ 18 (-45.45%)

Mutual labels: linguistics

pylangacq

Language Acquisition Research Tools

Stars: ✭ 33 (+0%)

Mutual labels: linguistics

TextGridTools

Read, write, and manipulate Praat TextGrid files with Python

Stars: ✭ 84 (+154.55%)

Mutual labels: linguistics

pfootprint

Political Discourse Analysis Using Pre-Trained Word Vectors.

Stars: ✭ 20 (-39.39%)

Mutual labels: linguistics

NatLang

NatLang is an English parser with an extensible grammar

Stars: ✭ 20 (-39.39%)

Mutual labels: linguistics

Awesome Linguistics

A curated list of anything remotely related to linguistics

Stars: ✭ 207 (+527.27%)

Mutual labels: linguistics

Nltk data

NLTK Data

Stars: ✭ 675 (+1945.45%)

Mutual labels: linguistics

Hangulize

Korean Alphabet Transcription

Stars: ✭ 184 (+457.58%)

Mutual labels: linguistics

LangPad

A word processor/dictionary/generally useful tool for linguistics.

Stars: ✭ 20 (-39.39%)

Mutual labels: linguistics

Prosodic

Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.

Stars: ✭ 162 (+390.91%)

Mutual labels: linguistics

OpenGNT

Open Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources

Stars: ✭ 55 (+66.67%)

Mutual labels: linguistics

Hangulize

Hangulize transcribes non-Korean words into Hangul

Stars: ✭ 152 (+360.61%)

Mutual labels: linguistics

libpalaso

Palaso Library: A set of .Net libraries useful for developers of Language Software.

Stars: ✭ 36 (+9.09%)

Mutual labels: linguistics

Ipa Dict

Monolingual wordlists with pronunciation information in IPA

Stars: ✭ 139 (+321.21%)

Mutual labels: linguistics

rsyntaxtree

Syntax tree generator made with Ruby and RMagic

Stars: ✭ 62 (+87.88%)

Mutual labels: linguistics

Ichiran

Linguistic tools for texts in Japanese language

Stars: ✭ 120 (+263.64%)

Mutual labels: linguistics

verbecc

Complete Conjugation of any Verb using Machine Learning for French, Spanish, Portuguese, Italian and Romanian

Stars: ✭ 45 (+36.36%)

Mutual labels: linguistics

Pyconll

A minimal, pure Python library to interface with CoNLL-U format files.

Stars: ✭ 104 (+215.15%)

Mutual labels: linguistics

duree

Durée: the longest book ever written.

Stars: ✭ 67 (+103.03%)

Mutual labels: linguistics

KoParadigm

KoParadigm: Korean Inflectional Paradigm Generator

Stars: ✭ 48 (+45.45%)

Mutual labels: linguistics

Awesome Sentiment Analysis

😀😄😂😭 A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤

Stars: ✭ 816 (+2372.73%)

Mutual labels: linguistics

Weixin public corpus

微信公众号语料库

Stars: ✭ 465 (+1309.09%)

Mutual labels: linguistics

spanish-corpora

Unannotated Spanish 3 Billion Words Corpora

Stars: ✭ 61 (+84.85%)

Mutual labels: linguistics

folia

FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…

Stars: ✭ 56 (+69.7%)

Mutual labels: linguistics

ngramr

R package to query the Google Ngram Viewer

Stars: ✭ 46 (+39.39%)

Mutual labels: linguistics

1-60 of 66 similar projects

›