PyconllA minimal, pure Python library to interface with CoNLL-U format files.
Stars: â 104 (+316%)
Elpisđ WIP software for creating speech recognition models.
Stars: â 101 (+304%)
WikipronMassively multilingual pronunciation mining
Stars: â 99 (+296%)
FlatFoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
Stars: â 93 (+272%)
TextannotationgraphsA modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.
Stars: â 73 (+192%)
BetaAn open source reimplementation of Benny Brodda's BETA in Python
Stars: â 65 (+160%)
Yesterday I LearnedBrainfarts are caused by the rupturing of the cerebral sphincter.
Stars: â 50 (+100%)
PsychopyFor running psychology and neuroscience experiments
Stars: â 1,020 (+3980%)
PhonemesJason Riggle's chart of phonological features in JSON format + extras
Stars: â 33 (+32%)
Awesome Sentiment Analysisđđđđ A curated list of Sentiment Analysis methods, implementations and misc. đĽđđąđ¤
Stars: â 816 (+3164%)
PynlplPyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: â 426 (+1604%)
rsyntaxtreeSyntax tree generator made with Ruby and RMagic
Stars: â 62 (+148%)
spanish-corporaUnannotated Spanish 3 Billion Words Corpora
Stars: â 61 (+144%)
treebenderA HDPSG-inspired symbolic natural language parser written in Rust
Stars: â 24 (-4%)