linguisticsdown: Easy Linguistics Document Writing with R Markdown
Stars: ✭ 24 (-88.24%)
expletives: Expletives vomiting library...
Stars: ✭ 12 (-94.12%)
folia: FoLiA (Format for Linguistic Annotation) is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (-72.55%)
Onset: A language evolution simulator, using realistic phonetic changes.
Stars: ✭ 30 (-85.29%)
Psychopy: For running psychology and neuroscience experiments
Stars: ✭ 1,020 (+400%)
eliza-rs: A Rust implementation of ELIZA, a natural language processing program developed by Joseph Weizenbaum in 1966.
Stars: ✭ 48 (-76.47%)
Pyconll: A minimal, pure Python library to interface with CoNLL-U format files.
Stars: ✭ 104 (-49.02%)
ngramr: R package to query the Google Ngram Viewer
Stars: ✭ 46 (-77.45%)
spanish-corpora: Unannotated Spanish 3 Billion Words Corpora
Stars: ✭ 61 (-70.1%)
mystem: CGo bindings to Yandex.Mystem
Stars: ✭ 28 (-86.27%)
proiel-treebank: Official releases of the PROIEL treebank of ancient Indo-European languages
Stars: ✭ 30 (-85.29%)
Yesterday I Learned: Brainfarts are caused by the rupturing of the cerebral sphincter.
Stars: ✭ 50 (-75.49%)
TextDatasetCleaner: 🔬 Cleaning datasets of junk (normalization, preprocessing)
Stars: ✭ 27 (-86.76%)
Ichiran: Linguistic tools for texts in Japanese language
Stars: ✭ 120 (-41.18%)
lingvo--Ner-ru: Named entity recognition (NER) in Russian texts
Stars: ✭ 38 (-81.37%)
mlconjug3: A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Stars: ✭ 47 (-76.96%)
Hangulize: Hangulize transcribes non-Korean words into Hangul
Stars: ✭ 152 (-25.49%)
langua: A suite of language tools
Stars: ✭ 29 (-85.78%)
Pynlpl: PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+108.82%)
Wikipron: Massively multilingual pronunciation mining
Stars: ✭ 99 (-51.47%)
event-embedding-multitask: *SEM 2018: Learning Distributed Event Representations with a Multi-Task Approach
Stars: ✭ 22 (-89.22%)
concepticon-data: The curation repository for the data behind Concepticon.
Stars: ✭ 25 (-87.75%)
OpenGNT: Open Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources
Stars: ✭ 55 (-73.04%)
nyt-first-said: Tweets when words are published for the first time in the NYT
Stars: ✭ 222 (+8.82%)
Beta: An open source reimplementation of Benny Brodda's BETA in Python
Stars: ✭ 65 (-68.14%)
duree: Durée: the longest book ever written.
Stars: ✭ 67 (-67.16%)
Corpuscrawler: Crawler for linguistic corpora
Stars: ✭ 127 (-37.75%)
Tossi: Chooses correct Korean particle morphs for arbitrary words.
Stars: ✭ 160 (-21.57%)
lameta: The Metadata Editor for Transparent Archiving of language document materials
Stars: ✭ 18 (-91.18%)
Phonemes: Jason Riggle's chart of phonological features in JSON format + extras
Stars: ✭ 33 (-83.82%)
NatLang: NatLang is an English parser with an extensible grammar
Stars: ✭ 20 (-90.2%)
Colibri Core: Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` which allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (-45.1%)
LangPad: A word processor/dictionary/generally useful tool for linguistics.
Stars: ✭ 20 (-90.2%)
libpalaso: Palaso Library: A set of .NET libraries useful for developers of language software.
Stars: ✭ 36 (-82.35%)
Rime Cantonese: Rime Cantonese input schema
Stars: ✭ 173 (-15.2%)
verbecc: Complete Conjugation of any Verb using Machine Learning for French, Spanish, Portuguese, Italian and Romanian
Stars: ✭ 45 (-77.94%)
KoParadigm: Korean Inflectional Paradigm Generator
Stars: ✭ 48 (-76.47%)
Elpis: 🙊 WIP software for creating speech recognition models.
Stars: ✭ 101 (-50.49%)
dev: PHOIBLE data and development.
Stars: ✭ 90 (-55.88%)
rsyntaxtree: Syntax tree generator made with Ruby and RMagick
Stars: ✭ 62 (-69.61%)
Pycantonese: Cantonese Linguistics and NLP in Python
Stars: ✭ 147 (-27.94%)
lambda-notebook: Lambda Notebook: Formal Semantics in Jupyter
Stars: ✭ 16 (-92.16%)
treebender: A HDPSG-inspired symbolic natural language parser written in Rust
Stars: ✭ 24 (-88.24%)
lingtypology: R package for linguistic cartography and typological databases search
Stars: ✭ 47 (-76.96%)
Flat: FoLiA Linguistic Annotation Tool. Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations; a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
Stars: ✭ 93 (-54.41%)
TextGridTools: Read, write, and manipulate Praat TextGrid files with Python
Stars: ✭ 84 (-58.82%)
Hangulize: Korean Alphabet Transcription
Stars: ✭ 184 (-9.8%)
Prosodic: A metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
Stars: ✭ 162 (-20.59%)
Ipa Dict: Monolingual wordlists with pronunciation information in IPA
Stars: ✭ 139 (-31.86%)
Textannotationgraphs: A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.
Stars: ✭ 73 (-64.22%)
wikipron: Massively multilingual pronunciation mining
Stars: ✭ 167 (-18.14%)