linguisticsdownEasy Linguistics Document Writing with R Markdown
Stars: ✭ 24 (-48.94%)
devPHOIBLE data and development.
Stars: ✭ 90 (+91.49%)
PycantoneseCantonese Linguistics and NLP in Python
Stars: ✭ 147 (+212.77%)
TextGridToolsRead, write, and manipulate Praat TextGrid files with Python
Stars: ✭ 84 (+78.72%)
Rime CantoneseRime Cantonese input schema | 粵語拼音輸入方案
Stars: ✭ 173 (+268.09%)
rsyntaxtreeSyntax tree generator made with Ruby and RMagic
Stars: ✭ 62 (+31.91%)
pylangacqLanguage Acquisition Research Tools
Stars: ✭ 33 (-29.79%)
dureeDurée: the longest book ever written.
Stars: ✭ 67 (+42.55%)
Colibri CoreColibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (+138.3%)
BetaAn open source reimplementation of Benny Brodda's BETA in Python
Stars: ✭ 65 (+38.3%)
OpencorporaA web-based engine for creating and annotating textual corpora
Stars: ✭ 204 (+334.04%)
PhonemesJason Riggle's chart of phonological features in JSON format + extras
Stars: ✭ 33 (-29.79%)
TossiChooses correct Korean particle morphs for arbitrary words.
Stars: ✭ 160 (+240.43%)
treebenderA HDPSG-inspired symbolic natural language parser written in Rust
Stars: ✭ 24 (-48.94%)
DoReMIFaSolTéléchargement des données sur le site de l'Insee
Stars: ✭ 25 (-46.81%)
OpenGNTOpen Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources
Stars: ✭ 55 (+17.02%)
CorpuscrawlerCrawler for linguistic corpora
Stars: ✭ 127 (+170.21%)
pfootprintPolitical Discourse Analysis Using Pre-Trained Word Vectors.
Stars: ✭ 20 (-57.45%)
lametaThe Metadata Editor for Transparent Archiving of language document materials
Stars: ✭ 18 (-61.7%)
Elpis🙊 WIP software for creating speech recognition models.
Stars: ✭ 101 (+114.89%)
NatLangNatLang is an English parser with an extensible grammar
Stars: ✭ 20 (-57.45%)
TextannotationgraphsA modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.
Stars: ✭ 73 (+55.32%)
Awesome LinguisticsA curated list of anything remotely related to linguistics
Stars: ✭ 207 (+340.43%)
Yesterday I LearnedBrainfarts are caused by the rupturing of the cerebral sphincter.
Stars: ✭ 50 (+6.38%)
atlascliPython API for the MongoDB Atlas API
Stars: ✭ 14 (-70.21%)
PsychopyFor running psychology and neuroscience experiments
Stars: ✭ 1,020 (+2070.21%)
HangulizeKorean Alphabet Transcription
Stars: ✭ 184 (+291.49%)
Awesome Sentiment Analysis😀😄😂😭 A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤
Stars: ✭ 816 (+1636.17%)
rdomainsClassifying the content of domains
Stars: ✭ 47 (+0%)
ProsodicProsodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
Stars: ✭ 162 (+244.68%)
PynlplPyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+806.38%)
hocassian-people-neo4jNoSQL可视化人脉图谱项目:非关系型数据库作为更符合人脑记忆的数据展现形式,在未来理论会成为应用界的主流,希望该项目能够成为推动HelpDesk、数据可视化、数据看板等IT基础能力持续降低上手门槛的起点。
Stars: ✭ 26 (-44.68%)
spanish-corporaUnannotated Spanish 3 Billion Words Corpora
Stars: ✭ 61 (+29.79%)
HangulizeHangulize transcribes non-Korean words into Hangul
Stars: ✭ 152 (+223.4%)
concepticon-dataThe curation repository for the data behind Concepticon.
Stars: ✭ 25 (-46.81%)
geodaDataData package for accessing GeoDa datasets using R
Stars: ✭ 15 (-68.09%)
wikipronMassively multilingual pronunciation mining
Stars: ✭ 167 (+255.32%)
Ipa DictMonolingual wordlists with pronunciation information in IPA
Stars: ✭ 139 (+195.74%)
mystemCGo bindings to Yandex.Mystem
Stars: ✭ 28 (-40.43%)
poesyPoetic processing, for Python.
Stars: ✭ 28 (-40.43%)
foliaFoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (+19.15%)
IchiranLinguistic tools for texts in Japanese language
Stars: ✭ 120 (+155.32%)
TextDatasetCleaner🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (-42.55%)
proiel-treebankOfficial releases of the PROIEL treebank of ancient Indo-European languages
Stars: ✭ 30 (-36.17%)
PyconllA minimal, pure Python library to interface with CoNLL-U format files.
Stars: ✭ 104 (+121.28%)
lingvo--Ner-ruNamed entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке
Stars: ✭ 38 (-19.15%)
WonderfulPolishLanguageThis is a repository created for the list of resources for learning and exploring Wonderful Polish language.
Stars: ✭ 31 (-34.04%)
eliza-rsA rust implementation of ELIZA - a natural language processing program developed by Joseph Weizenbaum in 1966.
Stars: ✭ 48 (+2.13%)
WikipronMassively multilingual pronunciation mining
Stars: ✭ 99 (+110.64%)
PLNmodelsA collection of Poisson lognormal models for multivariate count data analysis
Stars: ✭ 44 (-6.38%)
crminer⛔ ARCHIVED ⛔ Fetch 'Scholary' Full Text from 'Crossref'
Stars: ✭ 17 (-63.83%)
nyt-first-saidTweets when words are published for the first time in the NYT
Stars: ✭ 222 (+372.34%)
node-rest-api-starterThis repository is a template to avoid rewriting all the basic authentication code for REST API's built with Express.js, MongoDB.
Stars: ✭ 30 (-36.17%)