jacksonllee / pylangacq

Licence: MIT license

Language Acquisition Research Tools

Projects that are alternatives of or similar to pylangacq

linguistics problems

Natural language processing in examples and games

Stars: ✭ 23 (-30.3%)

Mutual labels: linguistics, computational-linguistics

folia

FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…

Stars: ✭ 56 (+69.7%)

Mutual labels: linguistics, computational-linguistics

wikipron

Massively multilingual pronunciation mining

Stars: ✭ 167 (+406.06%)

Mutual labels: linguistics, computational-linguistics

Wikipron

Massively multilingual pronunciation mining

Stars: ✭ 99 (+200%)

Mutual labels: linguistics

Pyconll

A minimal, pure Python library to interface with CoNLL-U format files.

Stars: ✭ 104 (+215.15%)

Mutual labels: linguistics

Prosodic

Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.

Stars: ✭ 162 (+390.91%)

Mutual labels: linguistics

pfootprint

Political Discourse Analysis Using Pre-Trained Word Vectors.

Stars: ✭ 20 (-39.39%)

Mutual labels: linguistics

Textannotationgraphs

A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.

Stars: ✭ 73 (+121.21%)

Mutual labels: linguistics

Awesome Linguistics

A curated list of anything remotely related to linguistics

Stars: ✭ 207 (+527.27%)

Mutual labels: linguistics

Hangulize

Hangulize transcribes non-Korean words into Hangul

Stars: ✭ 152 (+360.61%)

Mutual labels: linguistics

Pycantonese

Cantonese Linguistics and NLP in Python

Stars: ✭ 147 (+345.45%)

Mutual labels: linguistics

Colibri Core

Colibri Core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller``, which allows you to build, view, manipulate and query pattern models.

Stars: ✭ 112 (+239.39%)

Mutual labels: linguistics

Rime Cantonese

Rime Cantonese input schema | 粵語拼音輸入方案

Stars: ✭ 173 (+424.24%)

Mutual labels: linguistics

Elpis

🙊 WIP software for creating speech recognition models.

Stars: ✭ 101 (+206.06%)

Mutual labels: linguistics

esapp

An unsupervised Chinese word segmentation tool.

Stars: ✭ 13 (-60.61%)

Mutual labels: computational-linguistics

Flat

FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations; a wide variety of linguistic annotation types is supported through the FoLiA paradigm.

Stars: ✭ 93 (+181.82%)

Mutual labels: linguistics

Opencorpora

A web-based engine for creating and annotating textual corpora

Stars: ✭ 204 (+518.18%)

Mutual labels: linguistics

Ipa Dict

Monolingual wordlists with pronunciation information in IPA

Stars: ✭ 139 (+321.21%)

Mutual labels: linguistics

Corpuscrawler

Crawler for linguistic corpora

Stars: ✭ 127 (+284.85%)

Mutual labels: linguistics

Tossi

Chooses correct Korean particle morphs for arbitrary words.

Stars: ✭ 160 (+384.85%)

Mutual labels: linguistics

PyLangAcq: Language Acquisition Research in Python

Full documentation: https://pylangacq.org

PyLangAcq is a Python library for language acquisition research.

Easy access to CHILDES and other TalkBank datasets
Intuitive Python data structures for flexible data access and manipulation
Standard developmental measures readily available: mean length of utterance (MLU), type-token ratio (TTR), and index of productive syntax (IPSyn)
Direct support and powerful extensions possible for CHAT-formatted conversational datasets more generally
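
To make two of the measures listed above concrete, here is a minimal sketch in plain Python of what MLU (counted in words here) and TTR compute. It does not use PyLangAcq's API: the function names `mean_length_of_utterance` and `type_token_ratio` and the toy utterances are assumptions made up for illustration, and PyLangAcq's own implementations may differ in what they count as a word or morpheme.

```python
from typing import List

Utterance = List[str]


def mean_length_of_utterance(utterances: List[Utterance]) -> float:
    """Average number of words per utterance (0.0 if there are none)."""
    if not utterances:
        return 0.0
    return sum(len(u) for u in utterances) / len(utterances)


def type_token_ratio(utterances: List[Utterance]) -> float:
    """Number of distinct words divided by the total number of words."""
    tokens = [word for u in utterances for word in u]
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


# Toy, made-up data: three child utterances, already tokenized.
toy_utterances = [
    ["more", "cookie"],
    ["more", "juice", "please"],
    ["cookie"],
]

mlu = mean_length_of_utterance(toy_utterances)  # 6 words over 3 utterances
ttr = type_token_ratio(toy_utterances)          # 4 types over 6 tokens
print(f"MLU (words): {mlu:.2f}")
print(f"TTR: {ttr:.2f}")
```

For real CHAT transcripts, the library's built-in measures are the ones to use; the sketch only shows the arithmetic behind the names.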

Download and Install

To download and install the most recent version:

$ pip install --upgrade pylangacq
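
To check afterwards that the package is visible to your Python environment, one option is to query the installed distribution metadata with the standard library. This is a sketch, assuming Python 3.8 or later and that the distribution name matches the `pip` name `pylangacq`; the helper name `installed_version` is made up for illustration.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


found = installed_version("pylangacq")
if found is None:
    print("pylangacq is not installed in this environment")
else:
    print(f"pylangacq {found} is installed")
```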

Ready for more? Check out the Quickstart page.

Support

If you have found PyLangAcq useful and would like to offer support, buying me a coffee would go a long way!

How to Cite

PyLangAcq is authored and maintained by Jackson L. Lee.

Lee, Jackson L., Ross Burkholder, Gallagher B. Flinn, and Emily R. Coppess. 2016. Working with CHAT transcripts in Python. Technical report TR-2016-02, Department of Computer Science, University of Chicago.

@TechReport{lee-et-al-pylangacq:2016,
   Title       = {Working with CHAT transcripts in Python},
   Author      = {Lee, Jackson L. and Burkholder, Ross and Flinn, Gallagher B. and Coppess, Emily R.},
   Institution = {Department of Computer Science, University of Chicago},
   Year        = {2016},
   Number      = {TR-2016-02},
}

License

MIT License. Please see LICENSE.txt in the GitHub source code for details.

The included test data files come from CHILDES and are under a CC BY-NC-SA 3.0 license instead; please also see src/pylangacq/tests/README.md in the GitHub source code for details.

Changelog

Please see CHANGELOG.md.

Setting up a Development Environment

The latest code under development is available on GitHub at jacksonllee/pylangacq. To obtain this version for experimental features or for development:

$ git clone https://github.com/jacksonllee/pylangacq.git
$ cd pylangacq
$ pip install -r dev-requirements.txt
$ pip install -e .

To run tests and styling checks:

$ pytest -vv --doctest-modules --cov=pylangacq pylangacq docs/source
$ flake8 pylangacq
$ black --check pylangacq

To build the documentation website files:

$ python docs/source/build_docs.py
