All Projects → jacksonllee → pylangacq

jacksonllee / pylangacq

Licence: MIT license
Language Acquisition Research Tools

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to pylangacq

linguistics problems
Natural language processing in examples and games
Stars: ✭ 23 (-30.3%)
Mutual labels:  linguistics, computational-linguistics
folia
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (+69.7%)
Mutual labels:  linguistics, computational-linguistics
wikipron
Massively multilingual pronunciation mining
Stars: ✭ 167 (+406.06%)
Mutual labels:  linguistics, computational-linguistics
Wikipron
Massively multilingual pronunciation mining
Stars: ✭ 99 (+200%)
Mutual labels:  linguistics
Pyconll
A minimal, pure Python library to interface with CoNLL-U format files.
Stars: ✭ 104 (+215.15%)
Mutual labels:  linguistics
Prosodic
Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
Stars: ✭ 162 (+390.91%)
Mutual labels:  linguistics
pfootprint
Political Discourse Analysis Using Pre-Trained Word Vectors.
Stars: ✭ 20 (-39.39%)
Mutual labels:  linguistics
Textannotationgraphs
A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.
Stars: ✭ 73 (+121.21%)
Mutual labels:  linguistics
Awesome Linguistics
A curated list of anything remotely related to linguistics
Stars: ✭ 207 (+527.27%)
Mutual labels:  linguistics
Hangulize
Hangulize transcribes non-Korean words into Hangul
Stars: ✭ 152 (+360.61%)
Mutual labels:  linguistics
Pycantonese
Cantonese Linguistics and NLP in Python
Stars: ✭ 147 (+345.45%)
Mutual labels:  linguistics
Colibri Core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (+239.39%)
Mutual labels:  linguistics
Rime Cantonese
Rime Cantonese input schema | 粵語拼音輸入方案
Stars: ✭ 173 (+424.24%)
Mutual labels:  linguistics
Elpis
🙊 WIP software for creating speech recognition models.
Stars: ✭ 101 (+206.06%)
Mutual labels:  linguistics
esapp
An unsupervised Chinese word segmentation tool.
Stars: ✭ 13 (-60.61%)
Mutual labels:  computational-linguistics
Flat
FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
Stars: ✭ 93 (+181.82%)
Mutual labels:  linguistics
Opencorpora
A web-based engine for creating and annotating textual corpora
Stars: ✭ 204 (+518.18%)
Mutual labels:  linguistics
Ipa Dict
Monolingual wordlists with pronunciation information in IPA
Stars: ✭ 139 (+321.21%)
Mutual labels:  linguistics
Corpuscrawler
Crawler for linguistic corpora
Stars: ✭ 127 (+284.85%)
Mutual labels:  linguistics
Tossi
Chooses correct Korean particle morphs for arbitrary words.
Stars: ✭ 160 (+384.85%)
Mutual labels:  linguistics

PyLangAcq: Language Acquisition Research in Python

Full documentation: https://pylangacq.org


PyPI version Supported Python versions CircleCI Builds

PyLangAcq is a Python library for language acquisition research.

  • Easy access to CHILDES and other TalkBank datasets
  • Intuitive Python data structures for flexible data access and manipulation
  • Standard developmental measures readily available: Mean length of utterance (MLU), type-token ratio (TTR), and index of productive syntax (IPSyn)
  • Direct support and powerful extensions possible for CHAT-formatted conversational datasets more generally

Download and Install

To download and install the most recent version:

$ pip install --upgrade pylangacq

Ready for more? Check out the Quickstart page.

Support

If you have found PyLangAcq useful and would like to offer support, buying me a coffee would go a long way!

Links

How to Cite

PyLangAcq is authored and maintained by Jackson L. Lee.

Lee, Jackson L., Ross Burkholder, Gallagher B. Flinn, and Emily R. Coppess. 2016. Working with CHAT transcripts in Python. Technical report TR-2016-02, Department of Computer Science, University of Chicago.

@TechReport{lee-et-al-pylangacq:2016,
   Title       = {Working with CHAT transcripts in Python},
   Author      = {Lee, Jackson L. and Burkholder, Ross and Flinn, Gallagher B. and Coppess, Emily R.},
   Institution = {Department of Computer Science, University of Chicago},
   Year        = {2016},
   Number      = {TR-2016-02},
}

License

MIT License. Please see LICENSE.txt in the GitHub source code for details.

The test data files included come from CHILDES, and have a CC BY-NC-SA 3.0 license instead; please also see src/pylangacq/tests/README.md in the GitHub source code for details.

Changelog

Please see CHANGELOG.md.

Setting up a Development Environment

The latest code under development is available on Github at jacksonllee/pylangacq. To obtain this version for experimental features or for development:

$ git clone https://github.com/jacksonllee/pylangacq.git
$ cd pylangacq
$ pip install -r dev-requirements.txt
$ pip install -e .

To run tests and styling checks:

$ pytest -vv --doctest-modules --cov=pylangacq pylangacq docs/source
$ flake8 pylangacq
$ black --check pylangacq

To build the documentation website files:

$ python docs/source/build_docs.py
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].