All Projects → gooofy → Py Nltools

gooofy / Py Nltools

Licence: apache-2.0
A collection of basic python modules for spoken natural language processing

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Py Nltools

Spokestack Python
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+123.91%)
Mutual labels:  natural-language-processing, speech-recognition, tts
Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (+186.96%)
Mutual labels:  tokenizer, natural-language-processing
Kadot
Kadot, the unsupervised natural language processing library.
Stars: ✭ 108 (+134.78%)
Mutual labels:  tokenizer, natural-language-processing
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-54.35%)
Mutual labels:  tts, speech-recognition
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (+189.13%)
Mutual labels:  natural-language-processing, speech-recognition
Greynir
The greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (+2.17%)
Mutual labels:  tokenizer, natural-language-processing
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (+15.22%)
Mutual labels:  tts, speech-recognition
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+13.04%)
Mutual labels:  tts, speech-recognition
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1728.26%)
Mutual labels:  tts, speech-recognition
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+126.09%)
Mutual labels:  tokenizer, speech-recognition
Nonautoreggenprogress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+156.52%)
Mutual labels:  natural-language-processing, speech-recognition
Athena
an open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+1078.26%)
Mutual labels:  speech-recognition, tts
Transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+121078.26%)
Mutual labels:  natural-language-processing, speech-recognition
Thot
Thot toolkit for statistical machine translation
Stars: ✭ 53 (+15.22%)
Mutual labels:  tokenizer, natural-language-processing
Ios ml
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Stars: ✭ 1,409 (+2963.04%)
Mutual labels:  natural-language-processing, speech-recognition
Udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+247.83%)
Mutual labels:  tokenizer, natural-language-processing
Lingvo
Lingvo
Stars: ✭ 2,361 (+5032.61%)
Mutual labels:  speech-recognition, tts
Deep Learning Drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Stars: ✭ 9,717 (+21023.91%)
Mutual labels:  natural-language-processing, speech-recognition
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+93.48%)
Mutual labels:  tts, speech-recognition
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+852.17%)
Mutual labels:  tokenizer, natural-language-processing

py-nltools

A collection of abstraction layers and support functions that form the natural language processing foundation of the Zamia AI project:

  • phonetics: translation functions between various phonetic alphabets (IPA, X-SAMPA, X-ARPABET, ...)
  • tts: abstraction layer towards using eSpeak NG, MaryTTS, SVOX Pico TTS or a remote TTS server and sequitur g2p
  • asr: abstraction layer towards using kaldi-asr and pocketsphinx, models can be found here: http://goofy.zamia.org/voxforge/
  • sequiturclient: g2p using sequitur
  • pulseplayer: audio playback through pulseaudio
  • pulserecorder: audio recording through pulseaudio
  • tokenizer: english, french and german word tokenizers aimed at spoken language applications
  • threadpool: simple thread pool implementation
  • vad: Voice Activity Detection finite state machine based on webrtc VAD
  • macro_engine: Simple macro engine aimed at generating natural language expansions

I plan to add modules as I need them in the Zamia AI projects. Some modules like phonetics and tokenizer have some overlap with larger projects like NLTK or spaCy - my modules tend to be more hands-on and simple minded than these and therefore are in no way meant to replace them.

ifndef::imagesdir[:imagesdir: images]

ifndef::env-github[] [ditaa,"highlevel"] .... +-----------------------------------------------------------------------------------------------+ | nltools | | +-----------+ +-----------+ +------------+ +--------------+ | | | tokenizer | | phonetics | | threadpool | | macro_engine | | | +-----------+ +-----------+ +------------+ +--------------+ | | | | +-----------+ +-----------+ +-----------+ +-----------+ +-----------+ | | | tts | | asr | | vad | | g2p | | audio | | | +-----------+ +-----------+ +-----------+ +-----------+ +-----------+ | | | | | | | | +-----------------------------------------------------------------------------------------------+ | | | | |
+--------+---------+ +------+----+ | | | | | | | | | | | v v v v v v v v +------+ +--------+ +------+ +-------+ +-----------+ +--------+ +----------+ +------------+ | mary | | eSpeak | | pico | | kaldi | | cmusphinx | | webrtc | | sequitur | | pulseaudio | +------+ +--------+ +------+ +-------+ +-----------+ +--------+ +----------+ +------------+ .... endif::env-github[] ifdef::env-github[] image::highlevel.png[Highlevel Diagram] endif::env-github[]

Requirements


*Note*: probably incomplete.

* Python 2.7 
* for TTS one or more of:
  - MaryTTS, py-marytts
  - espeak-ng, py-espeak-ng
  - SVOX Pico TTS, py-picotts
* for ASR one or more of:
  - kaldi-asr 5.1, py-kaldi-asr
  - pocketsphinx
* sequitur
* pulseaudio
* webrtc

License
~~~~~~~

My own code is Apache-2.0 licensed unless otherwise noted in the script's copyright
headers.

Some scripts and files are based on works of others, in those cases it is my
intention to keep the original license intact. Please make sure to check the
copyright headers inside for more information.

Authors
~~~~~~~

Guenter Bartsch <[email protected]>
Paul Guyot <[email protected]>

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].