Rust re-implementation of OpenFST, a library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (+126.09%)

Mutual labels: tokenizer, speech-recognition

Nonautoreggenprogress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

Stars: ✭ 118 (+156.52%)

Mutual labels: natural-language-processing, speech-recognition

Athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+1078.26%)

Mutual labels: speech-recognition, tts

Transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Stars: ✭ 55,742 (+121078.26%)

Mutual labels: natural-language-processing, speech-recognition

Thot

Thot toolkit for statistical machine translation

Stars: ✭ 53 (+15.22%)

Mutual labels: tokenizer, natural-language-processing

Ios ml

List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.

Stars: ✭ 1,409 (+2963.04%)

Mutual labels: natural-language-processing, speech-recognition

Udpipe

R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit

Stars: ✭ 160 (+247.83%)

Mutual labels: tokenizer, natural-language-processing

Lingvo

Stars: ✭ 2,361 (+5032.61%)

Mutual labels: speech-recognition, tts

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

Stars: ✭ 9,717 (+21023.91%)

Mutual labels: natural-language-processing, speech-recognition

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+93.48%)

Mutual labels: tts, speech-recognition

Open Korean Text

Open Korean Text Processor - An Open-source Korean Text Processor

Stars: ✭ 438 (+852.17%)

Mutual labels: tokenizer, natural-language-processing

py-nltools

A collection of abstraction layers and support functions that form the natural language processing foundation of the Zamia AI project:

* phonetics: translation functions between various phonetic alphabets (IPA, X-SAMPA, X-ARPABET, ...)
* tts: abstraction layer over eSpeak NG, MaryTTS, SVOX Pico TTS or a remote TTS server, plus sequitur g2p
* asr: abstraction layer over kaldi-asr and pocketsphinx; models can be found here: http://goofy.zamia.org/voxforge/
* sequiturclient: g2p using sequitur
* pulseplayer: audio playback through pulseaudio
* pulserecorder: audio recording through pulseaudio
* tokenizer: English, French and German word tokenizers aimed at spoken language applications
* threadpool: simple thread pool implementation
* vad: Voice Activity Detection finite state machine based on webrtc VAD
* macro_engine: simple macro engine aimed at generating natural language expansions
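
To give a rough idea of what a voice activity detection finite state machine can look like, here is a minimal sketch. It is not the nltools `vad` API: the class name, the `min_voiced` and `max_silence` parameters and the event names are all made up for illustration, and the per-frame voiced/unvoiced decisions are assumed to come from some frame classifier such as webrtc VAD.

```python
# Hypothetical sketch of a VAD state machine; names and thresholds are
# illustrative assumptions, not taken from nltools.

IDLE = 'idle'
SPEECH = 'speech'


class SimpleVADStateMachine(object):
    """Turns per-frame voiced/unvoiced decisions into start/end events."""

    def __init__(self, min_voiced=3, max_silence=5):
        self.min_voiced = min_voiced    # voiced frames needed to enter SPEECH
        self.max_silence = max_silence  # silent frames needed to leave SPEECH
        self.state = IDLE
        self.voiced_run = 0
        self.silence_run = 0

    def process(self, is_voiced):
        """Feed one frame decision; return 'start', 'end' or None."""
        if self.state == IDLE:
            # Count consecutive voiced frames; any silent frame resets the run.
            self.voiced_run = self.voiced_run + 1 if is_voiced else 0
            if self.voiced_run >= self.min_voiced:
                self.state = SPEECH
                self.silence_run = 0
                return 'start'
        else:
            # Count consecutive silent frames; any voiced frame resets the run.
            self.silence_run = 0 if is_voiced else self.silence_run + 1
            if self.silence_run >= self.max_silence:
                self.state = IDLE
                self.voiced_run = 0
                return 'end'
        return None


def segment_events(frames, min_voiced=3, max_silence=5):
    """Run the state machine over a sequence of booleans, collecting events."""
    fsm = SimpleVADStateMachine(min_voiced, max_silence)
    events = []
    for index, is_voiced in enumerate(frames):
        event = fsm.process(bool(is_voiced))
        if event is not None:
            events.append((index, event))
    return events
```

Requiring several consecutive frames before switching state is a common way to suppress short clicks and brief pauses; the thresholds here are arbitrary and would need tuning against real audio.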

I plan to add modules as I need them in the Zamia AI projects. Some modules, like phonetics and tokenizer, overlap with larger projects such as NLTK or spaCy. My modules tend to be more hands-on and simple-minded than these and are in no way meant to replace them.
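
As an illustration of the hands-on style meant here, a deliberately simple word tokenizer for spoken language input could look like the sketch below. This is not the nltools `tokenizer` implementation: the function name `simple_tokenize` and the regular expression are assumptions made for this example. A real tokenizer for speech applications would likely also need to handle numbers and abbreviations, which this sketch does not attempt.

```python
import re

# Hypothetical sketch, not the nltools tokenizer.
# Anything that is not a word character, an apostrophe or a space
# is treated as a separator.
_SEPARATORS = re.compile(r"[^\w' ]+", re.UNICODE)


def simple_tokenize(text):
    """Lowercase the text, drop punctuation and split it on whitespace."""
    text = text.lower()
    text = _SEPARATORS.sub(' ', text)
    return text.split()
```

Keeping apostrophes inside tokens preserves contractions such as "it's", which usually matter when the tokens have to line up with a pronunciation dictionary.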

Requirements
~~~~~~~~~~~~

*Note*: this list is probably incomplete.

* Python 2.7 
* for TTS one or more of:
  - MaryTTS, py-marytts
  - espeak-ng, py-espeak-ng
  - SVOX Pico TTS, py-picotts
* for ASR one or more of:
  - kaldi-asr 5.1, py-kaldi-asr
  - pocketsphinx
* sequitur
* pulseaudio
* webrtc

License
~~~~~~~

My own code is Apache-2.0 licensed unless otherwise noted in the script's copyright
headers.

Some scripts and files are based on the work of others; in those cases it is my
intention to keep the original license intact. Please make sure to check the
copyright headers inside for more information.

Authors
~~~~~~~

Guenter Bartsch <[email protected]>
Paul Guyot <[email protected]>
