Python wrapper for Espeak and Mbrola, for simple local TTS
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structures and then uses hidden markov models to obtain alignment within segments. The final alignment is concatenation of time stamps of lyrics within the segments for each song.
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
Software Automatic Mouth - Tiny Speech Synthesizer