Speech signal processing and classificationFront-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].
Stars: ✭ 155 (+604.55%)
Mutual labels: natural-language-processing, speech-processing
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+436.36%)
Mutual labels: natural-language-processing, speech-processing
Named Entity Recognitionname entity recognition with recurrent neural network(RNN) in tensorflow
Stars: ✭ 20 (-9.09%)
Mutual labels: natural-language-processing
LanguageShared repository for open-sourced projects from the Google AI Language team.
Stars: ✭ 860 (+3809.09%)
Mutual labels: natural-language-processing
CiffCornell Instruction Following Framework
Stars: ✭ 23 (+4.55%)
Mutual labels: natural-language-processing
Twitter Bot👻 Markov chain-based Japanese twitter bot
Stars: ✭ 12 (-45.45%)
Mutual labels: natural-language-processing
Kts linguisticsSpellcheck, phonetics, text processing and more
Stars: ✭ 18 (-18.18%)
Mutual labels: natural-language-processing
Mongolian BertPre-trained Mongolian BERT models
Stars: ✭ 21 (-4.55%)
Mutual labels: natural-language-processing
PkePython Keyphrase Extraction module
Stars: ✭ 855 (+3786.36%)
Mutual labels: natural-language-processing
SpagoSelf-contained Machine Learning and Natural Language Processing library in Go
Stars: ✭ 854 (+3781.82%)
Mutual labels: natural-language-processing
Drl4nlp.scratchpadNotes on Deep Reinforcement Learning for Natural Language Processing papers
Stars: ✭ 26 (+18.18%)
Mutual labels: natural-language-processing
Nlp tutorialsOverview of NLP tools and techniques in python
Stars: ✭ 14 (-36.36%)
Mutual labels: natural-language-processing
Spacy Transformers🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Stars: ✭ 919 (+4077.27%)
Mutual labels: natural-language-processing
NrafundedThe NRA is paying off Congress at the expense of lives.
Stars: ✭ 20 (-9.09%)
Mutual labels: politics
Nlp With RubyCurated List: Practical Natural Language Processing done in Ruby
Stars: ✭ 907 (+4022.73%)
Mutual labels: natural-language-processing
Knowledge GraphsA collection of research on knowledge graphs
Stars: ✭ 845 (+3740.91%)
Mutual labels: natural-language-processing
Node Api.ai[DEPRECATED] Ultimate Node.JS SDK for api.ai
Stars: ✭ 12 (-45.45%)
Mutual labels: natural-language-processing
RexREx: Relation Extraction. Modernized re-write of the code in the master's thesis: "Relation Extraction using Distant Supervision, SVMs, and Probabalistic First-Order Logic"
Stars: ✭ 21 (-4.55%)
Mutual labels: natural-language-processing