supikiti / Pncc
Licence: mit
A implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40
Programming Languages
python
139335 projects - #7 most used programming language
Projects that are alternatives of or similar to Pncc
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (+32.5%)
Mutual labels: speech-recognition, speech-processing
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+2002.5%)
Mutual labels: speech-recognition, speech-processing
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+275%)
Mutual labels: speech-recognition, speech-processing
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+505%)
Mutual labels: speech-recognition, speech-processing
scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-57.5%)
Mutual labels: speech-recognition, speech-processing
UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (+135%)
Mutual labels: speech-recognition, speech-processing
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+77.5%)
Mutual labels: speech-recognition, speech-processing
Formant Analyzer
iOS application for finding formants in spoken sounds
Stars: ✭ 43 (+7.5%)
Mutual labels: speech-recognition, speech-processing
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+460%)
Mutual labels: speech-recognition, speech-processing
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-32.5%)
Mutual labels: speech-recognition, speech-processing
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (+265%)
Mutual labels: speech-recognition, speech-processing
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+1582.5%)
Mutual labels: speech-recognition, speech-processing
Nonautoreggenprogress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+195%)
Mutual labels: speech-recognition, speech-processing
torchsubband
Pytorch implementation of subband decomposition
Stars: ✭ 63 (+57.5%)
Mutual labels: speech-recognition, speech-processing
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (+17.5%)
Mutual labels: speech-recognition, speech-processing
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-55%)
Mutual labels: speech-recognition, speech-processing
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+412.5%)
Mutual labels: speech-recognition, speech-processing
Uspeech
Speech recognition toolkit for the arduino
Stars: ✭ 448 (+1020%)
Mutual labels: speech-recognition, speech-processing
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1810%)
Mutual labels: speech-recognition, speech-processing
PNCC implementation in Python
Based on
- http://www.cs.cmu.edu/~robust/Papers/OnlinePNCC_V25.pdf
- https://www.eurasip.org/Proceedings/Eusipco/Eusipco2015/papers/1570104069.pdf
put a slide about PNCC
Installation:
Clone and install requirements.
cd ~
git clone https://github.com/supikiti/PNCC.git
cd PNCC
pip install -r requirements.txt
PNCC features
If you want to change the PNCC parameters, the following parameters are supported.
def pncc(signal=audio_wave, n_fft=512, sr=16000, winlen=0.020, winstep=0.010,
n_mels=128, n_pncc=13):
Parameter | Description |
---|---|
signal | the audio signal from which to compute features. Should be an (N, 1) array |
n_fft | the FFT size. Default is 512. |
sr | the samplerate of the signal we are working with. |
winlen | the length of the analysis window in seconds. Default is 0.020s.(25 milliseconds) |
winstep | the step between successive windows in seconds. Default is 0.010.(10 milliseconds) |
n_mels | the number of filters in the filterbank, default 128. |
n_pncc | the number of cepstrum to return, default 13. |
Licence
Author
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].