All Projects → supikiti → Pncc

supikiti / Pncc

Licence: mit
A implementation of Power Normalized Cepstral Coefficients: PNCC

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Pncc

react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (+32.5%)
Mutual labels:  speech-recognition, speech-processing
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+2002.5%)
Mutual labels:  speech-recognition, speech-processing
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+275%)
Mutual labels:  speech-recognition, speech-processing
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+505%)
Mutual labels:  speech-recognition, speech-processing
scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-57.5%)
Mutual labels:  speech-recognition, speech-processing
UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (+135%)
Mutual labels:  speech-recognition, speech-processing
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+77.5%)
Mutual labels:  speech-recognition, speech-processing
Formant Analyzer
iOS application for finding formants in spoken sounds
Stars: ✭ 43 (+7.5%)
Mutual labels:  speech-recognition, speech-processing
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+460%)
Mutual labels:  speech-recognition, speech-processing
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-32.5%)
Mutual labels:  speech-recognition, speech-processing
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (+265%)
Mutual labels:  speech-recognition, speech-processing
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+1582.5%)
Mutual labels:  speech-recognition, speech-processing
Nonautoreggenprogress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+195%)
Mutual labels:  speech-recognition, speech-processing
torchsubband
Pytorch implementation of subband decomposition
Stars: ✭ 63 (+57.5%)
Mutual labels:  speech-recognition, speech-processing
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (+17.5%)
Mutual labels:  speech-recognition, speech-processing
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-55%)
Mutual labels:  speech-recognition, speech-processing
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+412.5%)
Mutual labels:  speech-recognition, speech-processing
Uspeech
Speech recognition toolkit for the arduino
Stars: ✭ 448 (+1020%)
Mutual labels:  speech-recognition, speech-processing
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1810%)
Mutual labels:  speech-recognition, speech-processing
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+1790%)
Mutual labels:  speech-recognition

PNCC implementation in Python

Based on

put a slide about PNCC

Installation:

Clone and install requirements.

cd ~
git clone https://github.com/supikiti/PNCC.git
cd PNCC
pip install -r requirements.txt

PNCC features

If you want to change the PNCC parameters, the following parameters are supported.

def pncc(signal=audio_wave, n_fft=512, sr=16000, winlen=0.020, winstep=0.010,
         n_mels=128, n_pncc=13):
Parameter Description
signal the audio signal from which to compute features. Should be an (N, 1) array
n_fft the FFT size. Default is 512.
sr the samplerate of the signal we are working with.
winlen the length of the analysis window in seconds. Default is 0.020s.(25 milliseconds)
winstep the step between successive windows in seconds. Default is 0.010.(10 milliseconds)
n_mels the number of filters in the filterbank, default 128.
n_pncc the number of cepstrum to return, default 13.

Licence

MIT

Author

supikiti

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].