All Projects → LimingShi → Bayesian-Pitch-Tracking-Using-Harmonic-model

LimingShi / Bayesian-Pitch-Tracking-Using-Harmonic-model

Licence: GPL-2.0 license
Pitch detection and pitch tracking, voicing unvoicing detection (VAD),基音检测

Programming Languages

matlab
3953 projects
C++
36643 projects - #6 most used programming language

Projects that are alternatives of or similar to Bayesian-Pitch-Tracking-Using-Harmonic-model

imogen
ultimate vocal harmonizer
Stars: ✭ 169 (+141.43%)
Mutual labels:  pitch-detection
Calculate-SNR-SDR
Script to calculate SNR and SDR using python
Stars: ✭ 76 (+8.57%)
Mutual labels:  speech-analysis
vamp-aubio-plugins
aubio plugins for Vamp
Stars: ✭ 38 (-45.71%)
Mutual labels:  onset-detection
AnotherBadBeatSaberClone
This is a discontinued but perhaps helpful VR project created during my Master's degree at FH Wedel.
Stars: ✭ 22 (-68.57%)
Mutual labels:  onset-detection
node-pitchfinder
A compilation of pitch detection algorithms for Javascript.
Stars: ✭ 25 (-64.29%)
Mutual labels:  pitch-detection
Arduino Pitch-Detector
Pitch Detection on Arduino using Autocorrelation
Stars: ✭ 21 (-70%)
Mutual labels:  pitch-detection
Arduino-FrequencyDetector
Fast audio frequency detector without fft for plain Arduino and Attiny85. Whistle switch example included.
Stars: ✭ 22 (-68.57%)
Mutual labels:  pitch-detection
my-voice-analysis
My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. It breaks utterances and detects syllable boundaries, fundamental frequency contours, and formants.
Stars: ✭ 164 (+134.29%)
Mutual labels:  speech-analysis
scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-75.71%)
Mutual labels:  speech-analysis
localized-narratives
Localized Narratives
Stars: ✭ 60 (-14.29%)
Mutual labels:  speech-analysis
wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars: ✭ 54 (-22.86%)
Mutual labels:  speech-analysis
magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
Stars: ✭ 76 (+8.57%)
Mutual labels:  speech-analysis
myprosody
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
Stars: ✭ 162 (+131.43%)
Mutual labels:  speech-analysis
YANGstraight source
Analytic signal-based source information analysis for YANGstraight and real-time interactive tools
Stars: ✭ 31 (-55.71%)
Mutual labels:  speech-analysis

Bayesian Pitch Tracking Using Harmonic model

A fast pitch tracking algorithm using the harmonic model.

The article for this work is available in PUBLISHED, PREPRINT

How to run

This project contains both the MATLAB and CPP code

For the MATLAB code:

Run run_white_example.m (white Gaussian noise) or run_colored_example.m (factory noise) in MATLAB in the BF0NLS_MATLAB folder

Run main.m in MATLAB in the BF0NLS_realtimeDemo_MATLAB folder

Examples

Figure 1: Pitch estimates for speech signals under 0 dB white Gaussian noise (Running time on my laptop is around 2.6 s).

Figure 2: Pitch estimates for speech signals under 0 dB factory noise (Running time on my laptop is around 9.3 s, and prewhitening is used).

Figure 3: Pitch estimates for music signals (vibrato flute sound) under 0 dB white Gaussian noise (Running time on my laptop is around 32.2 s).

How to cite

L. Shi, J. K. Nielsen, J. R. Jensen, M. A. Little, and M. G. Chris- tensen, “Robust bayesian pitch tracking based on the harmonic model,” IEEE/ACM Trans. Audio, Speech, and Lang. Process., vol. 27, no. 11, pp. 1737–1751, Nov 2019.

References

This fast computation of the likelihood function is based on the fast pitch estimation method proposed in

Fast fundamental frequency estimation: Making a statistically efficient estimator computationally efficient. Nielsen, Jesper Kjær; Jensen, Tobias Lindstrøm; Jensen, Jesper Rindom; Christensen, Mads Græsbøll; Jensen, Søren Holdt. In: Signal Processing, 135, 2017, pp. 188-197.

Bayesian Model Comparison With the g-Prior. Nielsen, Jesper Kjær; Christensen, Mads Græsbøll; Cemgil, Ali Taylan; Jensen, Søren Holdt. In: IEEE Transactions on Signal Processing, 62 (1), 2014, pp. 225-238.

where the source code is available in https://github.com/jkjaer/fastF0Nls

This noise PSD tracker used for prewhitening is based on the method proposed in

Gerkmann, T. & Hendriks, R. C. Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay, IEEE Trans Audio, Speech, Language Processing, 2012, 20, 1383-1393

where the source code is available in http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html

Questions

If you have any question regarding to the theory and code, feel free to contact

Liming Shi, Aalborg university, Email: [email protected]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].