PitchYinProbabilistic¶
streaming mode | Pitch category
Inputs¶
signal
(real) - the input mono audio signal
Outputs¶
pitch
(vector_real) - the output pitch estimations
voicedProbabilities
(vector_real) - the voiced probabilities
Parameters¶
frameSize
(integer ∈ (0, ∞), default = 2048) :the frame size of FFT
hopSize
(integer ∈ [1, ∞), default = 256) :the hop size with which the pitch is computed
lowRMSThreshold
(real ∈ (0, 1], default = 0.1) :the low RMS amplitude threshold
outputUnvoiced
(string ∈ {zero, abs, negative}, default = negative) :whether output unvoiced frame. zero: output non-voiced pitch as 0.; abs: output non-voiced pitch as absolute values; negative: output non-voiced pitch as negative values
preciseTime
(bool ∈ {true, false}, default = false) :use non-standard precise YIN timing (slow).
sampleRate
(real ∈ (0, ∞), default = 44100) :the sampling rate of the audio signal [Hz]
Description¶
This algorithm computes the pitch track of a mono audio signal using probabilistic Yin algorithm.
The input mono audio signal is preprocessed with a FrameCutter to segment into frameSize chunks with a overlap hopSize.
The pitch frequencies, probabilities and RMS values of the chunks are then calculated by PitchYinProbabilities algorithm. The results of all chunks are aggregated into a Essentia pool.
The pitch frequencies and probabilities are finally sent to PitchYinProbabilitiesHMM algorithm to get a smoothed pitch track and a voiced probability.
- References:
[1] M. Mauch and S. Dixon, “pYIN: A Fundamental Frequency Estimator Using Probabilistic Threshold Distributions,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014)Project Report, 2004
Source code¶
See also¶
FrameCutter (standard) FrameCutter (streaming) PitchYin (standard) PitchYin (streaming) PitchYinProbabilistic (standard) PitchYinProbabilities (standard) PitchYinProbabilities (streaming) PitchYinProbabilitiesHMM (standard) PitchYinProbabilitiesHMM (streaming) RMS (standard) RMS (streaming)