TensorflowInputFSDSINet¶
standard mode | Spectral category
Inputs¶
frame
(vector_real) - the audio frame
Outputs¶
bands
(vector_real) - the log-compressed mel bands
Description¶
This algorithm computes mel bands from an audio frame with the specific parametrization required by the FSD-SINet models.
- References:
[1] Fonseca, E., Ferraro, A., & Serra, X. (2021). Improving sound event classification by increasing shift invariance in convolutional neural networks. arXiv preprint arXiv:2107.00623. [2] https://github.com/edufonseca/shift_sec
Source code¶
See also¶
TensorflowInputFSDSINet (streaming)