TensorflowInputVGGish

streaming mode | Spectral category

Inputs

frame (vector_real) - the audio frame

Outputs

bands (vector_real) - the log-compressed mel bands

This algorithm computes, for each input audio frame, the log-compressed mel bands that VGGish-based models expect as input.
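As a rough illustration of what "log-compressed mel bands" means for a single frame, the following standard-library Python sketch windows a frame, takes its magnitude spectrum, applies a triangular mel filterbank, and log-compresses the result. It is not this algorithm's implementation. Every constant (16 kHz sample rate, 400-sample frames, 512-point FFT, 64 bands between 125 and 7500 Hz, a log offset of 0.01, a periodic Hann window) is an assumption borrowed from the publicly available VGGish reference feature extractor, and the function names are hypothetical.

```python
import cmath
import math

# All constants below are assumptions taken from the public VGGish
# reference feature extractor, not values confirmed by this page.
SAMPLE_RATE = 16000            # Hz
FRAME_SIZE = 400               # 25 ms at 16 kHz
FFT_SIZE = 512                 # frame is zero-padded to this length
NUM_BANDS = 64
F_MIN, F_MAX = 125.0, 7500.0   # Hz
LOG_OFFSET = 0.01              # keeps log() finite for silent frames


def hz_to_mel(freq_hz):
    """HTK-style mel scale."""
    return 1127.0 * math.log(1.0 + freq_hz / 700.0)


def magnitude_spectrum(frame):
    """Hann-window the frame and return |DFT| for bins 0..FFT_SIZE/2."""
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2.0 * math.pi * i / n))
                for i, x in enumerate(frame)]
    # Naive DFT; the zero-padded tail contributes nothing, so it is skipped.
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * i / FFT_SIZE)
                    for i, x in enumerate(windowed)))
            for k in range(FFT_SIZE // 2 + 1)]


def log_mel_bands(frame):
    """Hypothetical helper: one frame in, NUM_BANDS log-compressed bands out."""
    spectrum = magnitude_spectrum(frame)
    mel_lo, mel_hi = hz_to_mel(F_MIN), hz_to_mel(F_MAX)
    # NUM_BANDS triangles need NUM_BANDS + 2 edges, equally spaced in mel.
    edges = [mel_lo + (mel_hi - mel_lo) * i / (NUM_BANDS + 1)
             for i in range(NUM_BANDS + 2)]
    bin_mels = [hz_to_mel(k * SAMPLE_RATE / FFT_SIZE)
                for k in range(len(spectrum))]
    bands = []
    for b in range(NUM_BANDS):
        lo, mid, hi = edges[b], edges[b + 1], edges[b + 2]
        energy = 0.0
        for mag, m in zip(spectrum, bin_mels):
            # Triangular weight: rises from lo to mid, falls from mid to hi.
            weight = max(0.0, min((m - lo) / (mid - lo), (hi - m) / (hi - mid)))
            energy += weight * mag
        bands.append(math.log(energy + LOG_OFFSET))
    return bands


if __name__ == "__main__":
    # A 1 kHz sine frame as a toy input.
    tone = [math.sin(2.0 * math.pi * 1000.0 * i / SAMPLE_RATE)
            for i in range(FRAME_SIZE)]
    bands = log_mel_bands(tone)
    print(len(bands), max(range(NUM_BANDS), key=bands.__getitem__))
```

Under these assumptions a silent frame maps every band to log(0.01), which is why a small offset is added before taking the logarithm. A model would typically consume a stack of consecutive frames rather than a single one.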

References:

[1] Gemmeke, J. et al., Audio Set: An ontology and human-labeled dataset for audio events, ICASSP 2017.

[2] Hershey, S. et al., CNN Architectures for Large-Scale Audio Classification, ICASSP 2017.

C++ source code

C++ header file