Missing IDCT in MFCC computation of "vggishPreprocess" function
Hello all
I am trying to dig deeper into audio signal feature extraction and, in the literature, I read about Mel Frequency Cepstral Coefficients (MFCCs). To avoid a long and tedious explanation: MFCCs are obtained by taking the inverse cosine transform of the log spectrum of the audio signal, after warping the frequency axis to the mel scale. Assembling these coefficients frame by frame gives, as I understand it, the Mel spectrum.
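For reference, the last step of the textbook MFCC pipeline described above can be sketched as follows. This is only an illustrative sketch, written in plain Python rather than MATLAB so that it stays self-contained; it is not the code of "vggishPreprocess". The names `dct_ii`, `mfcc_from_mel_energies` and `mel_frame` are hypothetical, the input values are made up, and the sketch assumes the mel filter-bank energies of one frame are already available. It uses an unnormalized DCT-II, which is the cosine transform most MFCC implementations apply to the log-mel energies; other scalings exist.

```python
import math


def dct_ii(x):
    """Unnormalized DCT-II: X[k] = sum_n x[n] * cos(pi / N * (n + 0.5) * k)."""
    n_pts = len(x)
    return [
        sum(x[n] * math.cos(math.pi / n_pts * (n + 0.5) * k) for n in range(n_pts))
        for k in range(n_pts)
    ]


def mfcc_from_mel_energies(mel_energies, num_coeffs, floor=1e-10):
    """Log-compress one frame of mel-band energies, then decorrelate with a DCT-II.

    `floor` is an assumed small constant that avoids log(0) for empty bands.
    """
    log_mel = [math.log(max(e, floor)) for e in mel_energies]
    # Keeping only the first coefficients is the usual cepstral truncation.
    return dct_ii(log_mel)[:num_coeffs]


# Hypothetical mel-band energies for a single frame (made-up values).
mel_frame = [1.0, 2.0, 4.0, 8.0]

# Stopping here gives one column of a log-mel spectrogram.
log_mel_frame = [math.log(e) for e in mel_frame]

# Applying the cosine transform on top of it gives the cepstral coefficients.
mfcc_frame = mfcc_from_mel_energies(mel_frame, num_coeffs=3)
```

A pipeline that stops at `log_mel_frame` produces log-mel spectrum features; only the additional cosine transform turns them into cepstral coefficients such as `mfcc_frame`.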
I was very happy to notice that MATLAB includes the "vggish" function, which automatically obtains the Mel spectra from a signal. However, going through the "vggish" function code, I noticed that no inverse cosine transform is included. I was expecting to find the "idct" function, or an analogous one, but the procedure seems to stop after taking the log of the Fourier transform. Also, the magnitude of the latter is not squared to obtain the power spectrum.
Is this a different procedure for obtaining MFCCs? Are there any references that describe such a procedure?
Thanks in advance
Roberto