Gaussian Mixture Model for speech recognition
Afficher commentaires plus anciens
Hi all! I'm implementing a tool for speech recognition (command based).
My training data are 21 commands (7 different commands with 3 utterances for each). I did:
- the pre-processing phase (silence removal and end-point detection)
- the features extraction phase (with MFCC calculation).
So, for every utterance in my training set, i have a MFCC matrix with 12 columns (12=number of MFCC) and as much rows as the number of frames i divided the signal.
For the recognition phase, i was wondering to use the gmdistribution tool.
I read this article:
http://www.mathworks.it/company/newsletters/digest/2010/jan/word-recognition-system-matlab.html but i didn't understand this code line:
% model = gmdistribution.fit(MFCCtraindata,M);
What is the MFCCtraindata parameter?
Is it the MFCC matrix associated with every utterance?
For each command i have 3 utterances, so i have 3 different MFCC matrixes.
How can i do to create a unique gmm if, for every command, i will got 3 different gmm?
Any kind of help will be appreciated.
Thank you!!
Réponses (5)
Castalia
le 8 Mar 2013
0 votes
Rania Ziedan
le 22 Oct 2015
0 votes
i really need help in the same issue if you handled it could you help me thanks in advance
MUZITIANXINJIE
le 26 Juin 2016
0 votes
Yes,I want,but no one help me! I really need to use the deep learning tu classfy the voice recognition . thanks for your help.
yasir riaz
le 21 Déc 2016
0 votes
please help
hanieh rafiee
le 19 Fév 2017
0 votes
Hi Is the answer to your question receipts? Will you help me please?
Catégories
En savoir plus sur Speech Recognition dans Centre d'aide et File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!