MATLAB Answers

MATLAB code for language identification using audio signal

Harsh Kaushik on 3 Sep 2019
Commented: Dheeraj Singh on 9 Sep 2019
What is the MATLAB code for identifying a language from its audio signal?



Answers (1)

Dheeraj Singh on 9 Sep 2019
To identify a language with machine learning techniques, you will need a large dataset of words in that language. Also, the text should be in English letters, as most speech-to-text converters give their output in English letters.
For converting audio to text, you can use the following file exchange link:
After getting text you can refer to the following link for text classification using Deep Learning:
For more on speech and text classification refer to the following link:


Dheeraj Singh on 9 Sep 2019
I'm not sure if we can identify the language without extracting what is being spoken.
Walter Roberson on 9 Sep 2019
Sufficiently different languages have different statistics of sounds. It is possible for someone with experience to distinguish by subconsciously assessing the statistics even if they do not know the language being spoken.
If I hear a random language in passing, I can probably pick out the fact that it is German even though I have never studied German and it is not especially common where I am: I have encountered enough German in movies and so on over the years to know what it sounds like. But I would have a hard time picking out whether it was Low German or High German or Swiss German.
It takes me a while to figure out whether someone is speaking Swedish or Danish.
Italian has been pretty common in media over the years, even if only in stereotypes; I can easily pick it out from (say) Spanish. But I might "hear" Castilian Spanish as if it were a different language than Latin American Spanish.
I cannot distinguish the Russian-like languages by ear with the (lack of) experience I have. But I can tell Ukrainian from Russian without understanding either.
Dheeraj Singh on 9 Sep 2019
Yeah sure, we can do it using only the audio signal if we only want the language being spoken and not what is being spoken.
In this case we can use the Audio toolbox for feature extraction of the audio signal and then classify them using different Machine Learning techniques.
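As a rough illustration of that idea, here is a minimal sketch, not a tested language-identification system. It assumes Audio Toolbox (for mfcc) and Statistics and Machine Learning Toolbox (for cvpartition, fitcknn and predict) are installed. The two "languages" are synthetic filtered-noise clips used only as placeholders so the script runs on its own; makeClip and clipFeatures are hypothetical helper names, and in practice you would replace makeClip with audioread on your own labelled recordings. Summarising each clip by the mean and standard deviation of its MFCCs and using k-nearest neighbours are choices made for brevity here, not the only options.

```matlab
% Sketch: language ID from audio using MFCC statistics and a k-NN classifier.
% The clips below are SYNTHETIC placeholders, not real speech.
rng(0);                               % reproducible placeholder data
fs = 16000;                           % sample rate in Hz (assumed)
nPerClass = 20;                       % clips per placeholder "language"
labels = categorical([repmat("langA", nPerClass, 1); ...
                      repmat("langB", nPerClass, 1)]);

features = [];
for k = 1:2*nPerClass
    x = makeClip(fs, k <= nPerClass); % replace with audioread(yourFile)
    features = [features; clipFeatures(x, fs)]; %#ok<AGROW>
end

% Hold out 25% of the clips for testing.
cv = cvpartition(labels, 'HoldOut', 0.25);
trainIdx = training(cv);
testIdx = test(cv);

% Standardise the features and fit a 5-nearest-neighbour model.
mdl = fitcknn(features(trainIdx, :), labels(trainIdx), ...
    'NumNeighbors', 5, 'Standardize', true);
predicted = predict(mdl, features(testIdx, :));
accuracy = mean(predicted == labels(testIdx));
fprintf('Hold-out accuracy on placeholder data: %.2f\n', accuracy);

function f = clipFeatures(x, fs)
    % One feature row per clip: mean and standard deviation of each
    % MFCC coefficient across all frames.
    c = mfcc(x, fs);                  % frames-by-coefficients matrix
    f = [mean(c, 1), std(c, 0, 1)];
end

function x = makeClip(fs, isA)
    % Placeholder signal: one second of white noise with a different
    % spectral tilt per class, standing in for real recordings.
    n = randn(fs, 1);
    if isA
        x = filter(1, [1 -0.9], n);   % low-frequency emphasis
    else
        x = filter(1, [1 0.5], n);    % high-frequency emphasis
    end
    x = x / max(abs(x));              % normalise peak amplitude
end
```

With real data, the accuracy will depend heavily on how many speakers and recording conditions the training set covers, so the number printed for the placeholder clips says nothing about real languages.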
