Emotion Recognition from Speech Signal Using Mel-Frequency Cepstral Coefficients


9th International Conference on Electrical and Electronics Engineering (ELECO), Bursa, Turkey, 26 - 28 November 2015, pp.1254-1257

  • Publication Type: Conference Paper / Full Text
  • Volume:
  • City: Bursa
  • Country: Turkey
  • Page Numbers: pp.1254-1257
  • Karadeniz Technical University Affiliated: Yes


In this paper, mel-frequency cepstral coefficients (MFCCs) are investigated as features for the emotional content of speech signals. The features are extracted from spoken utterances. During extraction, the speech signal is divided into short frames, and each frame overlaps part of the previous one. The purpose of this overlap is to provide a smooth transition from one frame to the next and to prevent information loss at the end of each frame. The frame length and the scroll time (the shift between consecutive frames) are important for emotion recognition applications. We therefore investigated the effects of different frame lengths and scroll times on the classification success for four emotions: happy, angry, neutral, and sad. The emotions were classified using Support Vector Machine and k-Nearest Neighbors algorithms. Classification success was assessed with 10-fold cross-validation, and the maximum success rate obtained was 98.7%.
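
The overlapping framing step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name frame_signal, the 16 kHz sampling rate, the 25 ms frame length, and the 10 ms scroll time are all assumptions chosen for illustration, since the abstract does not state the values that were tested.

```python
def frame_signal(samples, frame_len, hop_len):
    """Split a sequence of samples into overlapping frames.

    frame_len: number of samples per frame.
    hop_len:   shift between consecutive frame starts (the "scroll time"
               expressed in samples). Consecutive frames share
               frame_len - hop_len samples.
    Trailing samples that do not fill a whole frame are dropped.
    """
    if frame_len <= 0 or hop_len <= 0:
        raise ValueError("frame_len and hop_len must be positive")
    if hop_len > frame_len:
        raise ValueError("hop_len larger than frame_len would skip samples")
    frames = []
    start = 0
    while start + frame_len <= len(samples):
        frames.append(list(samples[start:start + frame_len]))
        start += hop_len
    return frames


# Hypothetical settings (not taken from the paper).
sample_rate = 16000                      # samples per second
frame_len = sample_rate * 25 // 1000     # 25 ms frame
hop_len = sample_rate * 10 // 1000       # 10 ms scroll time
signal = [0.0] * sample_rate             # one second of placeholder silence
frames = frame_signal(signal, frame_len, hop_len)
```

In a full pipeline, MFCCs would be computed for each frame and the resulting feature vectors passed to the classifier. Varying frame_len and hop_len here corresponds to the frame length and scroll time comparison that the abstract describes.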