A new approach with score-level fusion for the classification of a speaker age and gender


Yucesoy E., NABIYEV V.

COMPUTERS & ELECTRICAL ENGINEERING, vol.53, pp.29-39, 2016 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 53
  • Publication Date: 2016
  • Doi Number: 10.1016/j.compeleceng.2016.06.002
  • Journal Name: COMPUTERS & ELECTRICAL ENGINEERING
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.29-39
  • Keywords: Age and gender recognition, Spectral features, Prosodic features, Score-level fusion, Gaussian Mixture Model, Support Vector Machines, RECOGNITION, EXTRACTION
  • Karadeniz Technical University Affiliated: Yes

Abstract

In this study a new approach for classifying speakers according to their age and genders is proposed. This approach is composed of score-level fusion of seven sub-systems. In this fused system, which provides improved performance in three classification categories (age, gender and age & gender), spectral and prosodic features extracted from short-duration phone conversations are used with Gaussian Mixture Model (GMM), Support Vector Machine (SVM) and GMM supervector-based SVM classifiers. Also, by examining individual and various combinations of each system, the effect of feature types and classification methods on performance is investigated. With the proposed system, classification success rates are obtained 90.4%, 54.1%, and 53.5% in gender, age and age & gender categories respectively. (C) 2016 Elsevier Ltd. All rights reserved.