• DocumentCode
    2485460
  • Title

    Gender classification in two Emotional Speech databases

  • Author

    Kotti, Margarita ; Kotropoulos, Constantine

  • Author_Institution
    Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki
  • fYear
    2008
  • fDate
    8-11 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Gender classification is a challenging problem, which finds applications in speaker indexing, speaker recognition, speaker diarization, annotation and retrieval of multimedia databases, voice synthesis, smart human-computer interaction, biometrics, social robots etc. Although it has been studied for more than thirty years, by no means it is a solved problem. Processing emotional speech in order to identify speakerpsilas gender makes the problem even more interesting. A large pool of 1379 features is created including 605 novel features. A branch and bound feature selection algorithm is applied to select a subset of 15 features among the 1379 originally extracted. Support vector machines with various kernels are tested as gender classifiers, when applied to two databases, namely: the Berlin database of Emotional Speech and the Danish Emotional Speech database. The reported classification results out perform those obtained by state-of-the-art techniques, since a perfect classification accuracy is obtained.
  • Keywords
    audio databases; emotion recognition; feature extraction; gender issues; image classification; speaker recognition; support vector machines; tree searching; branch-and-bound feature selection algorithm; emotional speech database; multimedia database annotation; multimedia database retrieval; smart human-computer interaction; speaker diarization; speaker gender classification; speaker gender identification; speaker indexing; speaker recognition; support vector machine; voice synthesis; Biometrics; Human robot interaction; Indexing; Information retrieval; Intelligent robots; Multimedia databases; Spatial databases; Speaker recognition; Speech processing; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
  • Conference_Location
    Tampa, FL
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-2174-9
  • Electronic_ISBN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2008.4761624
  • Filename
    4761624