• DocumentCode
    1696530
  • Title

    Co-training succeeds in Computational Paralinguistics

  • Author

    Zixing Zhang ; Jun Deng ; Schuller, Bjorn

  • Author_Institution
    Machine Intell. & Signal Process. Group, Tech. Univ. Munchen, München, Germany
  • fYear
    2013
  • Firstpage
    8505
  • Lastpage
    8509
  • Abstract
    Data sparsity is one of the major bottlenecks in the field of Computational Paralinguistics. Partially supervised learning approaches can help leverage this problem without the need of cost-intensive human labelling efforts. We thus investigate the feasibility of cotraining for exemplary paralinguistic speech analysis tasks spanning along the time-continuum: from short-term-related emotion to mid-term-related sleepiness and finally to long-term trait of gender. By dividing the acoustic feature space with two views as independent and sufficient as possible, the semi-supervised learning approach of co-training selects instances with high confidence scores in each view, and agglomerates them along with their predictions into initial training sets per iteration. Our experimental results on official Interspeech Computational Paralinguistics Challenge tasks effectively demonstrate co-training´s superiority over the baseline formed by single-view self-training, especially for the short- and medium-term tasks emotion and sleepiness recognition.
  • Keywords
    computational linguistics; iterative methods; learning (artificial intelligence); speech synthesis; acoustic feature space; co-training; co-training superiority; computational paralinguistics; data sparsity; exemplary paralinguistic speech analysis; iteration; medium-term tasks emotion; midterm-related sleepiness; partially supervised learning; semi-supervised learning approach; short-term tasks emotion; short-term-related emotion; single-view self-training; sleepiness recognition; Acoustics; Databases; Emotion recognition; Semisupervised learning; Sleep; Speech; Training; Co-Training; Computational Paralinguistics; Emotion; Gender; Semi-supervised Learning; Sleepiness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639325
  • Filename
    6639325