• DocumentCode
    2330241
  • Title

    Improvement of a speech recognizer for standardized medical assessment of children´s speech by integration of prior knowledge

  • Author

    Bocklet, Tobias ; Maier, Andreas ; Eysholdt, Ulrich ; Noth, Elmar

  • Author_Institution
    Dept. of Phoniatrics & Paediatric Audiology, Univ. Clinics Erlangen, Erlangen, Germany
  • fYear
    2010
  • fDate
    12-15 Dec. 2010
  • Firstpage
    259
  • Lastpage
    264
  • Abstract
    Speech recognition of children is a more difficult task than speech recognition of adults. This problem is amplified for children with articulation disorders like cleft lip and palate (CLP). In this work we improved our automatic speech recognition system by integrating prior knowledge. Prior knowledge focuses on two different aspects: A test-dependent language modeling and an age-dependent acoustic modeling. These two approaches are merged at the end to different test- and age-dependent recognizers. We evaluated our system on a dataset of 35 children with CLP. Significant improvements could be found on this dataset. With our baseline system we achieved a negative word accuarcy (WA) of -11.0%. By an extended language modeling we achieved 27.5%. The age-dependent recognition system gains a huge improvement and achieves aWA of 42.6%. With the significant improvements in WA it is possible to perform an automatic detection and identification of specific words. Thus, we took the first step towards a speech assessment on word and subword level.
  • Keywords
    diseases; medical computing; natural language processing; patient diagnosis; speech recognition; speech synthesis; age dependent acoustic modeling; articulation disorders; automatic speech recognition system; children speech recogniser; cleft lip and palate; language modeling; prior knowledge integration; standardized medical assessment; test dependent language modeling; vocal tract length normalized; Age-dependent acoustic Modeling; Children´s speech; Language Modeling; Pathologic Speech; Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2010 IEEE
  • Conference_Location
    Berkeley, CA
  • Print_ISBN
    978-1-4244-7904-7
  • Electronic_ISBN
    978-1-4244-7902-3
  • Type

    conf

  • DOI
    10.1109/SLT.2010.5700861
  • Filename
    5700861