• DocumentCode
    1329449
  • Title

    Reduced sets of subword units for continuous speech recognition of Portuguese

  • Author

    Cerqueira Bisp dos Santos, S. ; Alcaim, A.

  • Author_Institution
    CETUC-PUC, Rio de Janeiro, Brazil
  • Volume
    36
  • Issue
    6
  • fYear
    2000
  • fDate
    3/16/2000 12:00:00 AM
  • Firstpage
    586
  • Lastpage
    588
  • Abstract
    An investigation is presented concerning two sets of subword units for continuous speech recognition, which are based on the characteristics of the Portuguese language. In the first set, with 149 units, it is considered that syllables which contain an epenthetic vowel can be formed by two CV (consonant-vowel) units. In the second set, with 254 units, these syllables are regarded as CCV (consonant-consonant-vowel) units. The recognition performance obtained with the two unit sets are comparable if a bigram model is used at the unit level. In this case, the first set is definitely preferable because it enables the complexity to be significantly reduced
  • Keywords
    computational complexity; speech recognition; Portuguese language; bigram model; complexity reduction; consonant-consonant-vowel units; consonant-vowel units; continuous speech recognition; epenthetic vowel; recognition performance; subword units; syllables;
  • fLanguage
    English
  • Journal_Title
    Electronics Letters
  • Publisher
    iet
  • ISSN
    0013-5194
  • Type

    jour

  • DOI
    10.1049/el:20000446
  • Filename
    840185