• DocumentCode
    302322
  • Title
    Context-dependent units for vocabulary-independent Spanish speech recognition
  • Author
    Villarrubia, L. ; Gomez, L.H. ; Elvira, J.M. ; Torrecilla, J.C.
  • Author_Institution
    Speech Technol. Group, Telefonica Investigacion y Desarrollo, Madrid, Spain
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    451
  • Abstract
    One of the most important issues in large-vocabulary speech recognition systems is vocabulary independence. To achieve it, vocabulary words must be modeled by subword units. This paper studies the use of different subword units, comparing context-independent units with context-dependent units (biphones, triphones and artificially generated triphones). From the experimental results we conclude that context-dependent units are of great importance and that biphone units perform well compared to triphones for Spanish speech recognition, given the database limitations and the hardware restrictions of real systems. To account for the most important contextual effects in Castilian speech recognition, we propose a new and simple approach based on the use of left- and right-side biphones. Recognition results of 91.03% correct for vocabulary-dependent data (448 surnames) and 89.6% correct for vocabulary-independent data (955 surnames) are reported on real telephone speech and speaker-independent data, using only 467 senones and 4 mixtures per state.
  • Keywords
    speech recognition; Castilian speech recognition; Spain; artificially generated triphones; biphones; context-dependent units; context-independent and context-dependent unit; large-vocabulary speech recognition; subwords units; surnames; triphones; vocabulary words; vocabulary-independence; vocabulary-independent Spanish speech recognition; Acoustic testing; Chemical technology; Context modeling; Databases; Hardware; Natural languages; Speech recognition; Telephony; Training data; Vocabulary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.1996.541130
  • Filename
    541130