Reduced sets of subword units for continuous speech recognition of Portuguese

Author

Cerqueira Bisp dos Santos, S. ; Alcaim, A.

Author_Institution

CETUC-PUC, Rio de Janeiro, Brazil

Volume

Issue

fYear

2000

fDate

3/16/2000

Firstpage

586

Lastpage

588

Abstract

An investigation is presented concerning two sets of subword units for continuous speech recognition, both based on the characteristics of the Portuguese language. In the first set, with 149 units, syllables that contain an epenthetic vowel are treated as being formed by two CV (consonant-vowel) units. In the second set, with 254 units, these syllables are regarded as CCV (consonant-consonant-vowel) units. The recognition performance obtained with the two unit sets is comparable if a bigram model is used at the unit level. In this case, the first set is definitely preferable because it enables the complexity to be significantly reduced.
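
To give a rough sense of why the smaller inventory lowers complexity when a unit-level bigram is used, the sketch below counts the maximum number of distinct unit pairs for each set. Only the inventory sizes (149 and 254) come from the abstract. Measuring complexity by the size of a full bigram table is an assumption made here for illustration, and the names `UNIT_SET_SIZES` and `max_bigram_entries` are hypothetical; the letter itself may quantify complexity differently.

```python
# Inventory sizes as stated in the abstract.
UNIT_SET_SIZES = {"CV-based set": 149, "CCV-based set": 254}


def max_bigram_entries(num_units: int) -> int:
    """Upper bound on distinct (previous unit, current unit) pairs.

    Assumption: every unit may follow every other unit, so a full
    bigram table has num_units * num_units entries.
    """
    return num_units * num_units


bigram_table_sizes = {
    name: max_bigram_entries(size) for name, size in UNIT_SET_SIZES.items()
}
ratio = bigram_table_sizes["CCV-based set"] / bigram_table_sizes["CV-based set"]

for name, entries in bigram_table_sizes.items():
    print(f"{name}: up to {entries} unit bigrams")
print(f"CCV-based / CV-based ratio: {ratio:.2f}")
```

Under this assumption the 254-unit set needs a bigram table close to three times the size of the 149-unit one, which is consistent in direction with the complexity reduction the abstract reports.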

Keywords

computational complexity; speech recognition; Portuguese language; bigram model; complexity reduction; consonant-consonant-vowel units; consonant-vowel units; continuous speech recognition; epenthetic vowel; recognition performance; subword units; syllables

fLanguage

English

Journal_Title

Electronics Letters

Publisher

IET

ISSN

0013-5194

Type

jour

DOI

10.1049/el:20000446

Filename

840185

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=1329449