Title :
Automatic extraction of acoustic prototypes for large vocabulary speech recognition by using speaker-independent features
Author :
Colla, Anna Maria
Author_Institution :
Elettronica San Giorgio, Genova, Italy
Abstract :
A description is presented of AUTOSEGM, a novel method for automatic extraction of acoustic prototypes for large-vocabulary speech recognition systems based on diphone-like subword segments. AUTOSEGM compares favorably with previous methods for automatic extraction of diphone-like prototypes in that it does not require a set of training templates derived from another talker to bootstrap the new talker´s prototypes. AUTOSEGM exploits only general speaker-independent acoustic/phonetic knowledge, in the form of a general language model and standard acoustic parameters (cepstral derivative, smoothed energy contour). The core of AUTOSEGM is a self-segmentation procedure: the speech material is segmented into pseudosyllables, within which some occurrences of the diphone-like segments are located. A recognition performance of about 78% correct for the top candidate was achieved in a large-vocabulary isolated word recognition task
Keywords :
speech recognition; AUTOSEGM; acoustic prototypes; automatic extraction; diphone-like subword segments; isolated word recognition; large vocabulary speech recognition; pseudosyllables; self-segmentation procedure; speaker-independent features; Acoustic materials; Acoustic signal detection; Automatic speech recognition; Decoding; Loudspeakers; Maintenance; Natural languages; Prototypes; Speech recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location :
Glasgow
DOI :
10.1109/ICASSP.1989.266371