مرکز منطقه ای اطلاع رساني علوم و فناوري - Utterance normalization using vowel features in a spoken word recognition system for multiple speakers

DocumentCode :

2022630

Title :

Utterance normalization using vowel features in a spoken word recognition system for multiple speakers

Author :

OHNO, Sumio ; Hirose, Keikichi ; Fujasaki, H.

Author_Institution :

Dept. of Electron. Eng., Tokyo Univ., Japan

Volume :

fYear :

1993

fDate :

27-30 April 1993

Firstpage :

578

Abstract :

The authors propose a novel method of normalization based on linear transformation of acoustic features of input speech using only one isolated utterance each of the five vowels of Japanese by each individual speaker. Experiments on isolated word recognition combining the proposed normalization method and multiple-template DP matching showed a marked improvement in the recognition rate, especially for smaller numbers of templates per word. The proposed method gives consistently higher word recognition scores than the four-dimensional representation on the Karhunen-Loeve transformation, and also gives higher scores than the original 16-dimensional representation of filter-bank outputs, especially when the number of templates is small. Together with the fact that this method reduces the dimension of the feature vector by a factor of four, the results demonstrate the validity of the proposed method.<>

Keywords :

feature extraction; speech recognition; Japanese; linear transformation of acoustic features; multiple speakers; recognition rate; spoken word recognition system; templates; utterance normalisation; validity; vowel features;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on

Conference_Location :

Minneapolis, MN, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.1993.319373

Filename :

319373

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2022630