Regional Information Center for Science and Technology - Synthesis of speaker-adaptive word templates by concatenation of the monosyllabic sounds

DocumentCode :

2999822

Title :

Synthesis of speaker-adaptive word templates by concatenation of the monosyllabic sounds

Author :

Niimi, Yasuhisa ; Kobayashi, Yutaka

Author_Institution :

Kyoto Institute of Technology, Kyoto, Japan

Volume :

fYear :

1986

fDate :

April 1986

Firstpage :

2651

Lastpage :

2654

Abstract :

This paper describes a new method for synthesizing speaker-adaptive templates in an isolated word recognition system based on the time-warping algorithm. We prepare in advance a template for each word class by averaging utterances spoken by a few talkers, and use it as a guide to excerpt the subpattern of an isolated monosyllable that best matches a syllable embedded in the word. The excerpted subpatterns are concatenated and smoothed at their boundaries to create speaker-adaptive word templates. The method was evaluated through the recognition of 44 Japanese city names spoken by 19 male talkers. The templates synthesized for each talker gave an average recognition rate of 95.2%. This shows that the proposed procedure is promising, although the problem of compensating for coarticulation effects is left unsolved.
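
The procedure summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the local distance, the warping constraints, the search strategy, or the smoothing rule, so the Euclidean frame distance, the brute-force slice search, the length normalization, and the three-frame averaging at the joints are all assumptions made for illustration. The function names (`dtw`, `best_subpattern`, `concatenate_and_smooth`, `synthesize_template`) are hypothetical, and the syllable segments of the guide template are assumed to be given.

```python
import math

def frame_dist(x, y):
    """Euclidean distance between two feature frames (assumed local distance)."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(x, y)))

def dtw(a, b):
    """Accumulated dynamic time warping cost between frame sequences a and b."""
    inf = float("inf")
    n, m = len(a), len(b)
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = frame_dist(a[i - 1], b[j - 1])
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]

def best_subpattern(guide_syllable, monosyllable):
    """Return the contiguous slice of `monosyllable` closest to the guide segment.

    Brute-force search over all non-empty slices; the score is the DTW cost
    normalized by the summed lengths (an assumed normalization).
    Both inputs are assumed to be non-empty.
    """
    best = None
    for s in range(len(monosyllable)):
        for e in range(s + 1, len(monosyllable) + 1):
            sub = monosyllable[s:e]
            score = dtw(guide_syllable, sub) / (len(guide_syllable) + len(sub))
            if best is None or score < best[0]:
                best = (score, s, e)
    _, s, e = best
    return monosyllable[s:e]

def concatenate_and_smooth(segments):
    """Join segments and average the two frames around each joint with their neighbours."""
    segments = [seg for seg in segments if seg]  # ignore empty segments
    frames = [f for seg in segments for f in seg]
    joints, pos = [], 0
    for seg in segments[:-1]:
        pos += len(seg)
        joints.append(pos)  # index of the first frame after each joint
    out = [list(f) for f in frames]
    for j in joints:
        for k in (j - 1, j):
            lo, hi = max(0, k - 1), min(len(frames), k + 2)
            window = frames[lo:hi]  # always read from the unsmoothed frames
            out[k] = [sum(col) / len(window) for col in zip(*window)]
    return out

def synthesize_template(guide_syllables, monosyllables):
    """Build a word template from one talker's isolated monosyllables.

    guide_syllables[i] is the segment of the averaged guide template that
    corresponds to syllable i (segmentation assumed to be known).
    """
    picked = [best_subpattern(g, m) for g, m in zip(guide_syllables, monosyllables)]
    return concatenate_and_smooth(picked)
```

In this sketch each frame is a list of floats, so a one-dimensional toy input such as `[[0.0], [1.0], [2.0], [0.0]]` is enough to try it out. The exhaustive slice search is quadratic in the monosyllable length and is chosen for clarity only.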

Keywords :

Acoustic emission; Cities and towns; Computer science; Concatenated codes; Guidelines; Hidden Markov models; Isolation technology; Loudspeakers; Stochastic processes; Vocabulary;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.

Type :

conf

DOI :

10.1109/ICASSP.1986.1168681

Filename :

1168681

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2999822