DocumentCode :
384196
Title :
Constructing speech processing systems on universal phonetic codes accompanied with reference acoustic models
Author :
Tanaka, Kazuyo ; Kojima, Hiroaki ; Fujimura, Nahoko ; Itoh, Yoshiaki
Volume :
3
fYear :
2002
fDate :
2002
Firstpage :
728
Abstract :
This paper proposes a novel speech processing framework, where all of the speech data are once encoded into universal phonetic code (UPC) sequences and speech processing systems, such as speech recognition, retrieval, digesting, are constructed on this UPC domain. First of all, we introduce an IPA-based sub-phonetic segment (SPS) set as the UPC to deal with multilingual speech. In the UPC (SPS) domain, each UPC accompanies a reference acoustic model which is independent of real acoustic models used in the encoding process. Processing, such as recognition, in the UPC domain is conducted based on the distance between UPC sequences estimated by using the reference acoustic models. We confirm the proposed framework by constructing a speech recognition and a vocabulary-free speech retrieval system on the SPS domain. We show several experimental results on these systems, using Japanese and English speech data sets.
Keywords :
encoding; speech coding; speech recognition; encoding; reference acoustic models; speech processing systems; speech recognition; speech retrieval; subphonetic segment; universal phonetic code; Encoding; Indexing; Information retrieval; Information science; Information systems; Libraries; Speech coding; Speech processing; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2002. Proceedings. 16th International Conference on
ISSN :
1051-4651
Print_ISBN :
0-7695-1695-X
Type :
conf
DOI :
10.1109/ICPR.2002.1048079
Filename :
1048079
Link To Document :
بازگشت