DocumentCode
417216
Title
Yet another acoustic representation of speech sounds
Author
Minematsu, Nobuaki
Author_Institution
Graduate Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Japan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
This paper proposes yet another acoustic representation of speech sounds. In theory, the proposed speech modeling can remove both multiplicative and linear transformational distortion from speech. This means that speech sounds are represented without being affected by any static distortion inevitably introduced in the production, encoding, transmission, decoding, and hearing processes, such as differences in vocal tract length, gender, age, microphone, room, line, and auditory characteristics. The method acoustically models not individual phones but their entire system, focusing only on the acoustic interrelations among all the phones. Since the method provides no absolute acoustic properties of phones, it cannot recognize or synthesize even a single phone. Nevertheless, the proposed method is shown to apply effectively and reliably to pronunciation assessment, where pronunciation proficiency is estimated without directly using acoustic models of the individual phones in the matching.
Keywords
pattern matching; signal representation; speech processing; speech recognition; acoustic interrelation; acoustic representation; linear transformational distortion; matching; multiplicative distortion; phones; pronunciation assessment; speech modeling; speech sounds; static distortion; Acoustic distortion; Context modeling; Decoding; Encoding; Information science; Loudspeakers; Microphones; Speech processing; Speech recognition; Speech synthesis
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326053
Filename
1326053