DocumentCode
388452
Title
Segmentation of continuous speech by using multidimensional scaling techniques
Author
Charbonneau, Gérard R. ; Moussa, Tarek
Author_Institution
Université de Paris XI, Orsay Cedex, France
Volume
7
fYear
1982
fDate
30072
Firstpage
2012
Lastpage
2014
Abstract
Continuous speech is digitized at the rate of 20480 Hz. Power spectra are taken on 1024 blocks shifted from 256 to 256 samples. These spectra are divided into 25 channels chosen to discriminate at best peaks and valleys. For each spectrum i, and each channel j, the sum Pij of the components is computed. A multidimensional scaling analysis is done on the matrix Pij. This gives a 7-dimension space in which variations of the spectra versus time are represented by a moving spot. The main result is that a transition between two spoken sounds induces a significant variation on, at least one, and generally several axes. This can be used for segmenting and recognizing the continuous speech at the acoustical level. The first attempts have given modest results, but great improvements are expected soon.
Keywords
Acoustic signal processing; Attenuation; Low pass filters; Microphones; Multidimensional signal processing; Multidimensional systems; Sampling methods; Spectral analysis; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
Type
conf
DOI
10.1109/ICASSP.1982.1171834
Filename
1171834
Link To Document