Title :
Time-based clustering for phonetic segmentation
Author :
Eberman, Brian ; Goldenthal, William
Author_Institution :
Digital Equipment Corp., Cambridge, MA, USA
Abstract :
The paper describes an approach to speech segmentation. Unlike approaches based on spectral measurements, the algorithm iteratively clusters on an LPC representation of time waveform blocks. The algorithm uses a generalized maximum likelihood criterion for deciding when two neighboring pieces of the signal should be joined. The paper describes the algorithm and shows that it yields superior results when compared to metrics based on spectral or cepstral measurements
Keywords :
maximum likelihood estimation; speech recognition; LPC representation; generalized maximum likelihood criterion; iteratively clustered algorithm; metrics; neighboring signal piece joining; phonetic segmentation; speech segmentation; time waveform blocks; time-based clustering; Acceleration; Cepstral analysis; Clustering algorithms; Glass; Iterative algorithms; Linear predictive coding; Performance gain; Speech; Testing; Time measurement;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607829