DocumentCode
312199
Title
Time-based clustering for phonetic segmentation
Author
Eberman, Brian ; Goldenthal, William
Author_Institution
Digital Equipment Corp., Cambridge, MA, USA
Volume
2
fYear
1996
fDate
3-6 Oct 1996
Firstpage
1225
Abstract
The paper describes an approach to speech segmentation. Unlike approaches based on spectral measurements, the algorithm iteratively clusters on an LPC representation of time waveform blocks. The algorithm uses a generalized maximum likelihood criterion for deciding when two neighboring pieces of the signal should be joined. The paper describes the algorithm and shows that it yields superior results when compared to metrics based on spectral or cepstral measurements
Keywords
maximum likelihood estimation; speech recognition; LPC representation; generalized maximum likelihood criterion; iteratively clustered algorithm; metrics; neighboring signal piece joining; phonetic segmentation; speech segmentation; time waveform blocks; time-based clustering; Acceleration; Cepstral analysis; Clustering algorithms; Glass; Iterative algorithms; Linear predictive coding; Performance gain; Speech; Testing; Time measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607829
Filename
607829
Link To Document