DocumentCode
294645
Title
Automatic prosodic segmentation by F0 clustering using superpositional modeling
Author
Nakai, Mitsuru ; Singer, Harald ; Sagisaka, Yoshinori ; Shimodaira, Hiroshi
Author_Institution
Tohoku Univ., Sendai, Japan
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
624
Abstract
In this paper, we propose an automatic method for detecting accent phrase boundaries in Japanese continuous speech by using F0 information. In the training phase, hand labeled accent patterns are parameterized according to a superpositional model proposed by Fujisaki, and assigned to some clusters by a clustering method, in which accent templates are calculated as centroid of each cluster. In the segmentation phase, automatic N-best extraction of boundaries is performed by one-stage DP matching between the reference templates and the target F0 contour. About 90% of accent phrase boundaries were correctly detected in speaker independent experiments with the ATR Japanese continuous speech database
Keywords
natural languages; sequences; speech recognition; ATR Japanese continuous speech database; F0 clustering; F0 information; Japanese continuous speech; accent phrase boundaries; accent templates; automatic N-best extraction; automatic prosodic segmentation; clustering method; hand labeled accent patterns; one-stage DP matching; speaker independent experiments; superpositional modeling; training phase; Clustering methods; Costs; Data mining; Databases; Equations; Frequency; Pattern recognition; Spatial databases; Speech recognition; Stochastic processes; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479675
Filename
479675
Link To Document