Title :
Extraction of pitch register from expressive speech in Japanese
Author :
Ni, Jinfu ; Shiga, Yoshinori ; Hori, Chiori
Author_Institution :
Universal Commun. Res. Inst., Spoken Language Commun. Lab., Nat. Inst. of Inf. & Commun. Technol., Kyoto, Japan
Abstract :
Humans use intonation to create focal prominence, giving emphasis that highlights the focus of speech. Automatic extraction of proper intonation features from a speech corpus is desirable for processing speech prosody, especially in the context of speech synthesis. This paper presents a method for extracting pitch register from observed F0 contours for this purpose. The method uses a constrained tone transformation technique under the assumption that lexical accents are confined to parallel high and low tone lines with a limited constant span. Consequently, the extracted pitch register captures the dynamic range variation of the pitch accents of an utterance. The method is evaluated by objective tests on a large-scale expressive speech corpus. One finding is that proper intonation, as manifested in pitch register in Japanese, is closely comparable to English intonation in structural form.
Keywords :
feature extraction; speech synthesis; English intonation; Japanese; automatic extraction; constrained tone transformation technique; dynamic range variation; focal prominence; large-scale expressive speech corpus; lexical accents; low tone lines; pitch accents; pitch register; proper intonation features; Measurement; Registers; Fundamental frequency analysis; intonation proper; pitch decomposition; speech prosody
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178875