DocumentCode
2276207
Title
Automatic Speech Sentence Segmentation from Multi-paragraph Databases
Author
Zhang Wei ; Pang Minhui ; Du Ranran ; Liu Yayu
Author_Institution
Dept. of Comput. Sci. & Technol., Ocean Univ. of China, Qingdao, China
Volume
1
fYear
2010
fDate
13-14 March 2010
Firstpage
721
Lastpage
724
Abstract
Speech sentence is the input of automatic phonetic segmentation or transcription. This paper discusses our efforts on automatic speech sentence segmentation from multi-paragraph speech databases for building Text-To-Speech (TTS) system speech corpus automatically. We present a) a system of automatic speech sentence segmentation from broadcasting audio based on forced alignment technique, in which a checking Mechanism based on speech recognition technique is also used, b) an iterative algorithm to improve the system, c) a music detector based on a scheme combination of Variable Duration Hidden Markov Model (VDHMM) and Gaussian Mixture Model (GMM). Experiments show that the improved system has 98.93% of Sentence Accurate Rate (SAR) and generates 646 correct sentences, compared with 97.85% of SAR, and 155 correct sentences in original system.
Keywords
Gaussian processes; hidden Markov models; speech processing; speech recognition; speech synthesis; Gaussian mixture model; automatic phonetic segmentation; automatic speech sentence segmentation; broadcasting audio; forced alignment technique; iterative algorithm; multiparagraph speech databases; music detector; sentence accurate rate; speech recognition technique; text-to-speech system speech corpus; variable duration hidden Markov model; Broadcasting; Databases; Detectors; Hidden Markov models; Iterative algorithms; Marine technology; Sea measurements; Speech processing; Speech synthesis; Viterbi algorithm; automatic speech sentence segmentation; multi-paragraph speech database; speech process;
fLanguage
English
Publisher
ieee
Conference_Titel
Measuring Technology and Mechatronics Automation (ICMTMA), 2010 International Conference on
Conference_Location
Changsha City
Print_ISBN
978-1-4244-5001-5
Electronic_ISBN
978-1-4244-5739-7
Type
conf
DOI
10.1109/ICMTMA.2010.784
Filename
5458513
Link To Document