DocumentCode :
1615201
Title :
Speech/non-speech detection in Malay language spontaneous speech
Author :
Izzad, M. ; Jamil, Nursuriati ; Bakar, Zainab Abu
Author_Institution :
Computer Science Department Faculty of computer and Mathematical Sciences Universiti Teknologi MARA 40450, Shah Alam, Selangor Malaysia
fYear :
2013
Firstpage :
219
Lastpage :
224
Abstract :
The goal of this work is to discriminate speech and non-speech segments in Malay language spontaneous speech as speech/non-speech detection is important in many speech processing applications. Inaccurate sentence boundaries are a major cause of errors in automatic speech recognition and a preprocessing stage that segments the speech signal into periods of speech and non-speech is invaluable in improving the recognition accuracy. We proposed a combination of three audio features that is energy, zero crossing rate (ZCR) and fundamental frequency (F0) for the speech/non-speech detection as each feature has unique properties to differentiate speech and non-speech segments. Experiments are conducted on one-hour Malay language spontaneous speech consisting of more than 20,000 speech/non-speech segments. An accuracy evaluation reveals that the proposed method achieved 97.8% accuracy rate. Non-speech segments will further be used as candidates of sentence boundary in our next experiment.
Keywords :
Accuracy; Feature extraction; Frequency measurement; Noise measurement; Speech; Speech processing; Speech recognition; prosodic features; sentence boundary detection; speech/non-speech detection; spontaneous speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing, Management and Telecommunications (ComManTel), 2013 International Conference on
Conference_Location :
Ho Chi Minh City, Vietnam
Print_ISBN :
978-1-4673-2087-0
Type :
conf
DOI :
10.1109/ComManTel.2013.6482394
Filename :
6482394
Link To Document :
بازگشت