Title :
On-Line Speech/Music Segmentation for Broadcast News Domain
Author :
Marko Kos;Matej Grasic;Damjan Vlaj;Zdravko Kacic
Author_Institution :
Fac. of Electr. Eng. & Comput. Sci., Univ. of Maribor, Maribor, Slovenia
Abstract :
This paper presents novel feature-group for on-line speech/music segmentation for broadcast news domain. The features are based on mel-frequency cepstral coefficients variance (MFCCV). The idea behind the feature-group construction is the energy variation in a narrow frequency sub-band. The variation is bigger for speech than for music. For feature discrimination and segmentation ability evaluation the radio broadcast database was used. Results show that MFCCV features perform better than the classic MFCC features. The MFCCV features are very convenient speech/music discriminator for automatic speech recognition system where MFCC features are used, as they perform better than classic MFCC features and only one additional calculation step is needed.
Keywords :
"Music","Mel frequency cepstral coefficient","Radio broadcasting","Automatic speech recognition","Entropy","Robustness","Computer science","Cepstral analysis","Spatial databases","Frequency modulation"
Conference_Titel :
Systems, Signals and Image Processing, 2009. IWSSIP 2009. 16th International Conference on
Print_ISBN :
978-1-4244-4530-1
DOI :
10.1109/IWSSIP.2009.5367789