Title :
Music tonality features for speech/music discrimination
Author :
Sell, Gregory ; Clark, P.
Author_Institution :
Human Language Technol. Center, Excellence Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
We introduce a novel set of features for speech/music discrimination derived from chroma vectors, a feature that represents musical tonality. These features are shown to outperform other commonly used features in multiple conditions and corpora. Even when trained on mismatched data, the new features perform well on their own and also combine with existing features for further improvement. We report 97.1% precision on speech and 93.0% precision on music for the Broadcast News corpus using a simple classifier trained on a mismatched corpus.
Keywords :
music; speech processing; Broadcast News corpus; chroma vectors; mismatched data; music tonality features; simple classifier; speech-music discrimination; Detectors; Multiple signal classification; Music; Speech; Speech processing; Vectors; amplitude modulation; chroma; music detection; voice activity detection;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854048