DocumentCode :
763511
Title :
Note segmentation and quantization for music information retrieval
Author :
Adams, Norman H. ; Bartsch, Mark A. ; Wakefield, Gregory H.
Author_Institution :
Electr. Eng. & Comput. Sci. Dept., Univ. of Michigan, Ann Arbor, MI, USA
Volume :
14
Issue :
1
fYear :
2006
Firstpage :
131
Lastpage :
141
Abstract :
Much research in music information retrieval has focused on query-by-humming systems, which search melodic databases using sung queries. The database retrieval aspect of such systems has received considerable attention, but query processing and the melodic representation have not been examined as carefully. Common methods for query processing are based on musical intuition and historical momentum rather than specific performance criteria; existing systems often employ rudimentary note segmentation or coarse quantization of note estimates. In this work, we examine several alternative query processing methods as well as quantized melodic representations. One common difficulty with designing query-by-humming systems is the coupling between system components. We address this issue by measuring the performance of the query processing system both in isolation and coupled with a retrieval system. We first measure the segmentation performance of several note estimators. We then compute the retrieval accuracy of an experimental query-by-humming system that uses the various note estimators along with varying degrees of pitch and duration quantization. The results show that more advanced query processing can improve both segmentation performance and retrieval performance, although the best segmentation performance does not necessarily yield the best retrieval performance. Further, coarsely quantizing the melodic representation generally degrades retrieval accuracy.
Keywords :
audio databases; music; quantisation (signal); query processing; database retrieval; duration quantization; historical momentum; melodic databases; melodic representation; music information retrieval; musical intuition; note segmentation; pitch; quantization; query processing methods; query-by-humming systems; sung queries; Audio databases; Content based retrieval; Degradation; Multiple signal classification; Music information retrieval; Natural languages; Quantization; Query processing; Speech processing; Music information retrieval; pitch; pitch quantization; query-by-example; segmentation;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.854088
Filename :
1561271
Link To Document :
بازگشت