Title :
A robust approach to sequence classification
Author :
Li, Ming ; Sleep, Ronan
Author_Institution :
Sch. of Comput. Sci., East Anglia Univ., Norwich
Abstract :
We report results for classification of representations of music, spoken words, and text documents. Experimental comparisons with other state-of-the-art algorithms yield improved results for all three examples. We use a support vector machine (SVM) as our classifier in all experiments. This is driven by a kernel matrix of similarity measures between the sequences. Our similarity measure is based on n-grams of varying length (multi-grams), weighted to reflect discrimination ability. To alleviate the problem of the exponential growth of feature size with n, we use a modified LZ78 algorithm (Z. Jacob and L. Abraham, 1978) to guide feature selection. Our method exhibits good performance over the three widely distinct tasks reported here, and is very computationally efficient and may therefore be useful in real time applications
Keywords :
music; natural languages; support vector machines; text analysis; feature selection; kernel matrix; modified LZ78 algorithm; music representation classification; sequence classification; similarity measurement; spoken words representation classification; support vector machine; text documents representation classification; Frequency; Hidden Markov models; Kernel; Length measurement; Music information retrieval; Quantization; Robustness; Speech recognition; Support vector machine classification; Support vector machines;
Conference_Titel :
Tools with Artificial Intelligence, 2005. ICTAI 05. 17th IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2488-5
DOI :
10.1109/ICTAI.2005.16