Title :
Sequential k-nearest neighbor pattern recognition for usable speech classification
Author :
Shah, Jashmin K. ; Smolenski, Brett Y. ; Yantorno, Robert E. ; Iyer, Ananth N.
Author_Institution :
Speech Process. Lab., Temple Univ., Philadelphia, PA, USA
Abstract :
The accuracy of speech processing techniques degrades when operating in a co-channel environment. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech segmentation is to identify and extract those portions of co-channel speech that are minimally degraded but still useful for speech processing application such as speaker identification. Usable speech measures are features that are extracted from the co-channel signal to distinguish between usable and unusable speech. In this paper, a new usable speech extraction technique is presented. The new method extracts features recursively and variable length segmentation is performed by making sequential decisions on the k-NN pattern classifier class assignments. This new approach is able to identify 79% of available usable speech segments with 21% false alarms and it requires lesser amount of data to make accurate decisions compared to previously presented methods.
Keywords :
feature extraction; pattern classification; speaker recognition; cochannel environment; cochannel speech; k-NN pattern classifier class assignments; sequential decisions; speaker identification; speech measures; speech processing techniques; usable speech extraction technique; usable speech segmentation; usable speech segments; variable length segmentation; Abstracts; Classification algorithms; Feature extraction; Speech; Uniform resource locators; Co-channel Speech; Sequential Detection; Speaker Identification; Usable Speech; k-Nearest Neighbor Classifier;
Conference_Titel :
Signal Processing Conference, 2004 12th European
Conference_Location :
Vienna
Print_ISBN :
978-320-0001-65-7