Title : 
An integrated approach to the detection and classification of accents/dialects for a spoken document retrieval system
         
        
            Author : 
Gray, Sharmistha ; Hansen, John H L
         
        
            Author_Institution : 
Dept. of Speech, Language & Hearing Sci., Colorado Univ., Boulder, CO
         
        
            Abstract : 
This study proposes an integrated approach to accent/dialect detection and classification that can be used to enhance Rich indexing of historical spoken documents with accent/dialect information. A next-generation spoken document retrieval (SDR) system would require a more diverse set of speech criteria, including speaker, accent/dialect, language, stress/emotion, and environment content. The proposed accent/dialect tagging system for SDR is based on several recent advances in a multi-dimensional space. Here, temporal- and spectral-based features, including the stochastic trajectory model (STM), pitch structure, formant location, and voice onset time (VOT), are considered. Mono-phone based STM (MP-STM) is shown to be the most successful for dialect classification, with an average classification rate of 96.5% for read speech and 72.5% for spontaneous speech when classifying four dialects. An example of next-generation Rich transcript indexing for conversational speech to simulate SDR is also presented.
         
        
            Keywords : 
classification; indexing; information retrieval; natural languages; speech processing; Rich transcript indexing; accent detection; accent-dialect tagging system; dialect classification; dialect detection; formant location; pitch structure; speech criteria; spoken document retrieval system; stochastic trajectory model; voice onset time; Auditory system; Automatic speech recognition; Content based retrieval; Indexing; Information retrieval; Natural languages; Robustness; Speech enhancement; Speech processing; Stress
         
        
        
        
            Conference_Title : 
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
         
        
            Conference_Location : 
San Juan
         
        
            Print_ISBN : 
0-7803-9478-X
         
        
            Electronic_ISBN : 
0-7803-9479-8
         
        
        
            DOI : 
10.1109/ASRU.2005.1566480