DocumentCode :
3132772
Title :
Active learning for accent adaptation in Automatic Speech Recognition
Author :
Nallasamy, Udhyakumar ; Metze, Florian ; Schultz, Tanja
Author_Institution :
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2012
fDate :
2-5 Dec. 2012
Firstpage :
360
Lastpage :
365
Abstract :
We experiment with active learning for speech recognition in the context of accent adaptation. We adapt a source recognizer on the target accent by selecting a relatively small, matched subset of utterances from a large, untranscribed and multi-accented corpus for human transcription. Traditionally, active learning in speech recognition has relied on uncertainty based sampling to choose the most informative data for manual labeling. Such an approach doesn´t include explicit relevance criterion during data selection, which is crucial for choosing utterances to match the target accent, from datasets with wide-ranging speakers of different accents. We formulate a cross-entropy based relevance measure to complement uncertainty based sampling for active learning to aid accent adaptation. We evaluate the algorithm on two different setups for Arabic and English accents and show that our approach performs favorably to conventional data selection. We analyze the results to show the effectiveness of our approach in finding the most relevant subset of utterances for improving the speech recognizer on the target accent.
Keywords :
learning (artificial intelligence); natural language processing; sampling methods; speech recognition; Arabic accents; English accents; accent adaptation; active learning; automatic speech recognition; cross-entropy based relevance measure; data selection; explicit relevance criterion; human transcription; manual labeling; source recognizer; uncertainty based sampling; Adaptation models; Entropy; Labeling; Measurement uncertainty; Speech; Speech recognition; Uncertainty; Accent Adaptation; Active Learning; Automatic Speech Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
Type :
conf
DOI :
10.1109/SLT.2012.6424250
Filename :
6424250
Link To Document :
بازگشت