DocumentCode
3132772
Title
Active learning for accent adaptation in Automatic Speech Recognition
Author
Nallasamy, Udhyakumar ; Metze, Florian ; Schultz, Tanja
Author_Institution
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
2012
fDate
2-5 Dec. 2012
Firstpage
360
Lastpage
365
Abstract
We experiment with active learning for speech recognition in the context of accent adaptation. We adapt a source recognizer on the target accent by selecting a relatively small, matched subset of utterances from a large, untranscribed and multi-accented corpus for human transcription. Traditionally, active learning in speech recognition has relied on uncertainty based sampling to choose the most informative data for manual labeling. Such an approach doesn´t include explicit relevance criterion during data selection, which is crucial for choosing utterances to match the target accent, from datasets with wide-ranging speakers of different accents. We formulate a cross-entropy based relevance measure to complement uncertainty based sampling for active learning to aid accent adaptation. We evaluate the algorithm on two different setups for Arabic and English accents and show that our approach performs favorably to conventional data selection. We analyze the results to show the effectiveness of our approach in finding the most relevant subset of utterances for improving the speech recognizer on the target accent.
Keywords
learning (artificial intelligence); natural language processing; sampling methods; speech recognition; Arabic accents; English accents; accent adaptation; active learning; automatic speech recognition; cross-entropy based relevance measure; data selection; explicit relevance criterion; human transcription; manual labeling; source recognizer; uncertainty based sampling; Adaptation models; Entropy; Labeling; Measurement uncertainty; Speech; Speech recognition; Uncertainty; Accent Adaptation; Active Learning; Automatic Speech Recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location
Miami, FL
Print_ISBN
978-1-4673-5125-6
Electronic_ISBN
978-1-4673-5124-9
Type
conf
DOI
10.1109/SLT.2012.6424250
Filename
6424250
Link To Document