DocumentCode :
1695675
Title :
Minimal-resource phonetic language models to summarize untranscribed speech
Author :
Chen, Nancy F. ; Bin Ma ; Haizhou Li
Author_Institution :
Inst. for Infocomm Res., A*STAR, Singapore, Singapore
fYear :
2013
Firstpage :
8357
Lastpage :
8361
Abstract :
We propose to extract summary sentences from lexically untranscribed speech via phone tokenization. We use decoded phone sequences instead of words to train language models to infer semantically significant utterances. Phone tokens yield comparable results to words on the TDT-2 English corpus, yet require significantly less linguistic resources - no need for automatic speech recognition (ASR): (1) Using decoded phones of high phone error rate (78.7%) leads to comparable results to using ASR-decoded words. (2) Tokenizing English audio using a Czech phone recognizer leads to comparable results to using English words from closed-captions. These trends parallel those established in spoken language recognition and have practical significance: we can potentially summarize speech passages of resource-poor languages by leveraging existing tools developed on resource-rich languages.
Keywords :
natural language processing; speech recognition; Czech phone recognizer; English audio; minimal resource phonetic language model; phone sequence; phone tokenization; semantically significant utterance; spoken language recognition; summary sentence extraction; untranscribed speech summary; Accuracy; Error analysis; Hidden Markov models; Pragmatics; Silicon; Speech; Speech recognition; audio indexing; extractive speech summarization; phone recognition; phone tokenization; spoken document retrieval; spoken language understanding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639295
Filename :
6639295
Link To Document :
بازگشت