DocumentCode :
3718764
Title :
Speech summarization using clustered frames acoustic units for Persian news
Author :
Maryam Asadolahzade Kermanshahi;Mohammad Mehdi Homayounpur
Author_Institution :
Department of Computer Engineering, Amirkabir University of Technology, Tehran, Iran
fYear :
2015
Firstpage :
313
Lastpage :
318
Abstract :
In this paper we present a method for news speech summarization that can select directly the important sentences from a spoken document. In this paper instead of using the conventional speech units such as phonemes and words, a set of new speech units obtained by a clustering technique are used. Our speech summarization method uses these new acoustic units instead of words for speech summarization. To obtain the new speech units, simply we extract frames from training set of summarization corpus and cluster them using K-means algorithm. To summarize a spoken document, each sentence of the document is represented by a sequences of new acoustic units. Then we use n-grams of new acoustic units for summarization. Experiments were conducted on a Persian news dataset, and encouraging results were obtained. The results were compared to speech summarization method using a phoneme recognition system according to ROUGE-N and ROUGE-L measures.
Keywords :
"MATLAB","Measurement","Speech","Acoustics","Indexes","Support vector machines"
Publisher :
ieee
Conference_Titel :
Computer and Knowledge Engineering (ICCKE), 2015 5th International Conference on
Type :
conf
DOI :
10.1109/ICCKE.2015.7365848
Filename :
7365848
Link To Document :
بازگشت