DocumentCode :
589226
Title :
Semantic Indexing of Video Simulations for Enhancing Medical Care During Crises
Author :
Jiang, Shuangshuang ; Frigui, Hichem ; Calhoun, A.W.
Author_Institution :
CECS Dept., Univ. of Louisville, Louisville, KY, USA
Volume :
1
fYear :
2012
fDate :
12-15 Dec. 2012
Firstpage :
520
Lastpage :
525
Abstract :
We propose a machine-learning-based speaker segmentation and identification system that gives physicians automated tools to segment, semantically index, and retrieve specific segments from a large database of medical simulation video sessions. Instead of working directly in the original feature space, our approach maps low-level audio features to more meaningful histogram descriptors using possibilistic membership functions, whose parameters are learned from representative training data. Using four medical simulation videos, we show that our approach outperforms existing speaker identification algorithms that use standard MFCC and PLP speech features. The proposed speaker identification algorithm was integrated into a GUI to facilitate video retrieval and analysis. Using this GUI, the physician can efficiently identify who spoke and when. In addition, our system can extract useful statistics and features in a completely unsupervised way.
Keywords :
health care; indexing; learning (artificial intelligence); speaker recognition; video retrieval; GUI; MFCC; PLP; histogram descriptors; low-level audio features; machine learning based speaker identification; machine learning based speaker segmentation; medical care enhancement; medical simulation video sessions; possibilistic membership functions; semantic indexing; Feature extraction; Histograms; Medical services; Mel frequency cepstral coefficient; Prototypes; Speech; Training; Clustering; Fuzzy k-nearest neighbor (K-NN) classifier; Possibilistic-histogram; Speaker identification; Speaker segmentation
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4673-4651-1
Type :
conf
DOI :
10.1109/ICMLA.2012.95
Filename :
6406616