DocumentCode
2917312
Title
Modeling human activities as speech
Author
Chen, Chia-Chih ; Aggarwal, J.K.
Author_Institution
Dept. of ECE, Univ. of Texas at Austin, Austin, TX, USA
fYear
2011
fDate
20-25 June 2011
Firstpage
3425
Lastpage
3432
Abstract
Human activity recognition and speech recognition appear to be two loosely related research areas. However, on a careful thought, there are several analogies between activity and speech signals with regard to the way they are generated, propagated, and perceived. In this paper, we propose a novel action representation, the action spectrogram, which is inspired by a common spectrographic representation of speech. Different from sound spectrogram, an action spectrogram is a space-time-frequency representation which characterizes the short-time spectral properties of body parts´ movements. While the essence of the speech signal is the variation of air pressure in time, our method models activities as the likelihood time series of action associated local interest patterns. This low-level process is realized by learning boosted window classifiers from spatially quantized spatio-temporal interest features. We have tested our algorithm on a variety of human activity datasets and achieved superior results.
Keywords
gesture recognition; signal classification; spectral analysis; speech recognition; time series; action representation; action spectrogram; body parts movements; boosted window classifiers; human activities modeling; human activity datasets; human activity recognition; likelihood time series; local interest patterns; short-time spectral property; sound spectrogram; space-time-frequency representation; spatially quantized spatio-temporal interest features; spectrographic speech representation; speech recognition; speech signals; Feature extraction; Humans; Speech; Speech recognition; Time series analysis; Training; Videos;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location
Providence, RI
ISSN
1063-6919
Print_ISBN
978-1-4577-0394-2
Type
conf
DOI
10.1109/CVPR.2011.5995555
Filename
5995555
Link To Document