Title :
Modeling human activities as speech
Author :
Chen, Chia-Chih; Aggarwal, J.K.
Author_Institution :
Dept. of ECE, Univ. of Texas at Austin, Austin, TX, USA
Abstract :
Human activity recognition and speech recognition appear to be loosely related research areas. On closer inspection, however, activity and speech signals share several analogies in the way they are generated, propagated, and perceived. In this paper, we propose a novel action representation, the action spectrogram, inspired by the common spectrographic representation of speech. Unlike a sound spectrogram, an action spectrogram is a space-time-frequency representation that characterizes the short-time spectral properties of body part movements. Whereas the essence of a speech signal is the variation of air pressure over time, our method models activities as likelihood time series of action-associated local interest patterns. This low-level process is realized by learning boosted window classifiers from spatially quantized spatio-temporal interest features. We have tested our algorithm on a variety of human activity datasets and achieved superior results.
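To make the pipeline concrete, the sketch below illustrates its signal-processing core under stated assumptions: the per-region likelihood time series are simulated with random data (in the paper they come from boosted window classifiers over spatio-temporal interest features), and the region count, sequence length, and STFT window/hop sizes are hypothetical placeholders rather than values from the paper. It is a minimal illustration of a space-time-frequency representation, not the authors' implementation.

```python
import numpy as np
from scipy.signal import stft

# Simulated per-frame classifier likelihoods for K spatially quantized
# body regions (hypothetical stand-in for the paper's boosted window
# classifier outputs over spatio-temporal interest features).
rng = np.random.default_rng(0)
num_frames, num_regions = 300, 8      # assumed sequence length / spatial bins
likelihoods = rng.random((num_regions, num_frames))

# A short-time Fourier transform of each region's likelihood series gives a
# space-time-frequency volume: one time-frequency plane per spatial region,
# analogous to a sound spectrogram computed from air-pressure variation.
window_len, hop = 32, 8               # assumed STFT window / hop (frames)
freqs, times, coeffs = stft(likelihoods, fs=1.0, nperseg=window_len,
                            noverlap=window_len - hop, axis=-1)
action_spectrogram = np.abs(coeffs)   # magnitudes, one plane per region

print(action_spectrogram.shape)       # (num_regions, freq_bins, time_steps)
```

Each slice of the resulting volume plays the role a spectrogram plays in speech recognition: short-time spectral features on which a downstream activity classifier could be trained.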
Keywords :
gesture recognition; signal classification; spectral analysis; speech recognition; time series; action representation; action spectrogram; body parts movements; boosted window classifiers; human activities modeling; human activity datasets; human activity recognition; likelihood time series; local interest patterns; short-time spectral property; sound spectrogram; space-time-frequency representation; spatially quantized spatio-temporal interest features; spectrographic speech representation; speech recognition; speech signals; Feature extraction; Humans; Speech; Speech recognition; Time series analysis; Training; Videos;
Conference_Title :
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location :
Providence, RI
Print_ISBN :
978-1-4577-0394-2
DOI :
10.1109/CVPR.2011.5995555