DocumentCode
1296390
Title
Point Process Models for Spotting Keywords in Continuous Speech
Author
Jansen, Aren ; Niyogi, Partha
Author_Institution
Dept. of Comput. Sci., Univ. of Chicago, Chicago, IL, USA
Volume
17
Issue
8
fYear
2009
Firstpage
1457
Lastpage
1470
Abstract
We investigate the hypothesis that the linguistic content underlying human speech may be coded in the pattern of timings of various acoustic ldquoeventsrdquo (landmarks) in the speech signal. This hypothesis is supported by several strands of research in the fields of linguistics, speech perception, and neuroscience. In this paper, we put these scientific motivations to the test by formulating a point process-based computational framework for the task of spotting keywords in continuous speech. We find that even with a noisy and extremely sparse phonetic landmark-based point process representation, keywords can be spotted with accuracy levels comparable to recently studied hidden Markov model-based keyword spotting systems. We show that the performance of our keyword spotting system in the high-precision regime is better predicted by the median duration of the keyword rather than simply the number of its constituent syllables or phonemes. When we are confronted with very few (in the extreme case, zero) examples of the keyword in question, we find that constructing a keyword detector from its component syllable detectors provides a viable approach.
Keywords
computational linguistics; speech coding; speech recognition; component syllable detector; continuous speech; linguistic content; point process-based computational framework; sparse phonetic landmark-based point process representation; speech coding; speech recognition; spotting keyword; Acoustic testing; Detectors; Hidden Markov models; Humans; Neuroscience; Signal processing; Speech processing; Speech recognition; Timing; Vocabulary; Keyword spotting; point processes; speech recognition;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2009.2021307
Filename
5200685
Link To Document