DocumentCode
3661040
Title
A biologically inspired onset and offset speech segmentation approach
Author
Andrew K. Abel;Dean Hunter;Leslie S. Smith
Author_Institution
Computing Science and Mathematics, University of Stirling, FK9 4LA, Scotland
fYear
2015
fDate
7/1/2015 12:00:00 AM
Firstpage
1
Lastpage
8
Abstract
A key component in the processing of speech is the division of longer input sounds into a number of smaller sections. For speech interpretation, it is generally easier to classify single sections. Similarly, when processing speech for other purposes (e.g. speech filtering), it can be easier and more relevant to process individual phonemes. Here, we propose a biologically inspired speech segmentation technique that filters the speech into multiple bandpassed channels using a Gammatone filterbank, and then uses an essentially energy-based spike coding technique to find the onsets and offsets present in an audio signal. These onsets and offsets are then processed using leaky integrate-and-fire neurons, and the spikes from these neurons are used to determine the speech segmentation. We evaluate this new system using a quantitative evaluation metric, and the promising segmentation results for both clean speech and speech in noise demonstrate the effectiveness of this technique.
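The leaky integrate-and-fire stage mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name lif_spike_times, the Euler update, and all parameter values (dt, tau, threshold, v_reset) are assumptions chosen only to show how such a neuron turns a burst of onset drive into a few output spikes.

```python
def lif_spike_times(input_current, dt=0.001, tau=0.02, threshold=1.0, v_reset=0.0):
    """Return spike times (in seconds) of a simple leaky integrate-and-fire neuron.

    All parameters are illustrative assumptions, not values from the paper.
    input_current holds one drive value per time step of length dt.
    """
    v = 0.0          # membrane potential, starts at rest
    spikes = []
    for i, current in enumerate(input_current):
        # Forward-Euler step: the potential leaks toward zero with time
        # constant tau while integrating the incoming drive.
        v += dt * (-v / tau + current)
        if v >= threshold:
            spikes.append(i * dt)  # record the time of this step
            v = v_reset            # reset after firing
    return spikes


# Hypothetical drive: silence, a 50 ms burst of onset activity, then silence.
drive = [0.0] * 50 + [100.0] * 50 + [0.0] * 50
onset_spikes = lif_spike_times(drive)
print(onset_spikes)
```

In this sketch the neuron stays silent for weak or absent drive and fires only while the burst is present, which is the property that makes such neurons useful for marking candidate segment boundaries.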
Keywords
"Biology","Signal resolution"
Publisher
IEEE
Conference_Titel
Neural Networks (IJCNN), 2015 International Joint Conference on
Electronic_ISBN
2161-4407
Type
conf
DOI
10.1109/IJCNN.2015.7280347
Filename
7280347