DocumentCode
2532293
Title
An investigation into the effect of pitch transformation on children speech recognition
Author
Ghai, Shweta ; Sinha, Rohit
Author_Institution
Dept. of Electron. & Commun. Eng., Indian Inst. of Technol. Guwahati, Guwahati
fYear
2008
fDate
19-21 Nov. 2008
Firstpage
1
Lastpage
6
Abstract
The degradation in the automatic speech recognition performance of the adult speech trained models for children speech data is a well known problem. In this work, motivated by the voice conversion approaches for addressing the acoustic mis-match between the adult and children speech, we investigated the effect of pitch transformation on children speech on telephone-based connected digit recognition task. Our preliminary results indicate that the effect of pitch transformation on the recognition performance of the children speech varies with their average pitch values. With the reduction of pitch, an improvement of 10% was observed in the speech recognition performance for children having pitch values more than 300 Hz. We have also proposed an explanation for this performance improvement based on the study of filter-bank smoothing in front-end signal processing.
Keywords
acoustic signal processing; audio signal processing; channel bank filters; speech recognition; acoustic mismatch; automatic speech recognition; children speech recognition; filter-bank smoothing; front-end signal processing; pitch transformation; telephone-based connected digit recognition task; voice conversion approaches; Acoustical engineering; Automatic speech recognition; Degradation; Loudspeakers; Maximum likelihood linear regression; Mel frequency cepstral coefficient; Signal processing; Smoothing methods; Speech analysis; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2008 - 2008 IEEE Region 10 Conference
Conference_Location
Hyderabad
Print_ISBN
978-1-4244-2408-5
Electronic_ISBN
978-1-4244-2409-2
Type
conf
DOI
10.1109/TENCON.2008.4766828
Filename
4766828
Link To Document