Title :
Improved tone recognition for fluent Mandarin speech based on new inter-syllabic features and robust pitch extraction
Author :
Lin, Wan-yi ; Lee, Lin-rhan
Author_Institution :
Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fDate :
30 Nov.-3 Dec. 2003
Abstract :
Tone recognition for fluent Mandarin speech has always been a very difficult problem, because the pitch contours vary seriously with the context conditions and the complicated tone behavior is difficult to analyze. A new set of four inter-syllabic features are identified to characterize quantitatively such pitch contour variation with respect to the context conditions. In addition, a robust pitch extraction method is proposed by integrating the adaptive Gabor representation (AGR) and instantaneous frequency amplitude spectrum (IFAS). Experimental results indicate that accurate pitch values can be extracted under various noisy conditions, and the tone recognition accuracy can be improved significantly.
Keywords :
acoustic noise; feature extraction; natural languages; parameter estimation; spectral analysis; speech recognition; adaptive Gabor representation; fluent Mandarin speech; instantaneous frequency amplitude spectrum; inter-syllabic features; pitch contour variation; pitch estimation; robust pitch extraction; tone recognition; Context; Degradation; Frequency estimation; Natural languages; Robustness; Spectrogram; Speech analysis; Speech recognition; Tin; Working environment noise;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
DOI :
10.1109/ASRU.2003.1318447