DocumentCode :
177974
Title :
Neural networks for supervised pitch tracking in noise
Author :
Kun Han ; DeLiang Wang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
1488
Lastpage :
1492
Abstract :
Determination of pitch in noise is challenging because of corrupted harmonic structure. In this paper, we extract pitch using supervised learning, where probabilistic pitch states are directly learned from noisy speech. We investigate two alternative neural networks modeling the pitch states given observations. The first one is the feedforward deep neural network (DNN), which is trained on static frame-level features. The second one is the recurrent deep neural network (RNN) capable of learning the temporal dynamics trained on sequential frame-level features. Both DNNs and RNNs produce accurate probabilistic outputs of pitch states, which are then connected into pitch contours by Viterbi decoding. Our systematic evaluation shows that the proposed pitch tracking approaches are robust to different noise conditions and significantly outperform current state-of-the-art pitch tracking techniques.
Keywords :
Viterbi decoding; feedforward neural nets; learning (artificial intelligence); probability; recurrent neural nets; speech coding; DNN; RNN; Viterbi decoding; corrupted harmonic structure; feedforward deep neural network; noise conditions; noisy speech; pitch contours; pitch determination; pitch extraction; probabilistic pitch states output; recurrent deep neural network; sequential frame-level features; supervised learning; supervised pitch tracking; temporal dynamics; Hidden Markov models; Probabilistic logic; Signal to noise ratio; Speech; Training; Viterbi algorithm; Deep neural networks; Pitch estimation; Recurrent neural networks; Supervised learning; Viterbi decoding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6853845
Filename :
6853845
Link To Document :
بازگشت