DocumentCode
2703570
Title
Probabilistic and Bottle-Neck Features for LVCSR of Meetings
Author
Grezl, Frantisek ; Karafiat, Martin ; Kontar, S. ; Cernocky, Jan
Author_Institution
Speech@FIT Group, Brno Univ. of Technol., Czech Republic
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
In recent years, probabilistic features became an integral part of state-of-the-are LVCSR systems. In this work, we are exploring the possibility of obtaining the features directly from neural net without the necessity of converting output probabilities to features suitable for subsequent GMM-HMM system. We experimented with 5-layer MLP with bottle-neck in the middle layer. After training such a neural net, we used outputs of the bottle-neck as features for GMM-HMM recognition system. The benefits are twofold: first, improvement was gained when these features are used instead of the probabilistic features, second, the size of the system was reduced, as only part of the neural net is used. The experiments were performed on meetings recognition task defined in MST RT´05 evaluation.
Keywords
Gaussian processes; hidden Markov models; speech recognition; GMM; HMM; LVCSR; bottle-neck features; meetings recognition task; probabilistic features; recognition system; Discrete cosine transforms; Discrete transforms; Feature extraction; Hidden Markov models; Merging; NIST; Neural networks; Performance evaluation; Principal component analysis; Speech recognition; LVCSR; Probabilistic features; TRAP-based features; bottle-neck features; meeting recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.367023
Filename
4218211
Link To Document