Title :
Full-rank linear-chain NeuroCRF for sequence labeling
Author :
Rondeau, Marc-Antoine ; Yi Su
Author_Institution :
McGill Univ., Montreal, QC, Canada
Abstract :
Inspired by the success of deep neural network-hidden Markov model (DNN-HMM) in acoustic modeling for automatic speech recognition, a number of researchers from various fields have independently proposed the idea of combining DNN and conditional random fields (CRFs). Despite their subtle differences, this class of models is collectively referred to as “NeuroCRF” in this paper. We focus our attention on applying a linear-chain NeuroCRF to the fundamental and ubiquitous problem of sequence labeling in natural language processing with distributed word representations. We question the necessity of previous works´ use of the neural network to learn a low-rank emission feature matrix, added to a transition feature matrix. By modeling a full-rank feature matrix directly, we show that statistically significant gains can be achieved on the CoNLL-2000 syntactic chunking task, without harming performance on tasks with low dependencies between consecutive labels, such as the CoNLL-2003 named entity recognition task.
Keywords :
hidden Markov models; natural language processing; neural nets; speech recognition; CoNLL-2000 syntactic chunking task; DNN-HMM; acoustic modeling; automatic speech recognition; conditional random fields; deep neural network; distributed word representations; entity recognition task; full-rank linear-chain NeuroCRF; hidden Markov model; low-rank emission feature matrix; natural language processing; sequence labeling; transition feature matrix; ubiquitous problem; Artificial neural networks; Feature extraction; Hidden Markov models; Labeling; Syntactics; Training; Neural networks; conditional random fields; spoken language understanding;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178979