Title :
HMM and neural network based speech act detection
Author_Institution :
Language Tech. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
We present an incremental lattice generation approach to speech act detection for spontaneous and overlapping speech in telephone conversations (CallHome Spanish). At each stage of the process it is therefore possible to use different models after the initial HMM models have generated a reasonable set of hypothesis. These lattices can be processed further by more complex models. This study shows how neural networks can be used very effectively in the classification of speech acts. We find that speech acts can be classified better using the neural net based approach than using the more classical ngram backoff model approach. The best resulting neural network operates only on unigrams and the integration of the ngram backoff model as a prior to the model reduces the performance of the model. The neural network can therefore more likely be robust against errors from an LVCSR system and can potentially be trained from a smaller database
Keywords :
grammars; hidden Markov models; neural nets; signal classification; signal detection; speech processing; telephony; CallHome Spanish; HMM; LVCSR system; database; incremental lattice generation; model performance; neural networks; ngram backoff model; overlapping speech; speech act classification; speech act detection; spontaneous speech; telephone conversations; unigrams; Databases; Hidden Markov models; Interactive systems; Lattices; Natural languages; Neural networks; Robustness; Speech; Telephony; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758171