DocumentCode :
1697002
Title :
Exceptions in language as learned by the multi-factor sparse plus low-rank language model
Author :
Hutchinson, Brian ; Ostendorf, Mari ; Fazel, Maryam
Author_Institution :
Electr. Eng. Dept., Univ. of Washington, Seattle, WA, USA
fYear :
2013
Firstpage :
8580
Lastpage :
8584
Abstract :
Word usage is influenced by diverse factors, including topic, genre and various speaker/author characteristics. To characterize these aspects of language, we introduce the “Multi-Factor Sparse Plus Low Rank” exponential language model, which allows supervised joint training of arbitrary overlapping factor-specific model components. This flexible architecture has the advantage of being highly interpretable. The elements of sparse parameter matrices can be viewed as factor-dependent corrections (e.g. topic- or speaker-dependent phenomena). In topic modeling experiments on conversational telephone speech, we obtain modest perplexity reductions over an n-gram baseline and demonstrate topic-dependent keyword extraction that leads to a 13% (absolute) improvement in precision over TFIDF. We also show how keywords can be jointly learned for speakers, roles and topics in a study of Supreme Court oral arguments.
Keywords :
computational linguistics; learning (artificial intelligence); matrix algebra; natural language processing; speech processing; TFIDF; arbitrary overlapping factor-specific model components; conversational telephone speech; factor-dependent corrections; flexible architecture; modest perplexity reductions; multifactor sparse plus low-rank exponential language model; n-gram baseline; sequential language behavior; sparse parameter matrices; speaker-author characteristics; supervised joint training; supervised learning; supreme court oral arguments; topic modeling; topic-dependent keyword extraction; word usage; Adaptation models; Computational modeling; Data models; Joints; Sparse matrices; Speech; Training; Language modeling; keyword extraction; sparse plus low rank decomposition; topic models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639340
Filename :
6639340
Link To Document :
بازگشت