Title :
Alignment between a technical paper and presentation sheets using a hidden Markov model
Author :
Hayama, Tessai ; Nanba, Hidetsugu ; Kunifuji, Susumu
Author_Institution :
Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
Abstract :
We have been studying the automatic generation of presentation sheets from a technical paper. Our approach consists of obtaining a set of rules for generating presentation sheets by applying machine learning techniques to many pairs of technical papers and their presentation sheets collected from the World Wide Web. As a first step, in this paper, we propose a method for aligning technical papers and presentation sheets. Our method is based on Jing´s method, which uses a hidden Markov model (HMM). Although this method is useful to align short sentences in newspaper articles, it is inapplicable to align sentences in a paper including charts and long sentences. Therefore, we analyse features of papers and sheets, such as information from text appearance, and propose an alignment method that combines the use of these features and Jing´s method. The evaluation shows that our alignment method performed effectively.
Keywords :
Internet; document handling; hidden Markov models; knowledge acquisition; learning (artificial intelligence); natural languages; Jing method; World Wide Web; document handling; hidden Markov model; knowledge acquisition; machine learning techniques; presentation sheet; rule generation; technical paper alignment; Cities and towns; Equations; Hidden Markov models; Information analysis; Machine learning; Paper technology; Performance evaluation; TV; Teletext; Web sites;
Conference_Titel :
Active Media Technology, 2005. (AMT 2005). Proceedings of the 2005 International Conference on
Print_ISBN :
0-7803-9035-0
DOI :
10.1109/AMT.2005.1505278