DocumentCode :
660916
Title :
Improving Search Query Matching for Electronic TV Program Guide Data Extraction
Author :
Kiselev, Denis ; Rzepka, Rafal ; Araki, Kotaro
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
fYear :
2013
fDate :
16-18 Sept. 2013
Firstpage :
146
Lastpage :
149
Abstract :
This paper describes a system for searching the Web-based Japanese TV program guide. The system features using morphological parsing and part-of-speech analysis to locate words with nominal and attributive semantic features in the query. Such words are matched mandatorily when searching the TV program guide text, while other words are matched optionally. Moreover, certain words and morphemes are removed from the query as they are considered to have little semantic value. The system checks every query against a stop list of such words and morphemes. Other processing methods, e.g. reversing the search phrase word order and allowing "zero or more words" between the search target words, are also utilized. The present paper uses TV guide search examples to demonstrate how the proposed method can improve Japanese TV program data search results. The paper also contains a few ideas about ways the method could be used for other languages.
Keywords :
Internet; pattern matching; query processing; television; text analysis; TV program guide text; Web-based Japanese TV program guide; attributive semantic features; electronic TV program guide data extraction; morphological parsing; nominal semantic features; part-of-speech analysis; search query matching; word location; word matching; Broadcasting; Educational institutions; Meteorology; Search problems; Semantics; TV; EPG; Information Retrieval; Lexical Semantics; Morphological Parsing; NLP; Query Processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantic Computing (ICSC), 2013 IEEE Seventh International Conference on
Conference_Location :
Irvine, CA
Type :
conf
DOI :
10.1109/ICSC.2013.34
Filename :
6693509
Link To Document :
بازگشت