DocumentCode :
3323133
Title :
Characterizing Genres of Web Pages: Genre Hybridism and Individualization
Author :
Santini, Marina
Author_Institution :
Univ. of Brighton
fYear :
2007
fDate :
Jan. 2007
Firstpage :
71
Lastpage :
71
Abstract :
When dealing with genres of Web pages, there are two important aspects to be taken into account. On the one hand, the Web is fluid, unstable and fast-paced. On the other hand, genres on the Web are instantiated in Web pages, which are a complex type of document, more composite and unpredictable than paper documents. These two aspects are interwoven and often result in classification hurdles. In this paper, the author suggests analyzing these classification problems in terms of two broad textual phenomena: genre hybridism and individualization. The identification of these two phenomena helps pinpoint the range of flexibility that an automatic classification system should have. More precisely, genre hybridism accounts for multi-genre variation within the individual Web page, while individualization refers to the absence of any recognized genre in a Web page. In short, the aim of this paper is to show that Web pages need a zero-to-multi-genre classification scheme, i.e. a scheme that allows zero-genre or multi-genre classification in addition to the traditional single-genre classification.
Keywords :
Internet; classification; Web page; genre hybridism; genre individualization; Data mining; Face detection; Information retrieval; Labeling; Multidimensional systems; Software libraries; Stress; Web pages
fLanguage :
English
Publisher :
IEEE
Conference_Title :
40th Annual Hawaii International Conference on System Sciences (HICSS 2007)
Conference_Location :
Waikoloa, HI
ISSN :
1530-1605
Electronic_ISBN :
1530-1605
Type :
conf
DOI :
10.1109/HICSS.2007.124
Filename :
4076514