DocumentCode :
1584922
Title :
Flexible Web document analysis for delivery to narrow-bandwidth devices
Author :
Penn, Gerald ; Hu, Jianying ; Luo, Hengbin ; McDonald, Ryan
Author_Institution :
Language Modeling Res., Lucent Technol. Bell Labs., Murray Hill, NJ, USA
fYear :
2001
fDate :
2001
Firstpage :
1074
Lastpage :
1078
Abstract :
We propose a set of baseline heuristics for identifying genuinely tabular information and news links in HTML documents. A prototype implementation of these heuristics is described for delivering content from news providers' home pages to a narrow-bandwidth device such as a portable digital assistant or cellular phone display. Its evaluation on 75 Web sites is provided, along with a discussion of topics for future research.
Keywords :
Internet; document image processing; hypermedia markup languages; information resources; HTML; Web document analysis; Web sites; World Wide Web; cellular phone display; heuristics; home pages; narrow-bandwidth devices; news links; portable digital assistant; tabular information; Cellular phones; Computer displays; Computer science; Educational institutions; Laboratories; Multimedia systems; Portals; Prototypes; Text analysis
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
Type :
conf
DOI :
10.1109/ICDAR.2001.953951
Filename :
953951