DocumentCode
3000104
Title
Classification of durian characteristics for semantic representation from web documents
Author
Bakar, Z.A. ; Ismail, Khairul Nurmazianna
Author_Institution
Fac. of Comput. & Math. Sci., Dept. of Comput. Sci., Univ. Teknol. MARA (UiTM), Shah Alam, Malaysia
fYear
2012
fDate
21-24 Oct. 2012
Firstpage
1
Lastpage
5
Abstract
The Web contains an enormous amount of information represented in various document structures. This information is scattered and redundant. Currently, search engines are the main medium for retrieving it. Yet even the most popular search engines cannot satisfy user queries. Semantic technology can alleviate this problem. In this paper, only relevant HTML web documents on durian, also known as the king of fruits, are chosen. The characteristics of durian are extracted from those HTML documents. These characteristics are then employed in a semantic representation and stored along with their Uniform Resource Identifiers (URIs) in the Resource Description Framework (RDF). The RDF provides the ontology link to many other web documents on durian. An experiment on 40 HTML documents yields eleven new characteristics of durian that can be represented in RDF for a semantic search engine.
Keywords
Internet; agricultural products; document handling; food products; hypermedia markup languages; information retrieval; pattern classification; search engines; semantic Web; RDF; URI; Web HTML documents; durian characteristics; fruits; information retrieval; ontology link; resource description framework; search engine; semantic representation; semantic technology; uniform resource identifier; Agriculture; Government; HTML; Ontologies; Resource description framework; Semantics; Durian; HTML; RDF; semantic;
fLanguage
English
Publisher
IEEE
Conference_Titel
E-Learning, E-Management and E-Services (IS3e), 2012 IEEE Symposium on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4673-2390-1
Type
conf
DOI
10.1109/IS3e.2012.6414956
Filename
6414956