DocumentCode
1572751
Title
A collaborative approach to building evaluated web pages datasets
Author
Barros, Ricardo ; Rodrigues Nt, J.A. ; Filho, Heraldo J A Carneiro ; Ferreira, Flora ; Fernandes, O.C. ; Silva, Carlos Eduardo P ; Ribeiro, André L G ; Xexéo, Geraldo B. ; De Souza, Jano M.
Author_Institution
Grad. Sch. of Eng., UFRJ - Fed. Univ. of Rio de Janeiro, Rio de Janeiro
fYear
2009
Firstpage
668
Lastpage
673
Abstract
In order to evaluate information retrieval algorithms it is imperative to use a dataset as a test database. However, access to such datasets is often difficult and expensive, since building them is a time-consuming and costly task. This paper presents a collaborative approach to dataset creation that uses a data quality evaluation technique based on fuzzy theory, to assist users in selecting suitable Web documents for their datasets. These documents are automatically captured by a crawler and assessed on information derived from their metadata.
Keywords
Internet; fuzzy set theory; groupware; information retrieval; meta data; Web document; Web pages dataset; collaborative approach; data quality evaluation technique; fuzzy set theory; information retrieval; metadata; time-consuming; Algorithm design and analysis; Buildings; Collaborative work; Crawlers; Databases; Fuzzy logic; Information retrieval; International collaboration; Testing; Web pages; Cooperative Work; Data Quality; Dataset Building; Fuzzy Theory; Information Retrieval; Web Document Metadata;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Supported Cooperative Work in Design, 2009. CSCWD 2009. 13th International Conference on
Conference_Location
Santiago
Print_ISBN
978-1-4244-3534-0
Electronic_ISBN
978-1-4244-3535-7
Type
conf
DOI
10.1109/CSCWD.2009.4968135
Filename
4968135
Link To Document