DocumentCode
3764639
Title
Classifying web hierarchically using multi label tree classifier
Author
Daya Gupta;Harsh Tripathi;Mayukh Maitra
Author_Institution
Department of Computer Science and Software Engineering, Delhi Technological University, New Delhi, India
fYear
2015
Firstpage
1
Lastpage
6
Abstract
Classification and extraction of web content find applications in the semantic web, search, and information extraction. The first part of the paper deals with the problem of classifying web pages according to their content. Further, the paper presents a methodology to classify web pages hierarchically, in order to achieve topic-wise modeling of websites, using a multi-label tree classifier, a variant of classification where instances may belong to multiple classes at the same time. Data from an implementation of the multi-label tree classifier show marked improvements in processing multi-class classification in comparison to conventional hierarchical classification techniques.
Keywords
"Web pages","Training","Support vector machines","Feature extraction","Dictionaries","Multimedia communication","Classification algorithms"
Publisher
ieee
Conference_Titel
India Conference (INDICON), 2015 Annual IEEE
Electronic_ISBN
2325-9418
Type
conf
DOI
10.1109/INDICON.2015.7443337
Filename
7443337