Title :
On the path to mine multi word expressions from Slovak web space
Author :
Telepovska, H. ; Baco, M. ; Genci, J. ; Olostiak, M.
Author_Institution :
Tech. Univ. of Kosice, Kosice, Slovakia
Abstract :
The paper presents an ongoing project aimed at documenting static and dynamic characteristics of the Slovak web space. The first part of the project aims to compile statistics on the number of second-level domains in the Slovak web space. Additional information about each domain is also processed, such as determining whether the domain is functional or dead, recording its IP address, and attempting to determine the period over which the domain's content changes. Moreover, the continuously generated data make it possible to present the dynamics of changes in the number of domains or in some of their attributes. In the second part of the project, we plan to focus on obtaining static and dynamic characteristics of the Slovak vocabulary: mapping the current vocabulary, tracking new words or phrases, and so on.
Keywords :
IP networks; Web sites; data mining; vocabulary; IP address; Slovak Web space; Slovak vocabulary mapping; Slovak website; change dynamics; dynamic characteristics; multiword expressions mining; static characteristics; Conferences; Databases; Dictionaries; Electronic learning; Partitioning algorithms
Conference_Title :
Emerging eLearning Technologies and Applications (ICETA), 2014 IEEE 12th International Conference on
Print_ISBN :
978-1-4799-7739-0
DOI :
10.1109/ICETA.2014.7107559