Title :
Formation of the Characteristic Word Sets for the Optimization of Information Retrieval Processes
Author :
Voloshynovska, Iryna ; Andreychuk, Nadiya
Author_Institution :
Dept. of Appl. Linguistics, Lviv Polytech. Nat. Univ., Lviv
Abstract :
This paper suggests that principal component analysis can be applied to a corpus of scientific texts to determine the characteristic set of words that reveals the stage of scientific advance. The main words attributed to progress in applied and fundamental science are extracted, and the respective verb sets are formed. The possibility of applying these sets in information retrieval is discussed.
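The abstract does not give the procedure itself, so the following is only a minimal sketch of how principal component analysis could separate characteristic words in a small corpus. It is not the authors' method. The four toy sentences, the whitespace tokenizer, the use of relative word frequencies as features, and the function names `term_frequencies`, `first_component`, and `characteristic_words` are all assumptions made for illustration. The leading component is approximated by power iteration so that the example needs only the Python standard library.

```python
import math
import random
from collections import Counter


def term_frequencies(docs):
    """Return (vocab, rows); rows[i][j] is the relative frequency of vocab[j] in docs[i]."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({word for tokens in tokenized for word in tokens})
    rows = []
    for tokens in tokenized:
        counts = Counter(tokens)
        rows.append([counts[word] / len(tokens) for word in vocab])
    return vocab, rows


def first_component(rows, iterations=200, seed=0):
    """Approximate the leading principal axis by power iteration on X^T X."""
    n, d = len(rows), len(rows[0])
    # Center each word column so the component describes variation between documents.
    means = [sum(row[j] for row in rows) / n for j in range(d)]
    centered = [[row[j] - means[j] for j in range(d)] for row in rows]
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in range(d)]  # random start vector
    for _ in range(iterations):
        scores = [sum(x * w for x, w in zip(row, v)) for row in centered]  # X v
        v = [sum(centered[i][j] * scores[i] for i in range(n)) for j in range(d)]  # X^T (X v)
        norm = math.sqrt(sum(w * w for w in v))
        if norm == 0.0:
            raise ValueError("documents have identical word frequencies")
        v = [w / norm for w in v]
    return v


def characteristic_words(docs, top_k=3):
    """Return the loadings and the word sets at the two opposite ends of the first component."""
    vocab, rows = term_frequencies(docs)
    loadings = dict(zip(vocab, first_component(rows)))
    ranked = sorted(vocab, key=lambda word: loadings[word])
    return loadings, ranked[:top_k], ranked[-top_k:]


# Invented toy corpus (an assumption, not data from the paper):
# two "applied" sentences followed by two "fundamental" ones.
docs = [
    "we develop and test the device",
    "we build and test the sensor",
    "we derive and prove the theory",
    "we derive and explain the model",
]
loadings, low_end, high_end = characteristic_words(docs, top_k=1)
print(low_end, high_end)
```

On this toy corpus the two ends of the first component hold "test" and "derive", while words shared by every sentence ("we", "and", "the") receive loadings close to zero. Which end is positive is arbitrary, because the sign of a principal component is not determined. A real corpus would need proper tokenization, lemmatization, and part-of-speech filtering before any verb sets could be formed.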
Keywords :
information retrieval; natural sciences computing; optimisation; principal component analysis; text analysis; applied science; characteristic word sets formation; fundamental science; information retrieval process; optimization; scientific text corpus; Conducting materials; Data mining; Information analysis; Natural language processing; Performance analysis; Personal communication networks; Physics; Principal component; Semantic segmentation; Text corpus
Conference_Title :
CAD Systems in Microelectronics, 2007. CADSM '07. 9th International Conference - The Experience of Designing and Applications of
Conference_Location :
Lviv-Polyana
Print_ISBN :
966-533-587-0
DOI :
10.1109/CADSM.2007.4297663