DocumentCode
3025449
Title
Towards Automatic Discovery of co-authorship Networks in the Brazilian Academic Areas
Author
Mena-Chalco, Jesús P. ; Cesar, Roberto M.
Author_Institution
Dept. of Comput. Sci., Univ. of Sao Paulo, Sao Paulo, Brazil
fYear
2011
fDate
5-8 Dec. 2011
Firstpage
53
Lastpage
60
Abstract
In Brazil, individual curricula vitae of academic researchers, that are mainly composed of professional information and scientific productions, are managed into a single software platform called Lattes. Currently, the information gathered from this platform is typically used to evaluate, analyze and document the scientific productions of Brazilian research groups. Despite the fact that the Lattes curricula has semi-structured information, the analysis procedure for medium and large groups becomes a time consuming and highly error-prone task. In this paper, we describe an extension of the script Lattés (an open-source knowledge extraction system from the Lattes platform), for analysing individuals Lattes curricula and automatically discover large-scale co-authorship networks for any academic area. Given some knowledge domain (academic area), the system automatically allows to identify researchers associated with the academic area, extract every list of scientific productions of the researchers, discretized by type and publication year, and for each paper, identify the co-authors registered in the Lattes Platform. The system also allows the generation of different types of networks which may be used to study the characteristics of academic areas at large scale. In particular, we explored the node´s degree and Author Rank measures for each identified researcher. Finally, we confirm through experiments that the system facilitates a simple way to generate different co-authorship networks. To the best of our knowledge, this is the first study to examine large-scale co-authorship networks for any Brazilian academic area.
Keywords
knowledge acquisition; text analysis; Brazilian academic area; Lattes curricula; author rank measure; automatic discovery; coauthorship network; open-source knowledge extraction; Bibliometrics; Data mining; Databases; HTML; Production; Redundancy; academic areas; co-authorship networks; knowledge extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
e-Science Workshops (eScienceW), 2011 IEEE Seventh International Conference on
Conference_Location
Stockholm
Print_ISBN
978-1-4673-0026-1
Type
conf
DOI
10.1109/eScienceW.2011.31
Filename
6130731
Link To Document