• DocumentCode
    3025449
  • Title

    Towards Automatic Discovery of co-authorship Networks in the Brazilian Academic Areas

  • Author

    Mena-Chalco, Jesús P. ; Cesar, Roberto M.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Sao Paulo, Sao Paulo, Brazil
  • fYear
    2011
  • fDate
    5-8 Dec. 2011
  • Firstpage
    53
  • Lastpage
    60
  • Abstract
    In Brazil, individual curricula vitae of academic researchers, that are mainly composed of professional information and scientific productions, are managed into a single software platform called Lattes. Currently, the information gathered from this platform is typically used to evaluate, analyze and document the scientific productions of Brazilian research groups. Despite the fact that the Lattes curricula has semi-structured information, the analysis procedure for medium and large groups becomes a time consuming and highly error-prone task. In this paper, we describe an extension of the script Lattés (an open-source knowledge extraction system from the Lattes platform), for analysing individuals Lattes curricula and automatically discover large-scale co-authorship networks for any academic area. Given some knowledge domain (academic area), the system automatically allows to identify researchers associated with the academic area, extract every list of scientific productions of the researchers, discretized by type and publication year, and for each paper, identify the co-authors registered in the Lattes Platform. The system also allows the generation of different types of networks which may be used to study the characteristics of academic areas at large scale. In particular, we explored the node´s degree and Author Rank measures for each identified researcher. Finally, we confirm through experiments that the system facilitates a simple way to generate different co-authorship networks. To the best of our knowledge, this is the first study to examine large-scale co-authorship networks for any Brazilian academic area.
  • Keywords
    knowledge acquisition; text analysis; Brazilian academic area; Lattes curricula; author rank measure; automatic discovery; coauthorship network; open-source knowledge extraction; Bibliometrics; Data mining; Databases; HTML; Production; Redundancy; academic areas; co-authorship networks; knowledge extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Science Workshops (eScienceW), 2011 IEEE Seventh International Conference on
  • Conference_Location
    Stockholm
  • Print_ISBN
    978-1-4673-0026-1
  • Type

    conf

  • DOI
    10.1109/eScienceW.2011.31
  • Filename
    6130731