• DocumentCode
    3248404
  • Title
    Automatic generation of natural language summaries for Java classes
  • Author
    Moreno, L.; Aponte, Jairo; Sridhara, Giriprasad; Marcus, Andrian; Pollock, Lori; Vijay-Shanker, K.
  • Author_Institution
    Wayne State Univ., Detroit, MI, USA
  • fYear
    2013
  • fDate
    20-21 May 2013
  • Firstpage
    23
  • Lastpage
    32
  • Abstract
    Most software engineering tasks require developers to understand parts of the source code. When faced with unfamiliar code, developers often rely on (internal or external) documentation to gain an overall understanding of the code and determine whether it is relevant for the current task. Unfortunately, the documentation is often absent or outdated. This paper presents a technique to automatically generate human-readable summaries for Java classes, assuming no documentation exists. The summaries allow developers to understand the main goal and structure of the class. The focus of the summaries is on the content and responsibilities of the classes, rather than their relationships with other classes. The summarization tool determines the class and method stereotypes and uses them, in conjunction with heuristics, to select the information to be included in the summaries. Then it generates the summaries using existing lexicalization tools. A group of programmers judged a set of generated summaries for Java classes and determined that they are readable and understandable, they do not include extraneous information, and, in most cases, they are not missing essential information.
  • Keywords
    Java; natural language processing; software engineering; system documentation; Java classes; automatic generation; essential information; external documentation; human readable summary; internal documentation; lexicalization tools; natural language summary; software engineering tasks; source code; stereotypes; summarization tool; Context; Documentation; Educational institutions; Java; Natural languages; Production facilities; Taxonomy; Source code summarization; documentation generation; program comprehension;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Program Comprehension (ICPC), 2013 IEEE 21st International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1063-6897
  • Type
    conf
  • DOI
    10.1109/ICPC.2013.6613830
  • Filename
    6613830