• DocumentCode
    3318298
  • Title

    Summarising company announcements

  • Author

    Dale, Robert ; Lei, Li ; De Vries, Hugo ; Gardiner, Mary ; Tilbrook, Marc

  • Author_Institution
    Centre for Language Technol., Macquarie Univ., North Ryde, NSW, Australia
  • fYear
    2005
  • fDate
    30 Oct.-1 Nov. 2005
  • Firstpage
    651
  • Lastpage
    656
  • Abstract
    This paper describes work that attempts to use language technology as a solution to the problem of information overload. The specific domain of application is the database of company announcements accessible via the Web site of the Australian Stock Exchange: to meet regulatory requirements, over 100,000 documents a year are made available via this site, with only limited search facilities. We use a variety of techniques from language technology to make it easier to explore and manage the information in this data set. In this paper, we focus on our use of information extraction, which identifies and extracts important elements of information from a document, and text compaction, which applies linguistically motivated substitutions to reduce potential summary sentences to more compact forms. Together, these techniques provide a way of producing summaries of a significant proportion of the document base.
  • Keywords
    Web sites; document handling; financial data processing; information retrieval; natural languages; stock markets; text analysis; Australian Stock Exchange Web site; company announcements; information extraction; information overload; language technology; linguistically motivated substitutions; summary sentences; text compaction; Australia; Compaction; Data mining; Databases; IEEE news; Information management; Monitoring; Stock markets; Technology management; Web sites
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9361-9
  • Type

    conf

  • DOI
    10.1109/NLPKE.2005.1598817
  • Filename
    1598817