• DocumentCode
    1662862
  • Title

    Extracting Information about Security Vulnerabilities from Web Text

  • Author

    Mulwad, Varish ; Li, Wenjia ; Joshi, Anupam ; Finin, Tim ; Viswanathan, Krishnamurthy

  • Author_Institution
    Comput. Sci. & Electr. Eng., Univ. of Maryland, Baltimore, MD, USA
  • Volume
    3
  • fYear
    2011
  • Firstpage
    257
  • Lastpage
    260
  • Abstract
    The Web is an important source of information about computer security threats, vulnerabilities and cyber-attacks. We present initial work on developing a framework to detect and extract information about vulnerabilities and attacks from Web text. Our prototype system uses Wikitology, a general purpose knowledge base derived from Wikipedia, to extract concepts that describe specific vulnerabilities and attacks, map them to related concepts from DBpedia and generate machine understandable assertions. Such a framework will be useful in adding structure to already existing vulnerability descriptions as well as detecting new ones. We evaluate our approach against vulnerability descriptions from the National Vulnerability Database. Our results suggest that it can be useful in monitoring streams of text from social media or chat rooms to identify potential new attacks and vulnerabilities or to collect data on the spread and volume of existing ones.
  • Keywords
    Internet; security of data; text analysis; DBpedia; Web text; Wikipedia; Wikitology; chat rooms; computer security threats; cyber-attacks; general purpose knowledge; information extraction; national vulnerability database; security vulnerabilities; social media; text stream monitoring; vulnerability descriptions; Computer security; Data mining; Electronic publishing; Encyclopedias; Internet; entity linking; information extraction; security; vulnerability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
  • Conference_Location
    Lyon
  • Print_ISBN
    978-1-4577-1373-6
  • Electronic_ISBN
    978-0-7695-4513-4
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2011.26
  • Filename
    6040854