• DocumentCode
    2909805
  • Title

    Predictive model representation and comparison: Towards data and predictive models governance

  • Author

    Makhtar, Mokhairi ; Neagu, Daniel C. ; Ridley, Mick

  • Author_Institution
    Sch. of Comput., Inf. & Media, Univ. of Bradford, Bradford, UK
  • fYear
    2010
  • fDate
    8-10 Sept. 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The increasing variety of data mining tools offers a large palette of types and representation formats for predictive models. Managing the models becomes then a big challenge, as well as reusing the models and keeping the consistency of model and data repositories because of the lack of an agreed representation across the models. The flexibility of XML representation makes it easier to provide solutions for Data and Model Governance (DMG) and support data and model exchange. We choose Predictive Toxicology as an application field to demonstrate our approach to represent predictive models linked to data for DMG. We propose an original structure: Predictive Toxicology Markup Language (PTML) offers a representation scheme for predictive toxicology data and models generated by data mining tools. We also show how this representation offers possibilities to compare models by similarity using our Distance Models Comparison technique. This work is ongoing and first encouraging results for calculating PTML distance are reported hereby.
  • Keywords
    XML; data mining; data structures; database management systems; XML representation flexibility; data and model governance; data mining tools; data repositories; distance models comparison technique; predictive model representation; predictive toxicology data; predictive toxicology markup language; Classification algorithms; Data mining; Data models; Numerical models; Predictive models; Toxicology; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence (UKCI), 2010 UK Workshop on
  • Conference_Location
    Colchester
  • Print_ISBN
    978-1-4244-8774-5
  • Electronic_ISBN
    978-1-4244-8773-8
  • Type

    conf

  • DOI
    10.1109/UKCI.2010.5625573
  • Filename
    5625573